Avatar
I'm Drew Breunig and I obsess about technology, media, language, and culture. I live in New York, studied anthropology, and work in advertising technology.

These are reactions to things I feel are important.

Follow me on Twitter.

Likes

Posts tagged language

When will legal writing become a programming language?

New York Magazine reports on an interesting development in computer/human relations:

The Southern District of New York recently became the nation’s first federal court to explicitly approve the use of predictive coding, a computer-assisted document review that turns much of the legal grunt work currently done by underemployed attorneys over to the machines.

Last month, U.S. Magistrate Judge Andrew J. Peck endorsed a plan by the parties in Da Silva Moore v. Publicis Groupe — a sex discrimination case filed against the global communications agency by five former employees — to use predictive coding to review more than 3 million electronic documents in order to determine whether they should be produced in discovery, the process through which parties exchange relevant information before trial.

Analysts expect decisions like this to open the door for an eruption of computer analyzed legal work.

A few questions:

  1. How long will it be before lawyers are explicitly trained to write in a way which will be favorably interpreted by software?
  2. After this happens, how long will it be before legal writing evolves into a scripting language, more code to be compiled than words to be understood by humans? When will the first O’Reilly book be published for this language?
  3. Will computer/human standards emerge for other fields or discourses? SEO copywriting is already on its way. What other fields might follow suit? Sports journalism?

Would love to see languages spoken online overlaid. (Via The Economist)

Badgeville mobilizes gamification with mobile SDK
GigaOm’s headline just hit a buzzword bingo.

The Department of Homeland Security Is Searching Your Facebook and Twitter for These Words 

Joel Johnson’s been at ANIMAL, for what, a month and he’s already commissioned a Jeremy-Lin-In-Space mural on my block and turned our attention to this:

The Department of Homeland Security monitors your updates on social networks, including Facebook and Twitter, to uncover “Items Of Interest” (IOI), according to an internal DHS document released by the EPIC. That document happens to include a list of the baseline terms for which the DHS–or more specifically, a DHS subcontractor hired to monitor social networks–use to generate real-time IOI reports.

Click through for the giant list. Odds are you’re in their database several times.

Hilariously, “social media” and “China” are on the list. Imagine the noise on those. (Also “2600”, which is completely absurd. Our federal government thinks said publication is as illicit as my friends and I thought in junior high.)

Now begins the wait for two apps:

  1. A website that scans your Twitter & Facebook feed and estimates how many times you occur in the database and why.
  2. A Twitter app or app plugin that hashes any of these terms into unique keys for publishing and decrypts them for reading.

(Via ANIMAL)

First, Thibodeau and Boroditsky asked 1,482 students to read one of two reports about crime in the City of Addison. Later, they had to suggest solutions for the problem. In the first report, crime was described as a “wild beast preying on the city” and “lurking in neighbourhoods”. After reading these words, 75% of the students put forward solutions that involved enforcement or punishment, such as calling in the National Guard or building more jails. Only 25% suggested social reforms such as fixing the economy, improving education or providing better health care

Not Exactly Rocket Science describes a Stanford experiment illustrating the power of metaphors and language.

The words you use matter.

140 Characters Go Farther in China

James Fallows, covering Jeremy Lin’s reception in China, quotes a tweet:

On Feb. 12, Mao Maozi, a cameraman with the state-run Shanghai Education Television network, tweeted an answer to that question on Sina Weibo: “If Jeremy Lin lived on the mainland, he would either be a semi-literate CBA [Chinese Basketball Association, China’s state-run professional league] player or an ordinary undergraduate who likes basketball in his spare time. We admire him not because he is an ethnic Chinese, but because he has proved for a fact that the main reason that Chinese don’t play basketball well is because of the system, and not their physique!”

And, Yes, for the record, that’s all one tweet! The writing system of the Chinese language has its drawbacks, but one of the pluses is that with 140 characters you can say a whole lot more in Chinese.

I’m really enjoying the linguistic quirks and negotiations as exported technology encounters methods of communication which their interfaces never thought to consider. China is nearly always a participant, as it’s scale cannot be ignored. As more and more locally designed tech is exported and more and more Chinese citizens explore the bounds of the web, the friction and workarounds will be fascinating to watch. The Economist notes:

More than 300m Chinese internet users have at least one microblog account, and some use virtual private networks (VPNs) to get around the infamous “great firewall” of China. The Chinese government is being dragged, click by click, out of its cone of silence.

(Fallows via Matt Yglesias)

Mountain Lion in China 

Jason Snell on the big updates for Apple’s soon-to-be largest market:

Mountain Lion will offer better suggestions and corrections via a dynamically updated dictionary, something an Apple representative told me was because Chinese word usages are evolving rapidly. Apparently English words are often inserted in Chinese text, so Mountain Lion allows the mixing of Pinyin and English without switching between keyboard layouts. Apple says Mountain Lion also doubles the number of characters recognized by trackpad-based handwriting recognition.

On the Internet services side, Mountain Lion offers support for Chinese alternatives to several worldwide services. Search-engine Baidu is now an option in Safari. Chinese microblogging service Sina weibo is supported in Share Sheets, just as Twitter is. In addition to Vimeo and Flickr, Mountain Lion will support sharing to Chinese video-sharing sites Youku and Tudou. And Mail, Contacts, and Calendar syncing will be supported to Chinese service providers QQ, 126, and 163.

Implementing text support in Lion must be linguistically fascinating and head-smashingly complex.

On the Language of Marriage 

Nick Kam dives into the Perry v. Brown opinion and discovers “the law clears at the 9th Circuit are having too much fun”:

We need consider only the many ways in which we encounter the word “marriage” in our daily lives and understand it, consciously or not, to convey a sense of significance. We are regularly given forms to complete that ask us whether we are “single” or “married.” newspapers run announcements of births, deaths, and marriages. We are excited to see someone ask, “will you marry me?”, whether on bended knee in a restaurant or in text splashed across a stadium Jumbotron. Certainly it would not have the same effect to see “will you enter into a registered domestic partnership with me?”

Groucho Marx’s one-liner, “marriage is a wonderful institution… but who wants to live in an institution?” would lack its punch if the word “marriage” were replaced with the alternative phrase. So too with Shakespeare’s “A young man married is a man that’s marr’d,” Lincoln’s “marriage is neither heaven nor hell, it simply purgatory,” and Sinatra’s “A man doesn’t know what happiness is until he’s married. By then it’s too late.” We see tropes like “marrying for love” versus “marrying for money” played out again and again in our films and literature because of the recognized important and permanence of the marriage relationship. Had Marilyn Monroe’s films been called How to Register a Domestic Partnership with a Millionaire, it would not have conveyed the same meaning as did her famous movie, even though the underlying drama for same-sex couples is not different. The name “marriage” signifies the unique recognition that society gives to harmonious, loyal, enduring and intimate relationships.”

There is of course nothing new about fears accompanying the emergence of a new communications technology. In the fifteenth century, the arrival of printing was widely perceived by the Church as an invention of Satan, the hierarchy fearing that the dissemination of uncensored ideas would lead to a breakdown of social order and put innumerable souls at risk of damnation. Steps were quickly taken to limit its potentially evil effects. Within half a century of Gutenberg’s first Bible (1455), Frankfurt had established a state censorship office to suppress unorthodox biblical translations and tracts (1486), and soon after, Pope Alexander VI extended censorship to secular books (1501). Around 400 years later, similar concerns about censorship and control were widespread when society began to cope with the political consequences of the arrival of the telegraph, the telephone, and broadcasting technology. The telegraph would destroy the family and promote crime. The telephone would undermine society. Broadcasting would be the voice of propaganda. In each case, the anxiety generated specifically linguistic controversy. Printing enabled vernacular translations of the Bible to be placed before thousands, adding fuel to an argument about the use of local languages in religious settings which continues to resonate today. And when broadcasting enabled selected voices to be heard by millions, there was an immediate debate over which norms to use as correct pronunciation, how to achieve clarity and intelligibility, and whether to permit local accents and dialects, which remains as lively a debate in the twenty-first century as it was in the twentieth.
From David Crystal’s Language and the Internet, 1997

“Content” Creep

Publishers have stopped referring to their products as journalism, writing, literature, photography, or art. Today, everything is simply “content”.

Those working in media, especially digital media, can attest to the word’s popularity. Quantitatively we can observe this trend in the chart above: over the last ten years, annual financial reports from The New York Times have leaned more and more heavily on the word “content” when describing their business. The share of “journalism” has remained relatively flat.

The NYT and other publishers rely on the word “content” to help them understand the breadth of their output. But by reducing their writing, photography, videos and more to a single, nondescript term they’re setting themselves up for failure.

The Rise of “Content”

“Content” emerged with the rise of the Internet, which detached pieces of work from their primary media. Before the web, we referred to works by the media format which delivered them: as newspapers, magazines, paintings, photographs, records, CDs, and so on. As digital representations grew in popularity these monikers became increasingly awkward. Is a newspaper still a “paper” when the majority of its readers view it on screens? To abate this awkwardness, we began to search for a more apt term. We landed on “content”, a bucket term which we ask to describe anything a publisher could publish, from the most revelatory art to the most hackneyed rags.

At this point, “content” was an innocent, sloppy fix. A stopgap until the Internet settled down and a proper term could be coined. Unfortunately, the pace of innovation quickened and today language is unable to keep pace with rapidly emerging new ideas, art1, and businesses.

So we’ve stuck with “content”.

The Assumptions & Allure of Content

To achieve its representative breadth, the word “content” makes two assumptions:

  1. Each piece of “content” is equal and is therefore interchangeable: As stated earlier, “content” is used to represent a wide breadth of works. A Pulitzer winning report and a Business Insider slideshow are both single instances of “content.” The word must remain formless, devoid of emotion, and of indefinite form and quality. Any characteristic which might differentiate two works must be ignored. This rhetoric categorization gives rise to the second assumption.
  2. “Content” production is trivial: Since each bit of “content” is interchangeable, “content” is only as hard to create as the easiest instance.

Publishers buy into these two assumptions because “content” allows them to easily measure and analyze their output. Messy qualitative measures are hidden so output fits neatly within Excel cells. This is the allure of “content”: it allows comforting, structured data which simplifies the complexity of a large business and makes decisions less intimidating. Executives aren’t making qualitative picks regarding art or an artist, they’re merely signing off on whichever “content” produces more valuable metrics.

At The New York Times it’s conceivable that editors and executives have a handle on their output. But businesses with strategies dependent on massive levels of “content” production cannot know the quality of everything that ships. Think YouTube, where users upload more than an hour of video every hour. Or content farms like Demand Media, which claims to have created 2 million articles and 200,000 videos as of June 2010.

Read more

Next page Something went wrong, try loading again? Loading more posts