Archive for the ‘Foreign Languages’ Category

Folktales as digital picture-books in many languages

Saturday, September 15th, 2007

If you like traditional folktales, Digital EHON is the site for you. You can read folktales from Japan and elsewhere in the original language and translations into various other languages. Not only that, you can also download and listen to some of them in Japanese and English with pictures and text.

The audio picture-book players are only about 3MB each and download in seconds via DSL, but I get gibberish for the Japanese text. However, the Japanese text renders fine on the Web site itself.

Right now they have (at least in Japanese) tales with paintings from:

  • Japan (including some original stories)
  • Taiwan
  • Korea
  • Papua New Guinea
  • Mongolia
  • Mexico
  • Peru
  • Brazil
  • Bolivia
  • Sweden
  • Indonesia and
  • China.

The site has an index and at least some content in:

  • English
  • Japanese
  • Indonesian
  • Spanish
  • Norwegian
  • Swedish
  • Chinese
  • Korean
  • German
  • Italian
  • French and
  • Ami (a language of Taiwan’s indigenous Ami people).

EHON is a great place for multicultural/multilingual/translation fun.

Language note:
EHON is probably a play on words. Hon 本 is the Japanese word for “book.” Ehon 絵本 is “picture book” but could also be “E-book” (electronic book) with a change in pronunciation from “eh” to English “ee.”

Chinese (and Japanese) Internet slang

Friday, September 14th, 2007

You probably know about English speakers’ netspeak/IM speak/textspeak, such as LOL for laughing out loud (and for Australians, m8 for mate). If you’ve read my post Spanish texting / IM abbreviations, you know about Spanish speakers’ usage, such as l100to [l ciento to] for lo siento (“I’m sorry,” “im sry”).

Now it’s time for Chinese Internet slang, also known as “Martian language” (火星文 Huo3xing1wen2).

  • They use text as pictures, such as the orz of 3Qorz.
    • The orz is from Japanese Internet usage. The form orz represents a person in profile who is so disappointed over failure (or now in Taiwan and Mainland China: so awed by admiration) as to fall on his or her hands and knees. The “o” is the head; “r” is the the shoulders, back and arm; and “z” is the curves of the butt and bent legs (illustrations).
    • 3Q means “thank you.” San1 (三 “three”) + Q approximates the sound of “thank you.”
    • The 得 (de5) is a grammar particle that links the verb thank to its adverb manner phrase (“prostrate before someone,” “with great admiration”).
    • Thus, 3Q得orz represents 感谢得五体投地 (gan3xie4 de5 wu3ti3tou2di4, “thank you with great admiration”).
  • They use homophones and near homophones, such as the Arabic numeral 4 (四 si4, “four”) for 死 (si3, “to die”) or for 是 (shi4, “yes”).
  • They use characters that contain the intended character, such as 饿 (e4, “hungry”) for (wo3, “I”/”me”) because the former contains the latter as a phonetic element (in Ancient Chinese pronunciation), plus a meaning element of “food.”
  • They break up Chinese characters into separate pieces. Instead of the character (qiang2, “strong”), they type the two characters and (gong1, “archery bow” + sui1, “although”). Another example: instead of the character (ding3, “summit”), they write the separate characters and (ding1, “fourth in a series” + ye4, “page of a book”).

American parents mostly try to figure out their kids’ abbreviations and some numbers or combinations of characters substituting for letters (leetspeak such as 3 for E or “/\/\” for M). Chinese and Japanese parents must be really lost. Perhaps they’re falling prostrate in despair.

2007 International Linguistics Olympiad, USA’s first time

Tuesday, August 21st, 2007

The fifth annual International Linguistics Olympiad for high school students was held in St. Petersburg, Russia, earlier this month. I was hoping the U.S. would finally join in. It did and fielded two teams, one of which won the team competition. One member of the other U.S. team won the individual competition (results, participants).

U.S. National Science Foundation press release

North American Computational Linguistics Olympiad
(This year was the first, sort of the U.S. Olympic trials.)

See also my posts:
2006 International Linguistics Olympiad
Results: 2006 Linguistics Olympiad

U.K. vs. U.S. Harry Potter; French Potter dictionary

Saturday, July 21st, 2007

Given the complete lack of coverage by any print, television, or online source, you’re probably unaware that the seventh and final Harry Potter book, Harry Potter and the Deathly Hallows (Bloomsbury Publishing), went on sale today.

In honor of this pop-culture event, I’ll offer two Harry Potter language-related items.

1. A couple of fans did a line-by-line British/American text comparison of the first book, Harry Potter and the Sorcerer’s Stone [U.K. original: Harry Potter and the Philosopher's Stone], such as American:

  • “on [in] the next street”
  • “had gotten [had got] the new computer”
  • “almost bald except for his bangs [fringe]“
  • “make him and your father even [quits]“
  • “This needs [want] thinking about”

2. French fans who want to use Harry Potter for English learning or want to discuss the books with English-speaking fans can now get help from the Dictionnaire Harry Potter Anglais-Français, with over 3,600 English-to-French terms from Harry Potter volumes 1-6.

According to this article:

Thus “namby-pamby” means “gnagnan” in French and “nutter” becomes “cingle”, but notions from within the Potter-world such as “muggle” or “quidditch” are not listed.

Vive le muggle!

2007 Maori Language Week

Friday, July 20th, 2007

This year’s Maori Language Week in New Zealand will be July 23-29, 2007. The theme is “Tāpoi – Tourism.” It celebrates the language of the aboriginal people of New Zealand, a language with official legal status.

For those of us far from New Zealand, have a look at the 2002 movie Whale Rider. It has Maori actors and local Maori extras. It’s mostly in English, but there are some Maori (te reo Māori) words (taboo is tapu) and some subtitled exchanges.

There’s also a Wikipedia version in the Maori language.

See also last year’s post:
Maori Language Week

Spanish texting / IM abbreviations

Wednesday, July 18th, 2007

If you’ve ever wondered what b/c (because) type of typing would be in other languages, Oxford University Press has provided some Spanish ones for free in this 165 KB PDF (plus how to write an old-fashioned letter in Spanish).

Here are some of them:

  • atamñn: hasta mañana (["until tomorrow"] talk to you tomorrow, ttyt)
  • d+sia2 [de mas sia dos]: demasiados (too many, 2 many)
  • eia: ella (she/her)
  • gf [ge ("hay") efe]: jefe (boss)
  • j+ [j mas]: jamás (never)
  • kf [ka efe]: café (coffee/cafe)
  • l100to [l ciento to]: lo siento (I’m sorry, im sry)
  • mdr: muerto de risa (dying of laughter, rofl [rolling on the floor laughing])
  • msj: mensaje (message, msg)
  • na- [na menos]: nada menos (nothing much)
  • npn: no pasa nada (nothing’s going on)
  • q&d [cu and d]: cuando (when)
  • qay d nvo?: ¿qué hay de nuevo? (what’s new?)
  • s3ad [es tres ad]: estresado (stressed)
  • stas bn? [es tas be ene]: ¿estás bien? (you O.K.?, u ok?)
  • t&q: te quiero (I love you, i luv u)
  • taptec1kf? [te a pe te ce uno ka efe]: ¿te apetece un café? (do you want coffee?)
  • x: por (for, 4)
  • xfa: por favor (please, pls)
  • xq: porque (because, b/c [or bc])

See also (English-language texting abbreviations):

Techdictionary.com
Macmillan English Dictionary

Free audio phrase books for phones

Friday, July 6th, 2007

Just under a year ago (Mobile words) I wrote about Coolgorilla’s free audio phrase books for iPods. Those are now free with a purchase of something from travel/tourism site lastminute.com. But the new version, Talking Mobile Phrase Books, for cellphones/mobile phones is completely free for now at least (also sponsored by lastminute.com).

You can download French, German, Greek, Italian, Portuguese, and Spanish. They’re planning on adding Chinese, Japanese, Polish, Swedish, and other languages in the future.

You don’t have to be able to pronounce the foreign language; just find the expression you want and let the phrase book say it.

Press release

Thanks to Roy Forsdick, Managing Director, iDev Entertainment (apparently Coolgorilla is their brand) for informing me about this.

Annoying Internet words?

Tuesday, July 3rd, 2007

Do recent Internet terms bother you? They seem to bother some. The British polling organization Yougov (site) reportedly found that the most annoying Internet-related term is folksonomy.

Yougov poll

Other irritating terms from the Yougov poll were blogosphere, blog, netiquette, blook (book from a blog, such as Language Log‘s), webinar, vlog (video blog), social networking, cookie, wiki, podcast, avatar, and user-generated content. Yougov polled 2091 adults in June 2007 on behalf of the Lulu Blooker Prize, an award for blooks.

A folksonomy is a “folk taxonomy,” or a classification system done by anyone and without any agreed-upon hierarchy or definitions. Description tags on Flickr photos or Youtube videos are examples. I think it’s an inelegant word and of little use. Even if you’re a huge fan of the collaborative Wikipedia, imagine it with no editorial oversight at all. That’s a folksonomy.

On Youtube some people write sentences that get broken up into single-word tags including the words is or that. I just did a search for “is” on Youtube and got the word is in the titles and descriptions (but seemingly not tags) of 6,110,000 videos. Interestingly, the first video (embedded below) was on Web 2.0, another annoying word for some.

Link to video

Switched.com poll

Switched is running its own poll of annoying words with Web 2.0, the verb to Google, podcast, internets, as well as the pronunciations “dubdubdub” for www and “earl” for URL (thanks to Jake the outreach person for Switched).

Some of these words just take getting used to, such as wiki, cookie, or even blog. However, I don’t like vlog because “vl-” just isn’t part of English phonology (the system of sounds and sound combinations). I wish it were V-blog. But then again, I can say the Russian name “Vladimir” and “Vlad the Impaler” (the human Dracula), so I guess I’ll get used to that, too.

I don’t like podcast because it makes no sense. Nobody broadcasts to iPods or to any other mp3 player. After the recording is over, the file goes up on a Web site with a syndication feed. Then if you are a subscriber to the feed, the file downloads to your computer. You can then listen to it on your computer. You can also move the file to your player, but you don’t have to.

“Earl” for “U-R-L” doesn’t bother me. I can see it catching on. Actual (approximate) conversation from 1997 or 1998 between a coworker (no, Brits, that’s not “cow-orker”) and me:

Coworker: What’s an earl?

Me: (surprised) It’s like a duke or a count.

Coworker: (laughs) No, U-R-L.

Me: Oh, that’s U-R-L. I think it’s “universal resource locator” [it's actually "uniform"]. It’s an Internet address.

Coworker: The computer has some message in Japanese with “URL” in it.

The message was “URL cannot be displayed” or some such.

I can’t say I’m a fan of “dubdubdub” for “double-u double-u double-u.” I can see that it’s easier to say. The Tonight Show‘s Jay Leno was telling people to send in funny headlines to “ww.nbc.com/tonightshow.” Now he just says “nbc.com/tonightshow” (which for some reason goes to the trouble of redirecting to the longer http://www.nbc.com/The_Tonight_Show_with_Jay_Leno/). However, why are we even bothering putting “www” in URL addresses? It isn’t necessary. Even if we do use “www” in URLs, it will usually get filled in without typing anyway.

On a side note and back to Japan, in Japanese they use the letter W to represent the borrowed word daburu (“double”), such as W chiizubaagaa (“double cheeseburger”).

What Internet words are bothersome to you?

Youtube goes multilingual

Friday, June 22nd, 2007

Last week I mentioned the rumor that Youtube might launch foreign versions (Flickr, possibly Youtube go multilingual). This week they did just that, with versions in the UK, Ireland, Brazil, France, Italy, Japan, the Netherlands, Poland, and Spain. Oddly, there’s no Germany yet.

Eventually, the sites will become more localized via “country-specific video rankings and comments, as well as Video, Channel, Categories and Community sections.”

Here is the US site (www.youtube.com), with some relevant terms: (more), (less), From, Views, Added, Time, [tags, category, ratings]

Here are the new sites and some relevant terms, which are mostly like the UK site:

  • UK (uk.youtube.com): (more), (less), Tags, Added, Category, From, Views, ratings
  • Ireland (ie.youtube.com): (more), (less), Tags, Added, From, Views, ratings
  • Netherlands nl.youtube.com): (meer), (minder), Labels, Toegevoegd, categorie, Van, Aantal keren bekeken, beoordelingen
  • Brazil (br.youtube.com): (mais), (menos), Palavras-chave, Adicionado em, categoria, De, Exibições, avaliações
  • France (fr.youtube.com): (suite [continuation]), (moins), Tags, Ajoutée, Catégorie, De, Vues, avis [opinion]
  • Italy (it.youtube.com): (altro), (meno dettagli), Tag, Aggiunto, categoria, Da, Visualizzazioni, voti
  • Spain (es.youtube.com): (más), (menos), Etiquetas, Añadido, categorí­a, De, Reproducciones, puntuaciones [scores]
  • Poland (pl.youtube.com): (więcej), (mniej), Tagi, Dodano, kategorii, Od, Liczba wyświetleń, Oceny
  • Japan (jp.youtube.com): (続き), (一部表示), タグ , 追加 , カテゴリ ‚ 投稿者 , 再生回数 , 評価
  • Japan (jp.youtube.com): (tsuzuki [continuing]), (ichibu hyouji [portion display]), tagu ["tag"], tsuika [addition], kategori ["category"], toukousha [contributor], saisei kaisuu [playback number-of-times], hyouka [judgment]

Welcome to easier navigation, lots of the world.

Flickr, possibly Youtube go multilingual

Friday, June 15th, 2007

If you’re not an English speaker and want to post and view photos online on Flickr (owned by Yahoo), it’s about to get easier. Flickr is adding traditional Chinese (Taiwan, Hong Kong, Macau), French, German, Italian, Korean, Portuguese, and Spanish versions.

Meanwhile, video-sharing site Youtube (owned by Google) may add Chinese and French versions.

It’s nice having everyone together on one site, but people all over the world shouldn’t have to use English. It will probably also help with bandwidth to parcel the world out over several sites.