Right now, I’m in a class called Field Methods. The goal of this class is to describe (at least in part) a language, using information obtained by working with a native speaker of the language in question. It’s very interesting, both for the language, and for the experience of getting data from a speaker. However, I’ve also found it hugely entertaining, in that some of the sentences we elicit (ask for translations of) are completely absurd and quite funny.

Eliciting Data

Now, there are two ways to go about this. Preferable is to have the speaker tell you a story or narrative, which you can then go through and analyze line by line. This provides good, natural speech, and also lets you see a variety of constructions as used in real life.

The other option is to elicit translations of individual sentences. This seems to be the way that most language description begins. First, you ask the speaker how to say, for instance, “sheep”, and then ask them how one would say “I saw a sheep”. From there, you might ask how to say “You saw the sheep”, and keep slightly modifiying the sentences until you start to get enough data to do more complex analysis.

However, there are times where you want to figure things out, but don’t want to wait for them to occur in a narrative. When you’re fishing for certain grammatical forms and slowly making sentences more and more complex, the sentences look less and less plausible, and usually end up seeming quite funny, no matter the language.

The road to absurdity is paved with grammatical intentions

On one afternoon, we started with a rather normal transitive (has both an agent and a patient) sentence “He hit you”, then changed to “I hit you”. Then, we decided to look into verb Tense (timeframe) and Aspect (defining this is a whole post of its own). So, we went to “I hit you this morning”, still with good intentions, and then “I hit you many times this morning”. “I hit you last year” was next, followed by “I used to hit you last year”.

Then, things developed a more threatening tone. Looking to see if the future tense acted any differently, we asked our speaker how one might say “I will hit you”. From there, we asked for “In the future (but not now), I will hit you many times”. Then, “In the future, I will be hitting you regularly”, and finally, “In the future, I will be hitting you (not just once, but many times), regularly”.

At that point, we realized that we’d gotten a tad absurd, and went back to more normal subject matter (“I’ll be seeing you regularly”).

Sometimes, we just hop right to crazy

However, there’s not always a buildup. Sometimes, in the heat of the linguistic moment, we’ll stumble upon a certain contruction and want to substitute another noun or word, to see if it still works or if it changes the sound system. These can be truly wonderful.

Through this process, we’ve ended up with the rather disturbing “Sell me to him [the sheep]“, the slightly creepy “This is indeed my female sheep here”, the prophetic “Tomorrow, you WILL see vultures”, and the polygamous “the young man will marry all these women”.

Also, sometimes, you’ll want to test certain noun-forming suffixes. For instance, we we were given the word for “bad man”, and naturally, we wanted to know how to say “bad sheep”. There, we went to “the bad sheep made the kids drink alcohol yesterday”, and then ended up discussing a very bad wild boar.

Be careful what you say

Even with phonetics training and several years of language study under our proverbial belts, we can still mispronounce things. Usually, this just makes the sentence unintelligible to the speaker, but some times, we can mess up for comedic gold. For instance, in the language we’re studying, “ai go: fu:” means “I am at home”. When I said it back to the speaker, I misspoke and said “ai ga fu:”, which, after a bout of laughter, he translated as “I’m going to fart”. Although funny enough in a classroom, I’ve no doubt that these sorts of errors have caused more than their share of embarrassment, and maybe even a fight or two.

So, although linguistics is a serious discipline, the actual study process isn’t always completely serious. We manage to have some laughs, even while picking apart unfamiliar grammars, and I think that’s really one of the best parts of the job.

There are other perks, too. Although I’ve not had the occaision to use it yet, it is rather nice to know how to call somebody a “bad sheep” in Zarma.

Tagged with Conventional Linguistics, Language Humor, Language Usage | 4 Comments


So, I’m somewhat obsessed with checking the statistics of who comes here, who gets referred from where, and what search terms they used to find me. Well, the other day, somebody came here from google searching for “IPA translation widget”. For those of you unfamiliar with the terms, a “widget” is a small program written for Apple’s Dashboard interface, and IPA refers to the International Phonetic Alphabet. What this person seems to be wanting was a widget that, like some existing translation widgets, could take a block of text and immediately turn it into IPA characters. For the first few moments, I thought “Wow! That’d be a great idea!”.

Now, as somebody who uses the IPA very, very frequently, such a thing would be wonderful if it worked well. However, I think it would be impossible to actually create a program that goes from English writing to IPA transcriptions without incredible advances in Artificial Intelligence and speech recognition. Here’s why…

Transcription, not translation

At the surface, this doesn’t seem so crazy. Apple includes a widget to do rough, automated translations with Dashboard, and although I never trust automated translations, it does alright for basic words and phrases. I suspect that our anonymous searcher saw that widget and thought “Wow, cool! I wonder if it can help me put something into the IPA”. However, the fundamental difference between translating a sentence into Spanish and putting that same sentence into the IPA is that the IPA isn’t really a language at all, but instead, it’s a method of writing sounds.

The International Phonetic Alphabet is really a set of symbols, each of which represents a sound, sound characteristic, or other element of spoken language. What the IPA allows a linguist (or speech pathologist, or teacher…) to do is to take spoken language and put it onto paper (‘transcription’) with a great deal more precision than most other writing systems. The IPA isn’t a language in itself, it’s just an alternative, phonetic writing system for other languages. The beauty of this is that the IPA is designed to be able to be used not just for English, but for any language. The IPA symbols can be used to transcribe sounds not just from English, but from languages all over the world.

Broad vs. Narrow Transcription

The IPA can be used to transcribe sounds with two different degrees of precision.

If one takes advantage of all the symbols and diacritics, one can make a “narrow” or “phonetic” transcription. At this level, the linguist aims to capture all the detail possible about the word or phrase, including variations across word boundaries, sounds that occur in speech but are unnoticed or unrecognized by native speakers, and even features like intonation and pauses. From these transcriptions, a well-trained linguist could pronounce the words and phrases almost exactly as the speaker did, based simply on the transcriptions. The first, smallest line in the title graphic is a narrow transcription of me pronouncing the site’s title.

This degree of precision would be impossible for a modern computer widget to produce, simply because narrow transcriptions are based on actual words and phrases by a speaker, and really, one needs a fairly trained ear to make an accurate narrow transcription of a word or phrase. Sure, it could use a database of narrowly transcribed words from other speakers, but really, that’s not a narrow transcription. It’s not going to pick up on the variations that each speaker produces, like accents, vowel changes, unusual sound choices, or even tiny speech errors.

The alternative is called “broad” or “phonemic” transcription, expresses the basic sounds of a language or phrase, often more precisely than the native writing system, but at the same time, leaves out detail that’s not necessary to a native speaker. The middle line in the title graphic for this page is a phonemic transcription. Some dictionaries, including the built in OS X dictionary (if you enable IPA in Dictionary Preferences), can show you the standard american IPA Broad transcription form of a word.

Now, using a dictionary of words in a given language and their IPA equivalents, a computer could likely match things and give a passable broad transcription. However, there are variations that occur between people that show up even at a broad level, and are large enough to identify a speaker’s accent, dialect, or even idiolect. For some people (myself included), “caught” and “cot” have the same vowel, but for others, they’re two distinct vowels. So, even at a broad level, you’re not going to get any sort of reliable transcription of one’s actual speech from a computer widget, just a rough approximation.

Why are you transcribing anyways?

In the end, whether such a widget would be useful at all boils down to your reason for needing a transcription. Some people might be learning English and would want a better method of knowing how a given word is supposed to sound. For that, any good dictionary’s pronunciation key should do the trick.

Some people might be interested in the IPA, or want to know how a given word sounds. For that, they’d be better off getting a good phonetics textbook and learning a bit of the IPA themselves, along with some knowledge of phonetics.

However, our widget searcher might just be stuck in an introductory Linguistics course, having to transcribe their speech for an assignment. If so, I offer just one piece of advice: Don’t plagarize transcriptions off the web or from a dictionary. Your professor should have no trouble noticing if you’re not transcribing your own dialect, and everybody’s got a dialect.

Remember, if there’s one thing that phonetics professors are good at, it’s picking out a phone-y.

Tagged with Computers and Software, Conventional Linguistics, Phonetics and Phonology, Translation and Translation Theory | 31 Comments


Several days back, I posted a long review and commentary about Best Western Ruby’s Inn.

Well, I wanted to let you all know that the manager of Ruby’s Inn has chimed in in the comments, with a partial explanation and a partial apology. As a result, I’ve made a factual correction in the initial post (according to the manager, I’d misremembered the tax as 20% instead of 11%, and apparently the kiosks were 20 cents per minute), and replied to some of his assertions.

Although, as I say in the thread, I still can’t really recommend the hotel, I think it’s important to hear both sides, so I encourage you to check out the comments on the post and hear what Chris had to say.

As always, commenting is encouraged, and I appreciate when people do. Any reader feelings, whether they be praise, corrections, complaints or counterarguments are always welcome.

Tagged with Followups, Site News | Leave a Comment


Site Information

  • Categories

  • Latest Non-linguistic Posts

  • Archives

  • Site features