tag:blogger.com,1999:blog-77800154597524874662024-02-07T19:45:16.090-05:00Descriptively AdequateEd Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.comBlogger55125tag:blogger.com,1999:blog-7780015459752487466.post-59079326227080831762013-01-21T18:44:00.001-05:002013-01-21T18:44:33.328-05:00anaphoric clogsin the past week i have seen two different signs, in two restrooms separated by several hundred miles, one typed and one handwritten, that both said:<br />
<blockquote class="tr_bq">
Do not put paper towels in the toilet!!!<br />It clogs them up.</blockquote>
wait, what? that should be the other way around: <i>They clog it up</i>. the paper towels are the clog-ers, not the clog-ees. how did such a complete mix-up escape from the pens of two people who are presumably competent speakers of English, if not native speakers?<br />
<br />
it requires some charitable grammatical analysis, but i think there could be a few contributing factors. first, <i>it</i> is probably intended as a propositional anaphor, referring to the act of paper-towel-putting. one way to render the intended meaning is <i>It clogs it up</i>, which is a little confusing, since there are two occurrences of <i>it</i>—one propositional and one referential. this is probably the moment during the composition of this sentence when panic set in, and—with a gentle semantic push—the second anaphor became <i>them</i>. despite the unambiguously singular antecedent <i>the toilet</i> in the previous sentence, what the sign-writer means to convey is that paper-towel-putting leads to toilet-clogging in general. that generic reading involves (potentially) multiple toilets, and hence <i>them</i>.<br />
<br />
the misfortune lies in the fact that <i>it clogs them up</i> is such a short sentence that the reader can latch on to <i>them</i> before they've even decided that <i>it</i> should be a propositional anaphor. in that case, both get interpreted referentially, with the strange consequence that paper towels are being clogged up by toilets. that would be quite the plumbing problem.Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com2tag:blogger.com,1999:blog-7780015459752487466.post-78932936434643694102012-05-21T13:46:00.002-04:002012-06-14T11:04:54.509-04:00what's in a language's name?a few weeks ago we held the 7th edition of the <a href="http://conf.ling.cornell.edu/SULA7/index.html" target="_blank">Semantics of Semantics of Under-Represented Languages in the Americas</a> conference at Cornell. we had a great bunch of speakers, working on a wide range of languages. besides the variety, i was very interested in the way that languages were represented in the program. issues of identity and linguistic and cultural preservation are of major concern for most native communities. as a result, there's a trend towards using speakers' preferred name for their language. frequently this is an <i>endonym — </i>the name of the language in the language itself.<br />
<div>
<br /></div>
<div>
i decided to compile a list of all the languages represented in talks and posters and see how far this trend has gone. i was a little surprised how few endonyms there were, although some of them jump off the page. in the list below, the name on the left is how the language appeared in the program; on the right is an alternate name. endonyms are italicized.<br />
<div>
<br />
<table><tbody>
<tr><td>Yucatec</td><td><i>Màaya t'àan</i></td></tr>
<tr><td>Tseltal</td><td><i>Batźil Ḱop</i></td></tr>
<tr><td>Navajo</td><td><i>Diné bizaad</i></td></tr>
<tr><td>Blackfoot</td><td><i>Siksika</i></td></tr>
<tr><td>Karitiana</td><td>------</td></tr>
<tr><td>Cheyenne</td><td><i>Tsėhesenėstsestotse</i></td></tr>
<tr><td><i>Tlingit</i></td><td>------</td></tr>
<tr><td><i>Ktunaxa</i></td><td>Kootenai</td></tr>
<tr><td><i>Nsyilxcen</i></td><td>Colville-Okanagan</td></tr>
<tr><td><i>Ka'apor</i></td><td>Urubú</td></tr>
<tr><td><i>Mapudungun</i></td><td>Mapuche</td></tr>
<tr><td>Mi'kmaq</td><td><i>Lnuismk</i></td></tr>
<tr><td>Yauyos Quechua </td><td>------</td></tr>
<tr><td>Inuktitut</td><td>Eskimo</td></tr>
<tr><td><i>Skarù:rę</i>'</td><td>Tuscarora</td></tr>
<tr><td><i>Q'anjob'al</i></td><td>------</td></tr>
<tr><td>Guaraní</td><td><i>avañe'ẽ</i></td></tr>
<tr><td><i>Nɬeʔkepmxcín</i></td><td>Thompson</td></tr>
<tr><td>Saanich</td><td><i>SENĆOŦEN</i></td></tr>
<tr><td><i>Mbyá</i></td><td>------</td></tr>
<tr><td>Nez Perce</td><td><i>Niimiipuutímt</i></td></tr>
<tr><td>K'ichee'</td><td><i>Qatzijob'al</i></td></tr>
</tbody></table>
<br />
<div>
aside from Blackfoot and Nez Perce, the SULA linguists didn't use any names that clearly came from Euro-Americans naming other peoples or languages. (i'm fairly sure that in these two cases, the Euro names have been claimed by the associated groups, and are considered standard and non-deprecating.) there are some such names that seem over the line for 21st century use. when constructing a LING 101 phonology problem set once, we had to turn to Wikipedia to find a less awful-sounding name for Swampy Cree; the endonym is Omaškêkowak.</div>
</div>
</div>
<div>
<br /></div>
<div>
nevertheless, just shy of half of the presenters used the endonym for their language of study. some of them are on beyond tongue twisters for native English speakers (i'm looking at you, Nɬeʔkepmxcín). some seem downright otherworldly. i'm considering using a selection of these in an exercise for my freshman seminar on constructed languages. if i give my students an unannotated list of language names, and ask which are natural languages (not telling them that they <i>all</i> are), i wonder which ones they will peg as constructed? i feel like Q'aanjob'al may trick them along the lines of space opera, and the all caps neo-orthography of SENĆOŦEN and the very non-English consonant clusters of Lnuismk could be a trap too.</div>
<div>
<br /></div>
<div>
keep in mind that this is not limited to Native American languages. much more familiar languages have strange endonyms that could trip up even well-educated English speakers. some examples would be Euskara, Suomi, and Cymraeg. (those are Basque, Finnish, and Welsh, respectively. all European languages, all of which i didn't know the endonyms for until after i became a linguist). the question is whether they would just be unfamiliar, or actually make people believe they were constructed. i'm not teaching my course again until next spring, but when i do i'll report back with results. if anyone else wants to try something similar in the meantime, i'd be interested in hearing how it goes.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com6tag:blogger.com,1999:blog-7780015459752487466.post-40335061355824211812012-01-11T12:27:00.003-05:002012-01-11T12:43:44.654-05:00'only' peeving on the comicsi read all of my daily comic strips online now. one of the serious downsides to this is that gocomics.com has a comments thread(!) on every single strip that they post. they don't generate the same type of bottom-dwelling stuff as youtube comments, but they are some of the most mirthless places on the internet. nothing is worse than going all <a href="https://twitter.com/KarlVanHoet" target="_blank">Van Hœt</a> on something that's just supposed to be harmless fun. so i wasn't surprised, but still baffled when i saw this response to a Frazz comic a few weeks ago:<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg6Gqcx_MuZ6to1_9rclfc4Zsfsitjg-Kwjzf-9F7HyBYsch_qpy0DKLLJRt2b_fZKazyZb5IytLx00b9qgvg7AbOSAQLhiK2coylrNSsUyKHFSN7FolHYAvy1sTJKCdwVU08X1ubgdiZ4j/s1600/frazz+peeve.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="267" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg6Gqcx_MuZ6to1_9rclfc4Zsfsitjg-Kwjzf-9F7HyBYsch_qpy0DKLLJRt2b_fZKazyZb5IytLx00b9qgvg7AbOSAQLhiK2coylrNSsUyKHFSN7FolHYAvy1sTJKCdwVU08X1ubgdiZ4j/s400/frazz+peeve.jpg" width="400" /></a></div>
<br />
misplaced a modifier? what? i wasn't even familiar with this language peeve. yesterday i was catching up on my RSS backlog, and found a post about <a href="http://throwgrammarfromthetrain.blogspot.com/2011/11/elusive-misplaced-only.html" target="_blank">"The elusive 'misplaced only'"</a> on Jan Freeman's blog <a href="http://throwgrammarfromthetrain.blogspot.com/" target="_blank">Throw Grammar From The Train</a>. it details how this peeve works: basically people swear up and down about a relationship between linear order and scope involving <i>only</i>, despite the fact that English doesn't work that way.<br />
<br />
so the "<i>only </i>fetishists" would like Caulfield to put <i>only</i> immediately preceding <i>heartburn</i>, because they are blinded by dogma and can't see that it modifies the entire VP. all in spite of the fact that putting it there makes the sentence actually sound worse. in other contexts it would sound worser and worser. compare the following constructed sentences, also using <i>heartburn</i> just for fun.<br />
<blockquote class="tr_bq">
my uncle went to the ER yesterday because he thought he was having a heart attack…<br />
<br />
but it turned out he only had heartburn.<br />
?but it turned out he had only heartburn.</blockquote>
moving <i>only</i> makes the sentence sound worse. put it in the progressive and it becomes even more terrible: <i>he was having only heartburn?? </i>not modern English. so no, our only conclusion here is that neither Caulfield nor Jef Mallett misplaced a modifier. he put it exactly where it's supposed to go.Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com2tag:blogger.com,1999:blog-7780015459752487466.post-53725711612373926712012-01-09T14:16:00.000-05:002012-01-09T14:19:47.246-05:00"un reality" and unrealitytoday, the following bit of Italian headlinese came across the tubes to my RSS reader:<br />
<blockquote class="tr_bq">
<h1 style="font-family: 'Lucida Sans', Arial, Trebuchet, sans-serif; font-size: 1.6em; line-height: 20px; margin-bottom: 5px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;">
<a href="http://www.calcioblog.it/post/21379/napoli-presentato-vargas-mi-sembra-di-essere-in-un-reality" target="_blank">Napoli, presentato Vargas: "Mi sembra di essere in un reality"</a></h1>
</blockquote>
the post is about a new acquisition of the soccer team in Naples, and his reaction to arriving in the city. the quote in the headline appears to be "I seem to be in a reality." this would be a decidedly odd thing to say in English — some sort of metaphysical claim.<br />
<br />
but actually, it's just the result of a creative borrowing from English. if we were talking about reality vs. fantasy, there's no doubt that the headline would have used the word <i>realtà</i> or <i>verità</i>. (note that i have absolutely no idea what Vargas actually said; the quote is given in a different form later in the article, although still including the word <i>reality</i>, and it may be translated from his native Spanish.)<br />
<br />
so what is <i>un reality</i>? it's a reality TV show. wordreference.com even has a separate Italian to English <a href="http://www.wordreference.com/iten/reality" target="_blank">entry for <i>reality</i></a> indicating this. Italian has a propensity for this type of clip-and-borrow process, often taking just the attributive piece of a phrase or compound, and they turn up very frequently in headlinese, where space is at a premium. (another famous example is Italian <i>basket</i> for English <i>basketball</i>, which is more common than the native <i>pallacanestro.</i>)<br />
<br />
perhaps more interesting than the morphological process here, though, is the semantic shift. Vargas' use of <i>un reality</i> clearly indicates that the content of <i>un reality</i> is anything but reality! replace <i>reality</i> with <i>sogno</i> 'dream' or <i>fantasia</i> 'fantasy' and the sentence means basically the same thing. of course, the blame for making a compositional phrase that can easily shift to mean the opposite probably falls more on English here, but Italian helps to obfuscate the process. i'm sure if we start using <i>a reality</i> to mean <i>a fantasy</i> in English, peevers will tell us that it's just another sign that 2012 is certainly the end of days. i guess we'll just have to wait and see what the <i>realtà </i>turns out to be.Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com1tag:blogger.com,1999:blog-7780015459752487466.post-13734635234799341192011-12-23T00:31:00.000-05:002011-12-23T00:33:00.087-05:00"grammar" as catch-allyesterday, <a href="https://plus.google.com/101833715717140625047/posts" target="_blank">Mignon Fogarty</a> (aka <a href="http://grammar.quickanddirtytips.com/" target="_blank">Grammar Girl</a>) posted a link to "<a href="http://www.onlinecollege.org/the-20-most-controversial-rules-in-the-grammar-world" target="_blank">The 20 Most Controversial Rules in the Grammar World</a>" on Google+. sufficiently baited, i read through it. as i did, i noticed that "The Grammar World" is a very vast place, and may in fact encompass several galaxies.<br />
<br />
i went through the list a second time to categorize each of these alleged "grammar" points in terms of what linguistic realm they fall under. (several fail to qualify for any linguistic subfield.) here were the results.<br />
<br />
<ol>
<li>the Oxford Comma<br /><b>orthography / style<br /></b></li>
<li>the pronunciation of "controversial"<br /><b>morpho-phonology</b>calling it that may be generous.<br /></li>
<li>double negatives<br /><b>syntax</b><br />they give a lousy example and fairly tone-deaf comments, but it's definitely a syntactic issue.<br /></li>
<li>"irregardless"<br /><b>morphology<br /></b></li>
<li>ending sentences with prepositions<br /><b>syntax</b><br />while writing this post, i noticed that when you create a link in the new blogger interface it asks you "To what URL should this link go?" at least it's reassuring that my blog software is a robot, not a native speaker of English.<br /></li>
<li>"hanged" vs. "hung"<br /><b>morphology<br /></b></li>
<li>"like" as a conjunction<br /><b>wait…what?</b><br />hold on, this is a double problem. anyone criticizing someone for using <i>like</i> as a conjunction should first ask themselves whether they know what a conjunction is. this peeve requires it to be anything that takes a clausal complement. the built in Mac OS X dictionary lists such uses of <i>like</i> under a subheading "conjunction", and does the same for <i>when</i> (which also baffled me: apparently <i>Saturday is the day when I get my hair done</i> contains a "relative adverb" while
<span class="s1"><i>I</i></span><i> loved math when I was in school </i>contains a "conjunction". this is completely backwards terminology, since it's clearly the latter that's modifying something verbal, the VP [loved math].) however, the entry for "after" has no erroneous label as conjunction, despite the fact that it too can clearly take clausal complements.<br />so if this one was controversial, it's more likely because it's poorly defined, rather than having zealots on two sides of a clearly drawn line.<br /></li>
<li>"good" vs. "well"<br /><b>morphology</b><br />again, generously. this is a fight over the meaning and distribution of lexical items.<br /></li>
<li>text/internet speak<br /><b>vague orthographical morass</b><br />not particularly grammar-y, and a grab-bag unto itself. this should have failed the criteria for inclusion on the list by not being a <i>rule</i>.<br /></li>
<li>starting sentences with "however"<br /><b>syntax / style<br /></b></li>
<li><span style="font-weight: normal;">starting sentences with "but" or "and"</span><br style="font-weight: normal;" /><b>syntax / style</b><br />should've been 10b.<br /></li>
<li>gender-neutral pronouns<br /><b>morphology</b><br />totally misses the point by not even mentioning singular <i>they</i>. in an article about controversy, we're failing to teach the controversy.<br /></li>
<li>split infinitives<br /><b>syntax<br /></b></li>
<li>passive voice<br /><b>syntax<br /></b></li>
<li>punctuation inside quotation marks<br /><b>orthography<br /></b></li>
<li>possessive apostrophes on words that end in 's'<br /><b>orthography</b><br />side note: my students this past semester were unduly concerned with this. apparently this is a failing offense in certain US high schools now.<br /></li>
<li>"e-mail" vs. "email"<br /><b>very specific piece of orthography</b><br />all the rest up to this point were at least generalizations that had to be applied to individual circumstances.<br /></li>
<li>universal grammar rules<br /><b>what is this i don't even.</b><br />we are informed that Noam Chomsky is an "influential linguist", but otherwise…i mean sure, there's some controversy over whether UG exists, and plenty over what it contains, but this description of it says so little. and if you were thinking that this doesn't seem like a "grammar rule" on par with the rest of the list, just wait for the next one.<br /></li>
<li>the fact that there are different kinds of dashes<br /><b>typography</b><br />GAH. orthography, ok. punctuation, at the fringe of orthography, maybe. EM AND EN DASHES ARE GRAMMAR? about as much as camera ISO settings are visual cognition, or audio file formats are hearing. this wins the award for shameless list-padding.<br /></li>
<li>"who" vs. "whom"<br /><b>morphosyntax</b><br />infamous. <a href="http://descriptively.blogspot.com/2007/10/office-on-prescriptivism.html" target="_blank">covered previously</a>. (now with dead Google Video link!)</li>
</ol>
<div>
the final tally, giving benefit of the doubt: 13 items that could actually be considered on the spectrum of "grammar" from phonology to pragmatics; 4 orthographic points that tangentially bear on the written encoding of language; 2 ill-defined bits of nonsense; and 1 complaint about the geometry of making writing look pretty. i give it a C- and suggest it repeat its course on the definition of grammar.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0tag:blogger.com,1999:blog-7780015459752487466.post-95011273717821702010-04-30T22:28:00.007-04:002010-04-30T22:33:56.539-04:00Ignite Ithaca talkthis is storming the internets thanks to all my friends promoting it. they will probably drive more traffic than me posting this here, but nevertheless, here it is: my <a href="http://igniteithaca.com">Ignite Ithaca</a> talk entitled "<a href="http://www.youtube.com/watch?v=DkkZ77lnuas">Why nobody ever taught you how to write good (and what you can do about it)</a>".<br /><br /><object width="640" height="385"><param name="movie" value="http://www.youtube.com/v/DkkZ77lnuas&hl=en_US&fs=1&"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/DkkZ77lnuas&hl=en_US&fs=1&" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="450" height="270"></embed></object>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com5tag:blogger.com,1999:blog-7780015459752487466.post-76133398834731782812010-02-01T12:18:00.003-05:002010-02-01T12:44:19.089-05:00newsflash: language peeves potentially irritablecoming across the twitter tubes this morning (via <a href="http://twitter.com/jillianp">@jillianp</a>), this story out of New Zealand: "<a href="http://www.stuff.co.nz/national/education/3277574/Research-could-dismay-English-language-purists">Research could dismay English language purists</a>". in other news, "Water is wet", "Vegetarians not so keen on meat", etc. etc.<div><br /></div><div>i shouldn't mock the bit of news that prompted this piece: a USD $400K+ grant to do a massive morphological survey of English. this could be incredibly useful. it's just the "context" that the writer put it in. gems like:</div><div><blockquote>It is the first time the morphology of the English language has been looked at in this depth since rules were first laid out in the 19th century.</blockquote>because before the 19th century, there were no rules! sheer and utter chaos! it's a miracle people could even form words.</div><div><br /></div><div>but this brings up an interesting question: why did the (mostly inaccurate) grammar texts of the 19th century become sacrosanct to the so-called "purists"? and there's no doubt that they've taken on a mystical value, because they have the ability to trump basic logical reasoning. present a "purist" with two options—150 years of hearsay based on something initially wrong, or the collaborative research of renowned language experts—and they'll chose the former every time. and it can't just be anti-institutional, "down with the man!" sentiment; if that were the case, they should have rejected the <a href="http://languagelog.ldc.upenn.edu/nll/?cat=5">prescriptivist poppycock</a> (to borrow <a href="http://ling.ed.ac.uk/~gpullum/index.html">Geoff Pullum</a>'s term) in the first place.</div><div><br /></div><div>oh well, we all know that the surest way to go insane is to argue with irrational people. so instead, i'll just pretend that the headline on the story was "Research could be pivotal for English linguists" and go about my day.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0tag:blogger.com,1999:blog-7780015459752487466.post-18728540666301443902010-01-19T12:15:00.003-05:002010-01-19T12:43:37.918-05:00give and take: math and linguisticsthis is a response to the excellent post "<a href="http://thelousylinguist.blogspot.com/2010/01/why-linguists-should-study-math.html">Why Linguists Should Study Math</a>" over at <a href="http://thelousylinguist.blogspot.com/">The Lousy Linguist</a>, which i found via fellow Cornellian <a href="http://twitter.com/nmashton">@nmashton</a> on twitter. i was going to just write a comment there, but i realized that it would probably become rather long.<div><br /></div><div>first of all, let me say that i am in absolute agreement with the sentiments put forth by Chris in his post. in fact, i'm going to be auditing the brand new, never-before-offered Statistics for Linguists course this semester. but i think that one major point needs to be added.</div><div><br /></div><div>simply: there is a grave asymmetry in linguists learning math versus mathematicians (or statisticians, or computer scientists, etc.) learning linguistics.</div><div><br /></div><div>here's the scenario. you're a grad student in linguistics. this means that you went to high school once, and probably were rather good at most of your subjects, or you wouldn't be a student anymore. in high school, they made you learn math. if you were really good at it, you made it through single-variable calculus; if not, probably trig. even if you didn't like it and haven't touched math since, you should have a decent sense of How Math Works, in case you need to pick it up again.</div><div><br /></div><div>but the converse just isn't true. i've audited the NLP course at Cornell, which is taught by an excellent professor in the CS department who has a very solid grounding in theoretical linguistics. but that almost doesn't matter given the fact that there are <b>zero prerequisites</b> for the course. that's right, no LING101, no nothing. the demographics of the ~80-person lecture break down roughly as 70 CS undergrads, 9 linguistics undergrads, and 1 lonely linguistics grad student.</div><div><br /></div><div>so what's the big problem? they'll learn as they go, right? learning by doing is the best way, no? wrong. as has been shown time after time on Language Log and elsewhere for this and other fields (law, education, etc.), these would-be NLPers have a complex against linguistics. i think they recognize that they're uninformed on the finer points of linguistic theory, but because "hell, i speak a language!" <b>they don't think they need any more expertise </b>to solve complex linguistic problems. throw more code at it, throw more servers at it, we can brute force our way through. i've watched them re-invent the wheel, and it's a square wheel with an off-center axis. and they're not looking to refine its design, or ask those crazy round-wheeler linguists what they've got cooking in their lab. instead they're trying to make titanium and carbon-fiber square wheels, thinking that will improve things. the mantra is to strive for good enough rather than (i concede, unattainable) perfection.</div><div><br /></div><div>i think that linguists are more and more cognizant of the need for mathematical training. and for those who just aren't math types, they're willing to go find fellow linguists who are, or even statisticians and computer scientists outside their departments to collaborate with. but nobody comes knocking on the linguistics department door. it's open, guys, and seriously, you could stand to visit. we won't bite.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com6tag:blogger.com,1999:blog-7780015459752487466.post-44313102710876390412009-12-07T09:16:00.004-05:002009-12-07T11:30:00.093-05:00seek and ye shall not findmorphological revelations on my morning comb through twitter and facebook statuses:<div><blockquote><a href="http://twitter.com/cdbarker/status/6430979729">Just so we're clear: The past tense form of "to seek" is "sought" not "seeked" as 2/3 of my World History class seems to believe.</a></blockquote></div><div>whoa. on the other hand, this isn't entirely unexpected. morphology tends to be entropic, that is, it favors simplicity and regularity and minimal expression, and moves in that direction over time. this doesn't mean the language apocalypse is upon us any more than the heat death of the universe, as predicted by physical entropy, is. just like physical entropy, language entropy can be locally reduced by other factors, particularly token frequency. that is to say—in the broadest terms—speakers are likelier to hang on to irregular forms of words that are used all the time, and tend to regularize words that aren't as common.</div><div><br /></div><div>that brings me to my "whoa" moment. i just hadn't realized that 'seek' was possibly on the cusp of regularization. so the question is, how does 'seek'/'sought' stack up to other verbs with past tense forms in <i>-ought</i>? to get a comprehensive list, i turned to <a href="http://www.schneertz.com/revalph/T.html">a reverse dictionary</a>, which yielded just five non-compound -ought pasts: <i>bought, fought, thought, brought,</i> and our test case, <i>sought</i>. next to test their frequencies i headed to <a href="http://www.wordcount.org/">wordcount.org</a>, a nifty visualization of frequency in the British National Corpus. admittedly the BNC might not give the most precise results for predicting the tendencies of young speakers in Michigan, but should be accurate enough. here are their ranks (not token counts; smaller numbers indicate higher frequency):</div><div><br /></div><div>buy/bought: 785/1129</div><div>fight/fought: 1484/3204</div><div>think/thought: 102/152*</div><div>bring/brought: 631/461</div><div>seek/sought: 1875/1895</div><div><br /></div><div>the data reveals that i perhaps shouldn't be as surprised as i was. 'seek' is the least frequent of the five verbs, although strangely 'fought' is the least frequent past tense form. i starred 'thought' since its frequency is probably affected considerably by use of the noun 'thought'. also of note is the fact that 'bring' is the only item whose past tense is more frequent than the base form; this is due to the fact that 'bring' requires a progressive present tense ("I bring the wine" ≠ "I am bringing the wine" but rather "I (habitually) bring the wine"). despite—or perhaps owing in part to—its frequency, 'bring' is subject to taking on a different irregular pattern, 'bring'/'brang'/'brung' in many children's speech and some adult dialects.</div><div><br /></div><div>anyhow, to wrap this up, it looks like 'sought' might well be the best candidate of these forms to undergo regularization, even if i hadn't expected it before. the only other form that might do the same is 'fought'-->'fighted', but i think that would be even more surprising...i'm actually wondering why its frequency turned out to be so low in the BNC.</div><div><br /></div><div>a postscript: although i certainly have 'sought' as the past tense of 'seek' in its basic sense "to look for", 'seeked' is also in my lexicon. it's the past tense of the relatively new lexical item 'seek' "to move rapidly through a video or audio clip". 'sought' is terrible as its past tense:</div><div><br /></div><div>i seeked ahead 2 minutes to skip the commercials.</div><div>*i sought ahead 2 minutes to skip the commercials.</div><div><br /></div><div>this kind of regularization is a common symptom of generating a new, distinct lexical entry from an existing form, cf. the classic case <i>bad/worse/worst</i> vs. <i>bad/badder/baddest</i>.</div><div><br /></div><div>[UPDATE] regarding 'wrought', which is very low frequency, and i (rightly) eliminated from consideration as not being a productive past form. i commented the following on the ongoing facebook thread that prompted this all:</div><div><blockquote>'wrought' is a strange case...it's actually the old past participle of 'work' (e.g. "wrought iron" = "worked iron" ≠ "wreaked iron"), and the historical past tense of 'wreak' is regular 'wreaked'. they got conflated because both 'work' and 'wreak' were used in the "____ havoc" idiom. since 'wrought' is almost never used outside the idiom any more, it probably doesn't fit into the regularization question here.</blockquote></div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com5tag:blogger.com,1999:blog-7780015459752487466.post-66450293883087581272009-08-27T10:35:00.003-04:002009-08-27T10:52:41.429-04:00surprisal for dogs<div style="text-align: left;">today's Frazz comic:</div><div><br /></div><div><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 128px;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhTv-fPLlaeNphH5GZWzFLS0wdl8j4K35CJ3sNm6Qg1HNGII2688AcZ4drASfGhhdDZ47Garn6gMeezlFE5XZ4lNJDVapM_8kTgo8lEMKPdLpmJvAI5m2FS9U89OxU788bc8T3fJmT_ahYy/s400/largeimage.ae6b75f44de7acc1c99ff92ed21f1913.gif.jpeg.gif" border="0" alt="" id="BLOGGER_PHOTO_ID_5374654776735403858" /></div><div style="text-align: left;">i don't know if anybody has actually done research on dogs' abilities to learn frequency-based patterns (although we had <a href="http://languagelog.ldc.upenn.edu/nll/?p=1596">cottontop tamarins</a> not so long ago). and unlike the <s>grammar</s>pattern-sensitive monkeys, Mario didn't even wait to confirm the probability-based prediction, he just went for it.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com1tag:blogger.com,1999:blog-7780015459752487466.post-30552895508502126852009-02-07T14:02:00.004-05:002009-02-07T14:14:39.813-05:00AND??? and i hate you, congress.why, why do i do things like try to figure out what has been going on in the Senate regarding the scazillion-dollar stimulus plan? it only a) gets my blood pressure up and b) confirms that our elected representatives are morons, or at least have stared at legislative doubletalk for so long that their judgements about English have been seriously compromised. exhibit 1: Senate Amendment 309, introduced by <a href="http://en.wikipedia.org/wiki/Thomas_Coburn">Thomas Coburn</a> (R-OK)<br /><p reps="assembly://GovTrackWeb/GovTrack.Web.Reps" bills="assembly://GovTrackWeb/GovTrack.Web.Bills" bill="assembly://GovTrackWeb/GovTrack.Web.Pages.Congress.Bill" context="HttpContext" util="assembly://GovTrackWeb/GovTrack.Web.Util"> </p><blockquote><p reps="assembly://GovTrackWeb/GovTrack.Web.Reps" bills="assembly://GovTrackWeb/GovTrack.Web.Bills" bill="assembly://GovTrackWeb/GovTrack.Web.Pages.Congress.Bill" context="HttpContext" util="assembly://GovTrackWeb/GovTrack.Web.Util">At the appropriate place, insert the following:</p><p reps="assembly://GovTrackWeb/GovTrack.Web.Reps" bills="assembly://GovTrackWeb/GovTrack.Web.Bills" bill="assembly://GovTrackWeb/GovTrack.Web.Pages.Congress.Bill" context="HttpContext" util="assembly://GovTrackWeb/GovTrack.Web.Util">SEC. __. LIMIT ON FUNDS.</p><p reps="assembly://GovTrackWeb/GovTrack.Web.Reps" bills="assembly://GovTrackWeb/GovTrack.Web.Bills" bill="assembly://GovTrackWeb/GovTrack.Web.Pages.Congress.Bill" context="HttpContext" util="assembly://GovTrackWeb/GovTrack.Web.Util"> None of the amounts appropriated or otherwise made available by this Act may be used for any casino or other gambling establishment, aquarium, zoo, golf course, swimming pool, stadium, community park, museum, theater, art center, and highway beautification project.</p></blockquote><p reps="assembly://GovTrackWeb/GovTrack.Web.Reps" bills="assembly://GovTrackWeb/GovTrack.Web.Bills" bill="assembly://GovTrackWeb/GovTrack.Web.Pages.Congress.Bill" context="HttpContext" util="assembly://GovTrackWeb/GovTrack.Web.Util"></p>take another look. "…museum, theater, art center <span style="font-weight: bold; font-style: italic;">and</span> highway beautification project"??? that is one hell of a project. in fact, i'm pretty sure you won't be finding any such mega-conglomerate initiative anywhere in the original bill. and they <span style="font-style: italic;">passed</span> this amendment. what a waste of time. idiots.<br /><br />of course, having someone proofread the damn thing and change <span style="font-style: italic;">and</span> to <span style="font-style: italic;">or</span> would have saved the nonsense that this will create in conference committee, in the courts if the bill is signed into law with this amendment in place, &c. &c.Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com2tag:blogger.com,1999:blog-7780015459752487466.post-67017503238298359622009-02-01T17:55:00.005-05:002009-02-01T17:57:40.585-05:00Pearls Before Swine takes on English-onlysums it up pretty well, i should say (click for big):<br /><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://d.yimg.com/us.yimg.com/p/umedia/20090201/cp.7c6b3dcfb9b5a8859c996b9ee8554277.gif"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 100%;" src="http://d.yimg.com/us.yimg.com/p/umedia/20090201/cp.7c6b3dcfb9b5a8859c996b9ee8554277.gif" alt="" border="0" /></a>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0tag:blogger.com,1999:blog-7780015459752487466.post-86425166402033660312008-08-27T21:46:00.005-04:002008-08-27T22:20:28.059-04:00flavors of English on Googlei was just looking through the site statistics for this here blog. one of the most interesting and useful bits of information that <a href="http://www.statcounter.com/">statcounter</a> provides me are the search terms that people use. i would say that 99% of these searches are done on Google — we really have drunk the pagerank kool-aid. a lot of searches are pretty lengthy and specific (e.g. "kobe bryant interview in italian" or "who is the girl in the benny lava video?"). one recent search stuck out to me, though. somebody searched for just the word "whomever", and wound up at my previous post "<a href="http://descriptively.blogspot.com/2007/10/office-on-prescriptivism.html">The Office on <span style="font-style: italic;">whomever</span></a>". i thought that was pretty remarkable. i clicked through on the link that statcounter provided me and saw that the search was made on google.co.uk, and that descriptively adequate was on the front page of results, at position number 6.<br /><br />then, for whatever reason, i decided to re-run the search using google.com. my post was nowhere to be found on the first page. the results were entirely different. descriptively adequate finally showed up at #14 on the list of results. what's going on? certainly google hasn't written different versions of pagerank to deal with different localizations of English? as far as cataloguing search results goes, the fact that a bunch of Americans in California wrote the algorithm shouldn't adversely affect Brits and the like.<br /><br />i couldn't stop there. i ran the search on all of the English Google localizations that i could think of, and got even more different results. i've also noted the number of total results that Google estimates, which also (oddly) vary by localization.<br /><br /><table border="1"><tbody><tr><th>localization</th><th>#</th><th>total hits</th></tr><tr><td><a href="http://www.google.com/search?hl=en&safe=off&q=whomever&start=10&sa=N">google.com</a></td><td>14</td><td>7,480,000</td></tr><tr><td><a href="http://www.google.co.uk/search?hl=en&q=whomever&btnG=Google+Search&meta=">google.co.uk</a></td><td>6</td><td>8,200,000</td></tr><tr><td><a href="http://www.google.ca/search?hl=en&q=whomever&btnG=Google+Search&meta=">google.ca</a></td><td>7</td><td>8,180,000</td></tr><tr><td><a href="http://www.google.com.au/search?hl=en&q=whomever&btnG=Google+Search&meta=">google.com.au</a></td><td>10</td><td>8,190,000</td></tr><tr><td><a href="http://www.google.co.nz/search?hl=en&q=whomever&btnG=Google+Search&meta=">google.com.nz</a></td><td>7</td><td>8,460,000</td></tr></tbody></table><br />as i was compiling this table i remembered that Google mucks with your search results if you're signed in (which i of course had to be in order to access blogger, without which i couldn't be writing this post). i signed out, and on google.com the DA link rose to #4. i guess i should just be happy i'm on the front page on all of these searches. but there are still lingering, bizarre questions.<br /><br /><span style="font-weight: bold;">why does Google report different numbers of hits for different localizations?</span><br />no clue. (comments are open!)<br /><br /><span style="font-weight: bold;">what is causing the rank fluctuations even when i'm not logged in?</span><br />some clue. on all of the non-US localizations there is a feature "search pages from [country name]". perhaps i've got fewer australian sites linking to my blog, so my rank is slightly lower in australia than in the US or great britain.<br /><br /><span style="font-weight: bold;">why the hell is Google biasing my custom algorithm against my own damn blog?!</span><br />i mean throw me a bone here, guys.<br /><br /><span style="font-style: italic;"><span style="font-weight: bold;">and the baffler...</span></span><span style="font-weight: bold;"><br />why do i get this on google.ca?<br /></span><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjSAJpUki1eiyiDX7xZHvxYcLsAVNg58nTSnRFxfiBWYhyphenhyphenhYBybiQ2DlkoMPQ4BgAi4SGemrscnoQMbDXIizBKvwryxUtn7iBQ4CQWCDlYjrVf7FVTz9o2GL_Hf9pW6y0cniltE9yXfd6jL/s1600-h/did+you+mean+whatever%3F.png"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjSAJpUki1eiyiDX7xZHvxYcLsAVNg58nTSnRFxfiBWYhyphenhyphenhYBybiQ2DlkoMPQ4BgAi4SGemrscnoQMbDXIizBKvwryxUtn7iBQ4CQWCDlYjrVf7FVTz9o2GL_Hf9pW6y0cniltE9yXfd6jL/s400/did+you+mean+whatever%3F.png" alt="" id="BLOGGER_PHOTO_ID_5239386903602384626" border="0" /></a>i mean, you're kidding, right? i'm sure that the frequency of <span style="font-style: italic;">whatever</span> is much higher than that of <span style="font-style: italic;">whomever</span>, but 8 million hits on a word that's in the dictionary should be enough data for google to not question my intent. and why only canadians, eh? this, of course, isn't the first time that i've seen <a href="http://descriptively.blogspot.com/2008/04/wtf-cupertino.html">weird spelling suggestions on Google</a>. so perhaps they really do think they know something about English varieties that i don't?Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0tag:blogger.com,1999:blog-7780015459752487466.post-7202831186805976242008-08-23T22:39:00.003-04:002008-08-23T22:58:13.263-04:00Malaysian government fails to ban feature reconstructionplease, don't judge me about the inspiration for this post. the short story is "sometimes you just get bored, and who knows where you could end up on Wikipedia!" tonight it was crappy pop song articles. thence comes this quote from the "Controversy" section of the article for this summer's top hit, <a href="http://en.wikipedia.org/wiki/I_Kissed_a_Girl">I Kissed a Girl</a>.<br /><blockquote>In Malaysian radio stations, the song has been retitled 'I Kissed...' with the words 'a girl' silenced throughout the chorus in the song.<span title="This claim needs references to reliable sources since July 2008" style="white-space: nowrap;"></span></blockquote>never mind the odd choice of preposition (as a native English speaker i've never heard a song <span style="font-style: italic;">in</span> a radio station; <span style="font-style: italic;">on</span> works fine). the fact of the matter is that this censorship is about as effective as bleeping the <span style="font-style: italic;">-hole</span> in <span style="font-style: italic;">asshole</span>. if you take the phrase "i kissed a girl" and eliminate "a girl", then in isolation it becomes completely open-ended. it could be "i kissed a man" or "i kissed my mother" or "i kissed a frog". too bad there are more lyrics in the song's refrain!<br /><blockquote>i kissed <s>a girl</s> / and i liked it / the taste of <span style="font-weight: bold;">her</span> cherry chapstick</blockquote>oops! there's a gendered pronoun hanging out there, eight words later. and it needs an antecedent. and the only preceding nominals are <span style="font-style: italic;">i</span> and <span style="font-style: italic;">it</span>. <span style="font-style: italic;">i </span>can't be the antecedent, because then she would have said <span style="font-style: italic;">my</span>, and <span style="font-style: italic;">it</span> is decidedly neuter. so it can only be...gasp! she didn't! chances are nobody's getting the wool pulled over their eyes either; Wikipedia also says that increasing numbers of Malaysians are identifying English as a first language. they can put the pieces of this not-so-tricky linguistic puzzle back together as quickly as i did. censorship falls flat again.<br /><br />i think i've gotten more linguistic enjoyment out of the song than by listening to it. there's one other bit of the chorus that intrigued me. it's the other pronoun in those lines, namely <span style="font-style: italic;">it</span>. i'm sure that the intended antecedent is "[the fact that] i kissed a girl", but i can't help but get an ambiguous interpretation where it could be topicalized and actually refer to "the taste..." is this a weird judgement? comments are always open here.Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com2tag:blogger.com,1999:blog-7780015459752487466.post-10372885490410680712008-08-18T16:15:00.004-04:002008-08-18T16:36:11.366-04:00we want...a count noun!it's great that Language Log has enabled comments on some of their posts, but it's all the more frustrating when i have something pithy to say and they're turned off. this is a would-be comment in response to Arnold Zwicky's post <a href="http://languagelog.ldc.upenn.edu/nll/?p=501">Countification</a>.<br /><br />in describing the difference between mass and count nouns in English, he says that <span style="font-style: italic;">shrub</span> is a count noun while <span style="font-style: italic;">shrubbery</span> is a mass noun. while i can certainly use <span style="font-style: italic;">shrubbery</span> as a mass noun, i rarely talk about shrubbery at all, and when i do, i'm almost always quoting Monty Python.<br /><br /><object height="344" width="425"><param name="movie" value="http://www.youtube.com/v/QTQfGd3G6dg&hl=en&fs=1"><param name="allowFullScreen" value="true"><embed src="http://www.youtube.com/v/QTQfGd3G6dg&hl=en&fs=1" type="application/x-shockwave-flash" allowfullscreen="true" height="344" width="425"></embed></object><br /><br />throughout the Knights Who Say 'Ni!' sketch, <span style="font-style: italic;">shrubbery</span> is used consistently as a count noun, taking determiners such as <span style="font-style: italic;">a</span> and <span style="font-style: italic;">another</span>, and having a plural form <span style="font-style: italic;">shrubberies</span>. i never thought of these as ungrammatical in any way, although i suppose it would add to the humor, although the pure absurdity of the scene is plenty. we of course also have Monty Python to thank for the brilliant backformation <span style="font-style: italic;">shrubber</span> (n.) - one who arranges, designs, and sells shrubberies.Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com1tag:blogger.com,1999:blog-7780015459752487466.post-47000955893057431192008-08-18T16:13:00.004-04:002008-08-18T16:36:39.333-04:00aaaaaand back!a barrage of posts is forthcoming! i'm officially declaring my summer blogging malaise to be over as a new school year is (sadly) just around the corner. as with all my blog revival phases in the past, things will probably slow down in a few weeks, but in the meantime, here we go!Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0tag:blogger.com,1999:blog-7780015459752487466.post-18335623383674719862008-06-04T18:46:00.004-04:002008-06-04T18:49:30.023-04:00mind your [t]s and [ʔ]sthis is a few days old, coming from this past weekend's National Spelling Bee. ah, America. home of the only language in the world that actually creates a need for spelling bees (as far as i know), and word final glottalization of vowels.<div><br /><object width="425" height="344"><param name="movie" value="http://www.youtube.com/v/NaQ22DM0mjs&hl=en"><param name="wmode" value="transparent"><embed src="http://www.youtube.com/v/NaQ22DM0mjs&hl=en" type="application/x-shockwave-flash" wmode="transparent" width="425" height="344"></embed></object><br /></div><div><br /></div><div>Erin Andrews and numbnuts? youtube gold.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com1tag:blogger.com,1999:blog-7780015459752487466.post-13138457601767781992008-05-12T18:06:00.005-04:002008-05-12T18:13:22.343-04:00don't mention this onea strange coincidence of events just occurred. i was walking home from campus, where i had been working on a paper on some bizarre control phenomena, when "Pipe Dream" by the now-sadly-defunct <a href="http://myspace.com/trendy">Trendy</a> came up on my ipod. the first sentence of the song is:<div><blockquote>would it be all right / if i asked a deaf guy / to borrow his headphones?</blockquote></div><div>yeah, that's PRO, in a clause introduced by <span class="Apple-style-span" style="font-style: italic;">ask </span>that also contains an object, being subject-controlled. i've got enough to talk about in my paper (analyzing sentences of the type "John said PRO to leave"), so this oddball case is going to get conveniently ignored for the time being.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0tag:blogger.com,1999:blog-7780015459752487466.post-51424896646196984812008-04-29T00:30:00.006-04:002008-04-29T00:54:10.876-04:00WTF, Cupertino???i have just stumbled upon a most peculiar <a href="http://www.google.com/search?hl=en&q=language+log+cupertino&btnG=Search">Cupertino effect</a> trap. i am writing a paper for my phonology ii class on <a href="http://www.google.com/search?hl=en&q=raddoppiamento&btnG=Search"><span class="Apple-style-span" style="font-style: italic;">raddoppiamento</span></a> in Italian. it's a continuation of some previous work that i did last semester. when writing my previous paper, i was running Mac OS X 10.4. the system-wide spellchecker had no clue what this big, long, Italian word was, and offered up no possible corrections. to avoid lots of little red underlines, i added the word to the dictionary and went merrily on my way. since then i suffered a hard drive failure, which had two consequences: 1) the dictionary "forgot" that <span class="Apple-style-span" style="font-style: italic;">raddoppiamento</span> is a word, as far as i'm concerned and 2) i've upgraded to Mac OS X 10.5. no problem, i just have to re-teach the dictionary the word.<div><br /></div><div>and that's when this happened:</div><div><br /></div><div><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj08Rb_h-7xQxoj4pdYr9Ur47vdLuaGESQ0c_EQfKFYDEsvLrXWdD8kAFKY6KBB77PmeltJdcLHNoY4BIFHD8ksN9OthMikUHkhblWTV_vBpejn2fd7qDdBi5UxdDK1tew0A7ZeJ7qKjyFL/s400/Pages001.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5194522326031857618" /></div><div style="text-align: center;"><br /></div><div style="text-align: left;">whaaaaaaaaat? where did that "d" come from?? i fired up dictionary.app (which presumably is the same dictionary resource used by the spellchecker) and, no surprise, <span class="Apple-style-span" style="font-style: italic;">raddoppiamdento</span> is decidedly not in the dictionary either. nor, as far as i can tell, is any substring that would fool the spelling suggestions algorithm (which i know for a fact does sometimes produce novel suggestions, especially when two words are accidentally conjoined) into thinking that <span class="Apple-style-span" style="font-style: italic;">raddoppiamdento</span> was not only a possibility, but in fact preferable to <span class="Apple-style-span" style="font-style: italic;">raddoppiamento</span>.</div><div style="text-align: left;"><br /></div><div style="text-align: left;">just for confirmation that i wasn't going crazy, i turned to Google, which seems to know which is the real word and which is the imposter (what the hell is going on, <span class="Apple-style-span" style="font-style: italic;">imposter</span> just got flagged, despite the fact that dictionary.app gives both <span class="Apple-style-span" style="font-style: italic;">imposter</span> and <span class="Apple-style-span" style="font-style: italic;">impostor</span>):</div><div style="text-align: left;"><br /></div><div style="text-align: left;"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhm15ugCjZBM0Ory_MR-mijmj6RYLGwzKpyNNqeePozf1xtXUFiH1_MCMo75BmNz31HpYD-GUi6kSvRDa3MfZMMKoXeA6o9b8Sm-_80vCLSrkvSjXvY1MPkziQkKaEWxP68iN8nQBx0Vtnb/s400/Safari001.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5194524001069103074" /></div><div style="text-align: center;">um, yes?</div><div style="text-align: center;"><br /></div><div style="text-align: left;">if anyone has any hypothesis whatsoever as to what the great "improvement" Apple made to its spellchecker algorithm such that it produces these shenanigans, please leave a comment.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com1tag:blogger.com,1999:blog-7780015459752487466.post-36107059207072058512008-04-26T09:42:00.001-04:002008-04-26T00:50:14.835-04:00i can haz fotos?seriously, flickr? seriously?<div><br /></div><div><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjjUyVt_fPTOE1hyCK-HSMutnbTt4qY0JLjOuVfrX4Rc6B1TKZBByxYPzIumTFWqqXByH4V6hCZ0-2bn9c2bvtFEcEcFH-6owAC_dMuMhxXIejdTSt5xoXHwHRfOMJ0_k4nxmQ4_nbkB1mA/s400/Safari001.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5193410066941147042" /></div><div style="text-align: center;"><br /></div><div style="text-align: left;">a quick handful of refreshes gave me greetings in languages that i expected: Portuguese, Tagalog, Arabic, French, and English. although i guess flickr isn't totally hanging on formality—both "hello" and "yo" are given as English greetings.</div><div style="text-align: left;"><br /></div><div style="text-align: left;">just as long as they don't publish an entire site localization in lolcat.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com1tag:blogger.com,1999:blog-7780015459752487466.post-52482772457869866662008-04-24T10:32:00.002-04:002008-04-24T10:45:41.763-04:00on the origin of sign languages?with sign languages, the sign is not necessarily arbitrary. which can lead to hilarity.<div><br /></div><div>i stumbled on this from a "don't click here!" link on <a href="http://www.mgoblog.com">mgoblog</a> today. i'm a little embarrassed to say that i enjoyed it quite a bit, and there are several other related videos on youtube.</div><div><br /></div><object width="425" height="355"><param name="movie" value="http://www.youtube.com/v/osnUB9bUm-E&hl=en"><param name="wmode" value="transparent"><embed src="http://www.youtube.com/v/osnUB9bUm-E&hl=en" type="application/x-shockwave-flash" wmode="transparent" width="425" height="355"></embed></object><div><br /></div><div>the performer is <a href="http://en.wikipedia.org/wiki/David_Armand">David Armand</a>, in character as Johann Lippowitz. this was recorded in 2005, so don't get upset with me for being behind on my memery.</div><div><br /></div><div>this also reminds me of the "fake language" videos going around youtube last year (which seem to have all been taken down since). in those, the best you can do to fake a language is try to make up nonsense words that accurately reflect the phonology and prosody of the language, but with fake sign language you can actually put some semantic content into the mix. just think about it: it's possible to "translate" an English song into fake sign language, but it's impossible to translate ASL speech into fake English!</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0tag:blogger.com,1999:blog-7780015459752487466.post-8198660772006144702008-04-23T21:50:00.002-04:002008-04-23T22:06:37.441-04:00everything's...segmented differently in texas?i've been watching a lot of hockey on the versus recently (probably to the detriment of my academics, but it's the playoffs!) as a result, i've seen the same eight commercials dozens and dozens of times over the past couple weeks. mostly they're unremarkable, but one stuck out. it's an ad by the Texas tourism board or something, and what caught my attention the first time i saw it is their tagline, which is displayed proudly at the end of the spot:<div><blockquote><a href="http://www.traveltex.com/">Texas: it's like a whole other country</a></blockquote></div><div>i meant to write on "split another" last fall, but never got around to it, after i realized that there's already been quite a bit done on it (which i somehow can't find now via quick google search; odd). the fact of the matter is that i didn't have anything too exciting to say. my ideolect has split another—i say "a whole 'nother" without exception, so when i saw the Texas ad, it seemed wrong to me. i wonder how many Texans actually say "a whole other." my bet is that it's very few, and the only reason we got "whole other" in the tagline is because an editor at some point in the creative process nixed a spelling with <span class="Apple-style-span" style="font-style: italic;">nother</span>.</div><div><br /></div><div>so why prefer <span class="Apple-style-span" style="font-style: italic;">nother</span> at all? this may be just due to the influence of orthography getting the best of me, but it seems that if you take it upon yourself to reconstruct the unmodified phrase by removing <span class="Apple-style-span" style="font-style: italic;">whole</span>, the results are better with <span class="Apple-style-span" style="font-style: italic;">nother</span>.</div><div></div><blockquote><div>a whole other --> *a other --> ?an other</div><div>a whole 'nother --> a 'nother --> another</div></blockquote><div>when you get to the intermediate stage <span class="Apple-style-span" style="font-style: italic;">a other</span>, allomorphy kicks in and you get an other, but does it get to go on to fully become <span class="Apple-style-span" style="font-style: italic;">another</span> afterwards? it seems to me that <span class="Apple-style-span" style="font-style: italic;">a 'nother</span> is more likely to reform into a single word. like i said, i realize this is relying on the spelling somewhat, but i have a feeling that unless i'm totally artificially imposing it there is some difference in the prosody of <span class="Apple-style-span" style="font-style: italic;">an other</span> vs. <span class="Apple-style-span" style="font-style: italic;">a 'nother</span>.</div><div><br /></div><div>regardless of whether this little reconstruction exercise was sound or not, i still think that Texas should have stuck with <span class="Apple-style-span" style="font-style: italic;">'nother</span>.</div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com1tag:blogger.com,1999:blog-7780015459752487466.post-397893788665189412008-04-21T00:16:00.007-04:002008-04-21T00:51:31.516-04:00LL made me do itlook at this here, a new post. back from the dead. Eric Bakovic posted a <a href="http://languagelog.ldc.upenn.edu/nll/?p=69">call for comment</a> on a couple sentences over on Language Log, and I couldn't resist putting in a quick two cents. i'm sure his comments will be more in-depth, but here's a quick shot before i head to bed.<div><br /><div>the sentences in question are the following:</div><blockquote>I’ll never forget how he must have felt. (overheard)<br />Aren’t you glad you archived instead of deleted? (over-read)</blockquote><div><span class="Apple-style-span" style="font-weight: bold;"><br /></span></div><div><span class="Apple-style-span" style="font-weight: bold;">1. I'll never forget how he must have felt.</span></div><div>yeah, this is definitely semantically odd. the intent and necessary pragmatic context seems pretty obvious to me, and i doubt that i would do a serious double-take at hearing it. the oddity lies in the complement of <span class="Apple-style-span" style="font-style: italic;">forget</span>, "how he must have felt." there's no syntactic trouble, since similar constructions like "i'll never forget how that sunset looked" are unproblematic. there is a semantic disconnect, though. the predicate <span class="Apple-style-span" style="font-style: italic;">forget</span> includes a presupposition, namely that whatever is to be forgotten was at some point known or experienced by the subject. this is not realized in the complement "how he must have felt," because the experiencer in the complement is not the same as the experiencer of <span class="Apple-style-span" style="font-style: italic;">forget</span> (the forgetter, if you will).</div><div><br /></div><div>nevertheless, i can come up with two scenarios where this sentence could be used felicitously. the first, and i think more plausible, interpretation is that the sentence means "i will never forget witnessing/experiencing his outward reaction of how he felt." for example, someone just dropped a full cup of coffee all over himself; he makes an unforgettable yelp and a look of shock comes over his face. it's an event not to be forgotten, and furthermore the onlooker has a pretty good idea of how a hot cup of coffee in the lap feels. the second, and a little more strained, interpretation is that the forgetter has had a strikingly parallel experience, and therefore "how he must have felt" actually refers to the forgetter's experience, which is in keeping with the requirements of the predicate. an example scenario: "his brother died of cancer last month. i'll never forget how he must have felt (because my brother died of cancer too)."</div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;">2. Aren't you glad you archived instead of deleted?</span></div><div>maybe i've just been organizing my email too much recently, but this seems perfectly normal to me. of course, if i look at it, i can see the object drop that's going on, which is uncommon for english. ew, i thought simple reconstruction would do the trick, but that just creates more syntactic problems. taking "your email" as the understood object of each predicate gives</div><div></div><blockquote><div>*aren't you glad you archived <span class="Apple-style-span" style="font-style: italic;">your email</span> instead of deleted <span class="Apple-style-span" style="font-style: italic;">your email/it</span>?</div></blockquote><div>bad, bad, bad. this is definitely WTF coordination. the closest grammatical paraphrase with reconstruction would be something like:</div><div></div><blockquote><div>aren't you glad you archived your mail instead of <span class="Apple-style-span" style="font-weight: bold;">deleting</span> it?</div></blockquote><div>the question then is whether this is still perceived as an oddity with truly unaccusative predicates. are the following weird?</div><div></div><blockquote><div>aren't you glad you walked instead of ran?</div><div>aren't you glad you whispered instead of shouted?</div></blockquote><div>the walked/ran sentence seems more natural to me; it might have something to do with the fact that the forms are not morphologically parallel. anyway, there is something about the "instead of..." clause that wants a tenseless verb form. i'll leave it to Eric to flesh that out further.</div><div><br /></div><div><br /></div><div>more posts to come soon in the future? maybe. because i quit blogging when it starts to feel like work. but when i have papers to write, blogging definitely becomes a preferable form of writing. (huzzah, procrasti-tasks!)</div></div>Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0tag:blogger.com,1999:blog-7780015459752487466.post-64958098807365893872007-11-22T13:24:00.001-05:002008-08-11T10:21:53.267-04:00the peroles of "whore"i was watching the pregame coverage of the Packers - Lions football game today and noticed the continuation of a somewhat disturbing trend: strange pronunciations in the singing of The Star-Spangled Banner. now you should know that i'm certainly no prescriptivist, so this has to be offending some other sensibility of mine. my one peeve is certainly more legitimate than the other.<br /><br />prior to today's game, the singer (whose name i unfortunately missed) pronounced the word <span style="font-style: italic;">perilous</span> rather strangely, with a definite [o] as the nucleus of the middle syllable. i know that singing coaches often fiddle with vowel quantities, but i wonder what motivated the distinction to change [<span class="IPA">ɨ</span>] or [ə] to [o]. it certainly wasn't by analogy to the stem which <span style="font-style: italic;">perilous</span> is formed from, namely <span style="font-style: italic;">peril</span>, in which the corresponding syllable is pronounced with [ɪ], which is perfectly acceptable in most voice coaching systems, to the extent that i'm familiar with them. based on some standard assumptions about back-formation and english orthography, you would guess that the stem is <span style="font-style: italic;">perole</span>. but as i said, this is a bit more of a peeve than anything else—it doesn't really detract from the song.<br /><br />what does detract from a national anthem, designed to honor our country, is the gratuitous aspiration of vowel-initial words when singing. sometimes it's even more pronounced than some sort of aspiration, and is an actual addition of [h] to the beginning of the word. this is dangerous within the context of the phrasing of our national anthem. i have heard several performers turn "o'er the land of the free..." into what very distinctly sounds like "whore the land of the free..." this sounds like an imperative verb with the interpretation "sell out the land of the free for sexual favors." now i think the whole <a href="http://news.yahoo.com/s/afp/lifestyleaustraliachristmasoffbeat"><span style="font-style: italic;">ho ho ho</span> debacle</a> with the Australian Santa Clauses is absurd, but clearly pronouncing the word <span style="font-style: italic;">whore</span> in the middle of the national anthem—even unintentionally—is really a bad idea, regardless of who it's offending. so please, respect our country, and watch your pronunciation, at least when you're singing on national TV.Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com7tag:blogger.com,1999:blog-7780015459752487466.post-5768667493850275092007-11-07T20:08:00.000-05:002007-11-07T20:15:56.254-05:00quick takes; hiatusthis is a very very short post, as i'm very busy. i won't be around to make any new posts until at least monday night, and probably not until tuesday. so here we go!<br /><br /><span style="font-weight: bold;">immature moments in phonology</span><br />in my phonology class yesterday, we were looking at a data set. i had to stifle a few giggles as i looked over one of the examples. this is probably funny even if you don't know IPA: fʌkkʌrʌ. i'm glad it was only referred to as "example 3" and not read aloud.<br /><br /><span style="font-weight: bold;">resumptive pronoun sighting</span><br />i noticed this yesterday while reading an article about file-sharing lawsuits:<br /><blockquote>It then demands the institution to turn over the identities of the individuals <span style="font-weight: bold;">to whom</span> the IP addresses were assigned <span style="font-weight: bold;">to</span>.</blockquote>these constructions made the <a href="http://itre.cis.upenn.edu/%7Emyl/languagelog/archives/004510.html">headlines over at language log</a> a while back. no time to add anything new to that, but it stuck out to me!<br /><br />gotta run now. have a good weekend!Ed Cormanyhttp://www.blogger.com/profile/13832549055517378348noreply@blogger.com0