Google Translate

All non-Go discussions should go here.
Post Reply
hyperpape
Tengen
Posts: 4382
Joined: Thu May 06, 2010 3:24 pm
Rank: AGA 3k
GD Posts: 65
OGS: Hyperpape 4k
Location: Caldas da Rainha, Portugal
Has thanked: 499 times
Been thanked: 727 times

Google Translate

Post by hyperpape »

I played around with Google Translate in response to Daniel's post. I was surprised how quickly an English-French-English translation became stable. That's not so surprising when you think about it, but I was not expecting it. So I tried a few more languages.
original wrote:I'd say it's more like go is a conversation where everything you say is translated by google through half a dozen different languages (e.g., Engish->Japanese->Czech->Korean->Basque->Chinese->your opponent's language) before your opponent hears it. You know what your opponent said, but you can't assume that what you think it means is what they thought it meant.
After being translated (something like English-French-Spanish-German-Portugeuse-Italian-English)
after translation wrote:I'd say more like a conversation, if everything you say is Google (translated as Engish-> Japan-> Czech Republic> South> Basque Country> Chinese language> opponent) by a half dozen different languages​​, the first of its opponent feels. Do you know what your opponent said, but one can not assume that what you think it means what they thought it meant.
Surprising how good the "main" European languages are, though there's still a lot of loss of meaning there.
User avatar
tchan001
Gosei
Posts: 1582
Joined: Wed Apr 21, 2010 6:44 pm
GD Posts: 1292
Location: Hong Kong
Has thanked: 54 times
Been thanked: 534 times
Contact:

Re: Google Translate

Post by tchan001 »

Try translating through Korean as well. I've had really bad times trying to translate Korean into intelligible English with regards to some Korean go books.
http://tchan001.wordpress.com
A blog on Asian go books, go sightings, and interesting tidbits
Go is such a beautiful game.
User avatar
daniel_the_smith
Gosei
Posts: 2116
Joined: Wed Apr 21, 2010 8:51 am
Rank: 2d AGA
GD Posts: 1193
KGS: lavalamp
Tygem: imapenguin
IGS: lavalamp
OGS: daniel_the_smith
Location: Silicon Valley
Has thanked: 152 times
Been thanked: 330 times
Contact:

Re: Google Translate

Post by daniel_the_smith »

Yeah, crossing major language family boundaries really kills it. There was a hilarious site I was trying to find again that would translate back and forth between English and Japanese until it got a stable cycle, but I can't remember what it was. :(
That which can be destroyed by the truth should be.
--
My (sadly neglected, but not forgotten) project: http://dailyjoseki.com
User avatar
prokofiev
Lives with ko
Posts: 223
Joined: Tue Apr 27, 2010 8:03 pm
Rank: decent sdk
GD Posts: 138
Has thanked: 67 times
Been thanked: 10 times

Re: Google Translate

Post by prokofiev »

daniel_the_smith wrote:Yeah, crossing major language family boundaries really kills it. There was a hilarious site I was trying to find again that would translate back and forth between English and Japanese until it got a stable cycle, but I can't remember what it was. :(


Here's the link

I gave it the "original" expression above, and it went 25 times there and back before claiming "It is doubtful that this phrase will ever reach equilibrium."

Here's the result of 25 times there & back:

25 times through google translate wrote:> Japanese - -> Czech Republic -> Korea -> Basque Country -> China -> I'm being used by Google (half a dozen different languages, including translation into Engish ,,,,,,, all partners said before in other places). Morning is to ask me. "M is a word I do not have the disease in the past, I have your partner, they have it, I mean life, or I suspect he more Wakaranakatsu .
hyperpape
Tengen
Posts: 4382
Joined: Thu May 06, 2010 3:24 pm
Rank: AGA 3k
GD Posts: 65
OGS: Hyperpape 4k
Location: Caldas da Rainha, Portugal
Has thanked: 499 times
Been thanked: 727 times

Re: Google Translate

Post by hyperpape »

Yeah, my reason for poking around was the thought that all the entropy was coming from the Japanese, Chinese and Basque translations. Though I really should've checked the Czech in that case.
Marcus
Gosei
Posts: 1387
Joined: Tue Apr 20, 2010 8:51 am
GD Posts: 209
KGS: Marcus316
Has thanked: 139 times
Been thanked: 111 times

Re: Google Translate

Post by Marcus »

Bah, I tried some longer sentences with a few complicated words, and got minimal loss of meaning. Then I tried "Don't fear the reaper":

http://translationparty.com/#9998014
http://translationparty.com/#9998017

Getting the opposite meaning in such a simple sentence is mind boggling. :)

If you translate "Don't fear me", you get equilibrium at "I am not scared" ... Google isn't afraid of me!
User avatar
daniel_the_smith
Gosei
Posts: 2116
Joined: Wed Apr 21, 2010 8:51 am
Rank: 2d AGA
GD Posts: 1193
KGS: lavalamp
Tygem: imapenguin
IGS: lavalamp
OGS: daniel_the_smith
Location: Silicon Valley
Has thanked: 152 times
Been thanked: 330 times
Contact:

Re: Google Translate

Post by daniel_the_smith »

prokofiev wrote:
daniel_the_smith wrote:Yeah, crossing major language family boundaries really kills it. There was a hilarious site I was trying to find again that would translate back and forth between English and Japanese until it got a stable cycle, but I can't remember what it was. :(


Here's the link


Awesome! That's what I was looking for!
That which can be destroyed by the truth should be.
--
My (sadly neglected, but not forgotten) project: http://dailyjoseki.com
hyperpape
Tengen
Posts: 4382
Joined: Thu May 06, 2010 3:24 pm
Rank: AGA 3k
GD Posts: 65
OGS: Hyperpape 4k
Location: Caldas da Rainha, Portugal
Has thanked: 499 times
Been thanked: 727 times

Re: Google Translate

Post by hyperpape »

I know Google Translate works statistically, by analyzing large corpora, but does anyone know more about how it works? Does it have some sort of underlying/universal grammar that it translates the text into and then uses that to translate it into the target language? (If it did, you'd have the amusing possibility of trying to translate English into English).
tapir
Lives in sente
Posts: 774
Joined: Fri Apr 23, 2010 5:52 pm
GD Posts: 0
Has thanked: 137 times
Been thanked: 155 times
Contact:

Re: Google Translate

Post by tapir »

People, learn languages. It is more rewarding than google translate :)
User avatar
jts
Oza
Posts: 2662
Joined: Sat Sep 18, 2010 4:17 pm
Rank: kgs 6k
GD Posts: 0
Has thanked: 310 times
Been thanked: 632 times

Re: Google Translate

Post by jts »

hyperpape wrote:I know Google Translate works statistically, by analyzing large corpora, but does anyone know more about how it works? Does it have some sort of underlying/universal grammar that it translates the text into and then uses that to translate it into the target language? (If it did, you'd have the amusing possibility of trying to translate English into English).

http://translate.google.com/about/index.html

I believe the reason that the European languages work so well is that the EU produces reams and reams of mega-multi-lingual documents.
hyperpape
Tengen
Posts: 4382
Joined: Thu May 06, 2010 3:24 pm
Rank: AGA 3k
GD Posts: 65
OGS: Hyperpape 4k
Location: Caldas da Rainha, Portugal
Has thanked: 499 times
Been thanked: 727 times

Re: Google Translate

Post by hyperpape »

Unfortunately, that front page doesn't answer my question, as far as I can tell (I probably should have put "grammar" in scare quotes in my question. I just meant some kind of representation that isn't part of any natural language but can be used to translate into any natural language. I bet something in the links would say.

However, elsewhere, I see this: http://en.wikipedia.org/wiki/Google_Tra ... ethodology
User avatar
perceval
Lives in gote
Posts: 312
Joined: Thu Aug 05, 2010 3:35 am
Rank: 7K KGS
GD Posts: 0
KGS: tictac
Has thanked: 52 times
Been thanked: 41 times

Re: Google Translate

Post by perceval »

i worked in automatic software tranlsation before (we were doing google translate before they did it themselves, and babelfish).
and we had a collections of the worst nonsenses. amongst the worst was "fan" translated by default as "ventilator" instead of groupie, and nut translated as in "bolt nut" instead as in nut the fruit.
also, in french "My name is bond, james bond" was translated as "Mon nom est esclavage, lien de james" which might get retranslated as "my name is slavery, chain of james"
also they was a mistake in our arabic dict so that lebanon was translated as yogourt :oops: (well there were other mistake but that was one of the funniest)

i think those errors are still on the internet in quite some places because at the time we were the only ones and some site translated their content with bablefish. At start google even reproduced some of them because they automatically extract translation form bilingual sites, and some of them where in fact automaticala translated.

as for multi translation, for the more rare language pairs google uses a pivot language (typically english) so the back anf forth effect is less than if you go through a truly weak kanguage pair ie
if you translate swedish-vietnamese it will do swedish->english->vietnamese (not sure about swedish vietnamese just an example)
In theory, there is no difference between theory and practice. In practice, there is.
tj86430
Gosei
Posts: 1348
Joined: Wed Apr 28, 2010 12:42 am
Rank: FGA 7k GoR 1297
GD Posts: 0
Location: Finland
Has thanked: 49 times
Been thanked: 129 times

Re: Google Translate

Post by tj86430 »

jts wrote:
I believe the reason that the European languages work so well is that the EU produces reams and reams of mega-multi-lingual documents.

Most European languages have similar grammar and are "easy" to translate more or less successfully. However, e.g. Finnish is entirely different from other European languages (except Estonian and Hungarian), and the translation doesn't work at all well. I suspect the same holds for Estonian and Hungarian too. I have never tried translating to/from those languages (and in most cases I wouldn't know if it worked well or not)
Offending ad removed
hyperpape
Tengen
Posts: 4382
Joined: Thu May 06, 2010 3:24 pm
Rank: AGA 3k
GD Posts: 65
OGS: Hyperpape 4k
Location: Caldas da Rainha, Portugal
Has thanked: 499 times
Been thanked: 727 times

Re: Google Translate

Post by hyperpape »

You can judge the accuracy of a translation by adding it to a cycle. So instead of English-French-English, you do English-French-Estonian-English (or English-Estonian-French-English).
User avatar
perceval
Lives in gote
Posts: 312
Joined: Thu Aug 05, 2010 3:35 am
Rank: 7K KGS
GD Posts: 0
KGS: tictac
Has thanked: 52 times
Been thanked: 41 times

Re: Google Translate

Post by perceval »

hyperpape wrote:I know Google Translate works statistically, by analyzing large corpora, but does anyone know more about how it works? Does it have some sort of underlying/universal grammar that it translates the text into and then uses that to translate it into the target language? (If it did, you'd have the amusing possibility of trying to translate English into English).


Last time i heard they were using vanilla prhase base translation, but with huuuge language models;
In those models there is no notion of grammar at all, you just have a dictionnary of sentence fragment where punctuations like. or , is treated just like grammatical words or noun .+ a "language model" which give the probability that a given sentence is good in the target language, and you run a minimizazion algo to find the les costly translation

an excellent site for explanations is http://statmt.org/

They might be doing more advanced stuff now, but at the start at least it was just the method described in this site with enormous compting power.
Those algos are notoriously bad to moves words far in a sentence (becasue to reduce computation you would typically hard-code a limit to the number of slots a translated word might move ), a big problem for example with german when you need to boot the verb at the end., So they might do something extra to fix that.

as for english-english translation we actually dit that to correct our rule base models:
you have a non statistical engine that gives you an unperfect translation, and you train a statistical model to translate from "machine translated" english to more human english. It works well in case were you want to translate text from a narrow domain with little bilinugal corpus available / the rule base translation deos the grunt work with domain specific dictionnaries and you polish the language statistically

The "universal grammar" is still a theorist dream as far as i understand. Some research try to statiscally construct a syntax tree in the source language and then transform it into the target language but i don't think that google does that (not sure though)
In theory, there is no difference between theory and practice. In practice, there is.
Post Reply