A Community of Developers Devoted to Machine Translation?

Hi, there. I am a translator who just stepped into the programming world a couple of months ago.

I want to combine my translation and interpretation (T&I) experience with programming and hopefully make contributions to the development of machine translation–perhaps a better Google Translate or simultaneous interpretation application.

As a beginner (still), I am learning a bit of everything here and there, such as HTML, JavaScript, Python, and even Swift. Although there is still much to learn, I wonder if there is a specific forum or community where people are working on similar projects? Somewhere I can meet the opposite of me–programmers who already know much about programming but want to understand what translators and interpreters actually do at work? (Note that translation and interpretation are two very different jobs even though they look similar to outsiders.)

I feel that if coders and T&I professionals can talk to each other, we can already achieve so many breakthroughs.

For example, professional translators spend a lot of time going through the tedious type-the-keyword-and-search cycles on certain websites. (Yes, Google and Wikipedia are among the most popular ones, but the sites we look at also depend on the assignments.) Such repetitive tasks can already be programmed so we just need one click and the computer will list all the search results–it is a bit like Google Dictionary (which is gone now), but it will need to be customized for each specific translation task. A translator also needs different help when she has to translate into her B language, usually her second language. In other words, I can already design something to save translators perhaps 50% of their time if not more–IF ONLY I knew more about programming.

Another example is how speech recognition technology can help professional interpreters deal with a “difficult” speaker. However, the machine has to be trained to recognize the speaker’s accent, and preferably the terminology of the scheduled talk. The current speech-to-text technology works well when the user is a NATIVE speaker who talks NORMALLY about a GENERAL topic. But in the real world, a simultaneous interpreter might need to work with a speaker who talks about a very SPECIALIZED topic SUPER FAST in her SECOND (or even third) language. The trick I can think of is, since interpreters sometimes have a chance to do briefing with the speaker, we can record the speaker and use the recording to train the machine. The worst case scenario: we have no chance to talk to the speaker before the actual event, but since simultaneous interpreters usually work in a pair of two, the non-working interpreter can focus on training the computer by editing the errors of speech recognition production so the accuracy of the recognition technology improves steadily as the speaker goes on talking.

I am sure a lot of programmers are interested in machine translation. I have even read some papers published by computer science researchers. Is there an easy way to reach out to them? Like a specific forum or website?

1 Like

Hi, did you find anything? If you did could you share? Thanks!

If you’re still looking, I’m a coding noob that works as a localization specialist for an ecommerce site (EN-US>ES-MX), I’m working on learning how continuous localization would work with our specific content/data set-up (200,000 products getting updates and edits every day, yeesh).

Luckily we have a dev team but they are constantly drawn away to stabilizing web app stuff just on the English side, so I’m trying to learn what I can so that perhaps I can design a l10n flow myself in the future.

Anyway, to answer your question, aside from forums on proz.com, and some subreddits,I can’t think of a group for l10n professionals starting out in coding, though I think that should change!

So, nice to meet you, and I’d love to connect with anybody else interested in this intersection of fields. Anybody have any l10n engineering tips tricks recommendations?