
9 Oct

sergeyrybkin Sergey Rybkin RT @JoelAVB: KV: Asia Online MT approach is to focus on data quality not data volume #mt #tausuc11 #xl8 7 Oct

sustarling Susan Starling RT @JoelAVB: Converser not yet perfect, main issue: speech recognition of Spanish speakers #tausuc11 #mt <So it's not just me! ;) 7 Oct

sustarling Susan Starling RT @kvashee: Istvan: TM should not be more important than terminology #tausuc11 Quality is linked to terminology <TMs outdated after 2-3yrs anyway 7 Oct

GermanENTrans Kevin Lossner RT: @kvashee: #memoQ will use open, published edit distance algorithm [useful to establish] better MT post-editing pricing #tausuc11 #xl8 7 Oct

GermanENTrans Kevin Lossner RT: @kvashee: Istvan: On MT output - untouched segment is a good segment - [ #memoQ ] will provide stats on changes #tausuc11 #xl8 7 Oct

manuelhrrnz manuel herranz RT @kvashee: Istvan: We need a better way to figure out the productivity - MemoQ introduced version control to help this #tausuc11 7 Oct

glucardi Gustavo Lucardi #tausuc11 ended, a weekend of work, and #LWSV starts! 7 Oct

AVBTranslations AVBTranslations TAUS User conference 2011 is over. Another brilliant & inspiring two days. Will be back next year. Thanks to Jaap and his team! #tausuc11 7 Oct

JoelAVB Joël Sigling TAUS User conference 2011 is over. Another brilliant & inspiring two days. Will be back next year. Thanks to Jaap and his team! #tausuc11 7 Oct

alexpysaryuk Oleksandr Pysaryuk RT @kvashee: Istvan: TM should not be more important than terminology #tausuc11 Quality is linked to terminology 7 Oct

kvashee K Vashee MemoQ will use open and published edit distance algorithm that could be helpful in establishing better pricing for MT postediting #tausuc11 7 Oct

kvashee K Vashee Istvan showing edit distance analysis and explaining algorithm #tausuc11 7 Oct
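The tweets above mention an open, published edit distance algorithm as a basis for post-editing pricing, but they do not say which algorithm memoQ uses. As a rough illustration only, the sketch below assumes a plain word-level Levenshtein distance, normalised by the longer segment. The function names `edit_distance` and `post_edit_effort` are hypothetical and do not come from any tool named in this feed.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (token lists or strings)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def post_edit_effort(mt_output, post_edited):
    """Share of words changed between raw MT and the post-edited text (0.0 to 1.0).

    Assumption: whitespace tokenisation and normalisation by the longer
    segment. A real pricing scheme may weight edits differently.
    """
    mt_tokens = mt_output.split()
    pe_tokens = post_edited.split()
    if not mt_tokens and not pe_tokens:
        return 0.0
    return edit_distance(mt_tokens, pe_tokens) / max(len(mt_tokens), len(pe_tokens))
```

Under this sketch an untouched segment scores 0.0, which matches the "untouched segment is a good segment" remark further down the feed.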

traducaoviaval Tradução via Val RT @kvashee: Istvan: We need a better way to figure out the productivity - MemoQ introduced version control to help this #tausuc11 7 Oct

kvashee K Vashee Istvan: On MT output - untouched segment is a good segment - they will provide stats on changes #tausuc11 7 Oct

kvashee K Vashee Istvan: We need a better way to figure out the productivity - MemoQ introduced version control to help this #tausuc11 7 Oct

traducaoviaval Tradução via Val RT @kvashee: Istvan: TM should not be more important than terminology #tausuc11 Quality is linked to terminology 7 Oct

kvashee K Vashee Istvan: Every vendor gets different matching and it is crazy to base your business model on this #tausuc11 NOT translator productivity 7 Oct

kvashee K Vashee Istvan: TM should not be more important than terminology #tausuc11 Quality is linked to terminology 7 Oct

kvashee K Vashee Istvan thinks TM focus is short-sighted = loss of data and asks who is interested in sub-segment leveraging #tausuc11 7 Oct

kvashee K Vashee Istvan introducing Kilgray to group - great recent success and acclaimed by both vendors and translators #tausuc11 7 Oct

JoelAVB Joël Sigling Hundreds of mostly non-professional translators participating in Adobe community translation for free! #tausuc11 #communitytranslation 7 Oct

kvashee K Vashee Ray reading out enthusiastic comments by volunteer translators who wish to share info with countrymen #tausuc11 7 Oct

kvashee K Vashee Adobe translators unpaid but are product enthusiasts, personal/professional development or linguistic pride #tausuc11 7 Oct

kvashee K Vashee Ray: Adobe manages translator approvals and quality and links to TDA search #tausuc11 7 Oct

JoelAVB Joël Sigling Adobe partnered with dotSUB for translating parts of AdobeTV Community Translation in 45 languages through #communitytranslation #tausuc11 7 Oct

kvashee K Vashee Adobe seeded some video into 25 languages #tausuc11 586 videos translated using @dotsub platform > 45 languages targeted 7 Oct

kvashee K Vashee Ray: Adobe TV subtitling on training and product overview videos #tausuc11 using @dotsub 7 Oct

JoelAVB Joël Sigling Adobe have used community translation to translate software for the first time (into Dutch!). #tausuc11 #communitytranslation 7 Oct

kvashee K Vashee Ray Flournoy talking about community translation experiment in China & LatAm #tausuc11 #dotSub #lingotek 7 Oct

kvashee K Vashee Vikram: Collaborative technologies to enable your content to be MT'd and community to feedback with corrections - CTF #tausuc11 7 Oct

JoelAVB Joël Sigling Microsoft Translator can be used to machine-translate websites and have them approved by authorized crowd reviewers #mt #tausuc11 7 Oct

JoelAVB Joël Sigling VD: many non-English parents in Seattle areas, schools look for cheap translation of information for parents... #tausuc11 7 Oct

JoelAVB Joël Sigling VD: make sure you give your customers what they want, be flexible and provide the platform they need. Applies to LSPs just as much #tausuc11 7 Oct

JoelAVB Joël Sigling Vikram Dendi from Microsoft talks about Microsoft Translator, a community model for tailored MT #mt #tausuc11 7 Oct

kvashee K Vashee Vikram from MSFT : Collaboration is really changing how things are done e.g. search provides continuous feedback #tausuc11 7 Oct

kvashee K Vashee Will post-editing eventually become unnecessary? MSFT says no but will be less in major LPs #tausuc11 7 Oct

kvashee K Vashee MT will go to mobile platforms and closer to text analytics #tausuc11 7 Oct

kvashee K Vashee Panel: MT will improve and really be compelling in some languages but HT will be needed to steer #tausuc11 7 Oct

kvashee K Vashee Alon: Some MT will get better than HT, some LPs, but in other languages it is much harder #tausuc11 7 Oct

kvashee K Vashee Marcu: Academia will not lead MT initiatives - Industry more likely to lead new areas of implementation #tausuc11 7 Oct

kvashee K Vashee Alon: Much more funding going to MT funding in EU, much less in US #tausuc11 7 Oct

kvashee K Vashee Declan: CNGL sees more crowdsourcing and growing importance of user generated content #tausuc11 7 Oct

JoelAVB Joël Sigling TM/MT Recommender helps to choose between TM and MT. User can set threshold to choose automatically by confidence score #mt #tausuc11 7 Oct

kvashee K Vashee Declan: Cool example of how TM/MT confidence scores can help translators work faster #tausuc11 7 Oct
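The two tweets above describe a recommender that picks between a TM match and MT output by confidence score, with a user-set threshold for automatic selection. The feed gives no detail on how CNGL computes those scores, so the sketch below simply assumes each candidate already carries a score between 0 and 1. `Candidate`, `recommend`, and the default threshold of 0.75 are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Candidate:
    origin: str        # "TM" or "MT"
    text: str
    confidence: float  # assumed to be a score in [0, 1]


def recommend(tm: Candidate, mt: Candidate, threshold: float = 0.75) -> Optional[Candidate]:
    """Pick the higher-scoring candidate automatically if it clears the threshold.

    Returns None when neither candidate is confident enough, which leaves
    the choice to the translator. Ties go to the TM candidate.
    """
    best = max((tm, mt), key=lambda c: c.confidence)
    return best if best.confidence >= threshold else None
```

Raising the threshold makes the tool more cautious: fewer segments are chosen automatically and more are left for the translator to decide.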

JoelAVB Joël Sigling Another useful tool from CNGL: automatic domain classification. Helps MT quality in some instances... #mt #tausuc11 7 Oct

kvashee K Vashee Declan - Domain adaptation techniques help with user generated content #tausuc11 7 Oct

kvashee K Vashee Declan giving examples of how training on children's literature will not result in a good business translation engine #tausuc11 7 Oct

JoelAVB Joël Sigling CNGL have developed data cleaning tools for MT pre-processing. Useful for medium-sized corpora #mt #tausuc11 7 Oct

kvashee K Vashee Declan: They do rough training and then translate back to see where the biggest problems are and fix data for re-train #tausuc11 7 Oct

kvashee K Vashee Declan finally states the quality of the system is dependent on the quality of training data #tausuc11 Even corp TM is ?>?> 7 Oct

kvashee K Vashee Declan: Hybrid research and adding linguistic concepts to SMT - MaTrEx framework for all MT research #tausuc11 7 Oct

JoelAVB Joël Sigling Chris Wendt suggests companies should spend part of their localization budgets on gathering data for the smaller languages #tausuc11 7 Oct

kvashee K Vashee Declan on MT at CNGL Center for Next Generation Localization #tausuc11 Broad scope including digital content mgmt 7 Oct

JoelAVB Joël Sigling Great example of how technology can help in language learning by Chris Wendt too. Again very good automatic speech quality #tausuc11 #mt 7 Oct

kvashee K Vashee Chris: Asking crowd to help with "minor" language data acquisition #tausuc11 7 Oct

JoelAVB Joël Sigling Fascinating example on Youtube of phone translation using MT with excellent speech output. #tausuc11 7 Oct

kvashee K Vashee @jeromobot They seem to completely miss the -ve historical significance -- Kinda Like celebrating Nicolae_Ceaușescu #tausuc11 7 Oct

kvashee K Vashee Chris: Many more linguistic analysis tools http:// labs.microsofttranslator.com/thesauraus #tausuc11 7 Oct

JoelAVB Joël Sigling CW: need to promote MT for smaller languages #mt #tausuc11 7 Oct

kvashee K Vashee Chris: MT in new applications > showing EN-US to EN-UK examples #tausuc11 and Thesaurus application 7 Oct

JoelAVB Joël Sigling Chris Wendt from MS on MT future: better quality, + languages, easy access, better integration, collaborative architectures #mt #tausuc11 7 Oct

kvashee K Vashee Chris: Better quality, more languages, easier access and integration and collaborative architectures #tausuc11 7 Oct

kvashee K Vashee Chris Wendt of MSR on Where is MT going next ? #tausuc11 7 Oct

garyy626 Gary Yuen TAUS Conference 2011 Discussion notes on interoperability File from bit.ly Editable by anyone with link #tausuc11 7 Oct

jfxstar Joseph Starnes Wendt: Grammarless MT system works best for Real time chat #tausuc11 7 Oct

peterrey Peter Reynolds RT @ctatwork: Converser uses DNS as a speech-to-text solution. Was Charlie Weasley consulted.? #tausuc11 > What about house elves? 7 Oct

JoelAVB Joël Sigling Converser not yet perfect, main issue: speech recognition of Spanish speakers, others: quality & speed of spoken translations #tausuc11 #mt 7 Oct

ctatwork Chase Tingley Converser uses Dragon Naturally Speaking as a speech-to-text solution. No word if Charlie Weasley was consulted. #tausuc11 7 Oct

ctatwork Chase Tingley Spoken Translations is deploying their Converser tool for a customer who, for now, is known only as Voldemort. #tausuc11 7 Oct

JoelAVB Joël Sigling Feedback used to allow for translation evaluation in Converser for Healthcare which makes broad area of application possible #tausuc11 7 Oct

JoelAVB Joël Sigling Spoken Translations presents Converser for Healthcare, automated multimedia translation for healthcare organisation #mt #tausuc11 7 Oct

appliedlanguage appliedlanguage What a difference a year makes. Pace of MT development and adoption into workflows since last year's conference is impressive #tausuc11 7 Oct

JoelAVB Joël Sigling KV shows demo of multilingual chat solution for companies focused on Thai, Vietnamese and Indonesian #tausuc11 7 Oct

JoelAVB Joël Sigling Other interesting MS application, MT of global feedback of new MS Office software #tausuc11 #mt 7 Oct

Jeromobot Jeromobot .@Kilgraymemoq: XLIFF is a problem child. To make it work, you need to accept compromises, be realistic. #tausuc11 <or produce simple XLIFF> 7 Oct

Jeromobot Jeromobot RT @Kilgraymemoq: The coffee is really horrendous. #tausuc11 <welcome to the US> 7 Oct

dotSUB dotSUB Focus on quality, turnaround time & cost; initially Machine Translation is more expensive than humans because 100% review. @kvashee #tausuc11 7 Oct

JoelAVB Joël Sigling Chris Wendt from MS: opportunities for MT of instant messaging and tweets, gives very fast notifications eg #tausuc11 7 Oct

kvashee K Vashee CW: Twitter can help you understand what customers say about you #tausuc11 7 Oct

kvashee K Vashee Chris Wendt - MSFT on IM and Chat translation - messy source data #tausuc11 7 Oct

JoelAVB Joël Sigling Results for Spanish chat MT better than Chinese, as expected. Some good results for Spanish #mt #tausuc11 7 Oct

JoelAVB Joël Sigling WB: chat translation has very specific linguistic challenges (abbreviations, spelling). MT led to greater awareness in chat authoring #tausuc11 7 Oct

kvashee K Vashee WB: Generic MT did better on idiomatic but trained did better on technical stuff #tausuc11 ES= +ve ZH= -ve user feedback 7 Oct

appliedlanguage appliedlanguage Re: TM, SmartMATE includes file filtering (MS Office, xliff, txt, xml, html) + TM fuzzy matching + TM rebuild after editing #tausuc11 7 Oct

kvashee K Vashee WB: ES was good but ZH was not as good in quality evaluations of response #tausuc11 7 Oct

kvashee K Vashee WB: Not enough corpora available for chat training so used monolingual - chat data is messy #tausuc11 7 Oct

JoelAVB Joël Sigling Will Burgett, Intel: chat preferred method of communication for many customers, has to be available in local languages, costly! #tausuc11 7 Oct

kvashee K Vashee Will Burgett of Intel - Chat for support experiment overview #tausuc11 explaining why it is important 7 Oct

JoelAVB Joël Sigling TDA data important in the success of the PayPal MT Nordic localization project #TDA #tausuc11 7 Oct

appliedlanguage appliedlanguage Andy Way ran out of time but worth adding that MT quality is addressed with data cleaning, tokenisation, casing, URL handling, etc #tausuc11 7 Oct
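The tweet above lists data cleaning, tokenisation, casing and URL handling as the steps that address MT quality, without describing how SmartMATE performs them. The sketch below is one simple way such a pre-processing pass could look, assuming regex tokenisation, lowercasing, and replacing URLs with a placeholder. The name `clean_segment` and the `__URL__` placeholder are hypothetical.

```python
import re

URL_RE = re.compile(r"https?://\S+")      # note: also swallows punctuation glued to a URL
TOKEN_RE = re.compile(r"\w+|[^\w\s]")     # words, or single punctuation marks


def clean_segment(text, url_placeholder="__URL__"):
    """Normalise one training segment: mask URLs, tokenise, lowercase."""
    text = URL_RE.sub(url_placeholder, text.strip())
    tokens = TOKEN_RE.findall(text)
    # Keep the placeholder intact; lowercase everything else.
    return " ".join(tok if tok == url_placeholder else tok.lower() for tok in tokens)
```

For example, `clean_segment("Visit http://example.com/docs for Details!")` returns `visit __URL__ for details !`. Production pipelines usually keep casing information separately (truecasing) rather than discarding it, so treat the lowercasing here as a simplification.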

JoelAVB Joël Sigling Olga from Welocalize: translators accepted post-editing task more easily when they realised the translation was well-prepared #tausuc11 7 Oct

JoelAVB Joël Sigling Alon Lavie from Safaba explains how they use Moses for both MT and Automated Post-Editing using PayPal data. Successful for Paypal #tausuc11 7 Oct

rhotulos Johann Roturier RT @kvashee reducing training time - is that the wrong problem? #tausuc11 > retraining must be instantaneous for a better PE experience 7 Oct

ctatwork Chase Tingley VERY incomplete list, but Microsoft, Safaba, ProMT, Pangeanic, CNGL, and others had people involved. Really good discussion. #tausuc11 7 Oct

StoecklM Manuel Stoeckl @Kilgraymemoq Yes 4 LSPs. 1st client list needs evaluation > suitable candidates >Need 2 start training 1 staff > get posteditors #tausuc11 7 Oct

ctatwork Chase Tingley Also, thanks to everyone who contributed to the great "towards a common MT API" open spaces session yesterday. #tausuc11 7 Oct

ctatwork Chase Tingley Until yesterday, I didn't realize how many MT engines handled inline tagging. #tausuc11 7 Oct

JoelAVB Joël Sigling ProMT created customised MT environment for PayPal, based around WorldServer, <70% matches go to MT, rest is human #tausuc11 7 Oct
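The routing rule in the tweet above (segments whose best TM match is below 70% go to MT, the rest go to human translators) can be sketched as follows. The tweet does not say how WorldServer scores matches, so this sketch substitutes the similarity ratio from Python's standard `difflib` as a stand-in fuzzy-match score. `best_tm_match` and `route` are hypothetical names.

```python
from difflib import SequenceMatcher


def best_tm_match(segment, tm_sources):
    """Highest similarity (0.0 to 1.0) between the segment and any TM source.

    Assumption: difflib's ratio stands in for a CAT tool's fuzzy-match score.
    """
    return max(
        (SequenceMatcher(None, segment, src).ratio() for src in tm_sources),
        default=0.0,  # an empty TM means no match at all
    )


def route(segment, tm_sources, threshold=0.70):
    """Send low-match segments to MT and the rest to human translation."""
    return "MT" if best_tm_match(segment, tm_sources) < threshold else "human"
```

With this rule an exact TM hit stays with the human workflow, while a segment with no usable match is sent to MT.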

appliedlanguage appliedlanguage RT @ctatwork: Big theme of the day is self-serve, on-demand MT training. LSPs are jumping into this space in a big way. #tausuc11 7 Oct

alexpysaryuk Oleksandr Pysaryuk RT @ctatwork: Big theme of the day is self-serve, on-demand MT training. LSPs are jumping into this space in a big way. #tausuc11 7 Oct

sergeyrybkin Sergey Rybkin RT @alexpysaryuk: RT @Kilgraymemoq: Paypal uses Acrolinx for writing - easier to MT that way! #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel Is it profitable to explain to translators/lsps the benefits of using affordable MT or will this remain with buyers? #tausuc11 7 Oct

alexpysaryuk Oleksandr Pysaryuk RT @Kilgraymemoq: Paypal uses Acrolinx for writing - easier to MT that way! #tausuc11 7 Oct

ctatwork Chase Tingley Wayne Bourland from Dell notes the dialogue surrounding MT has shifted from price to speed. Good point. #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel I see the emergence of affordable MT engines. But will translators and LSPs start utilizing these instead of Google/MS? #tausuc11 7 Oct

kvashee K Vashee Dell saying that MT offers velocity to translation process #tausuc11 7 Oct

sergeyrybkin Sergey Rybkin RT @kvashee: CD: MT also provided better consistency than human only process - multiple translators = less consistent #tausuc11 7 Oct

ctatwork Chase Tingley Big theme of the day is self-serve, on-demand MT training. LSPs are jumping into this space in a big way. #tausuc11 7 Oct

kvashee K Vashee Andy Way talking about reducing training time - is that the wrong problem? Data quality drives MT quality - How do u address that? #tausuc11 7 Oct

i18n_mug IMUG #tausuc11 attendees may be interested in IMUG's Statistical MT talk by SDL / Language Weaver speakers at Google, 20 Oct: Statistical MT: 10 Years of Innovation and a Look Ahead - IMUG - The *Original* Multilingual Computing User Group (San Jose, CA) - Meetup 7 Oct

trnsltntech Translation Tech RT @kvashee: CD: MT also provided better consistency than human only process - multiple translators = less consistent #tausuc11 7 Oct

trnsltntech Translation Tech RT @kvashee: CD: Machine-aided human translation = better terminology, more time to focus on style and fluency + FASTER #tausuc11 7 Oct

trnsltntech Translation Tech RT @kvashee: CD: PayPal brand is key - voice and tone important - huge amt of rework with vendors #tausuc11 15K words needed back in 3 days 7 Oct

trnsltntech Translation Tech RT @kvashee: Catherine Dove from Paypal > 25 languages and priority = Time to Market and Customer Quality for localized sites #tausuc11 7 Oct

trnsltntech Translation Tech RT @kvashee: DJ: Caterpillar actually got better results on quality measure with MT than with HT in Br.PT and RU #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel Andy Way showing Smartmate #tausuc11 7 Oct

ctatwork Chase Tingley Lots of respect for PayPal l10n. Their use case seems like a total nightmare, and they've made it work. #tausuc11 7 Oct

trnsltntech Translation Tech RT @kvashee: Don Johnson from Caterpillar talking about their use of MT #tausuc11 Volumes increasing too fast so need to outsource 7 Oct

Kilgraymemoq Istvan Lengyel Doing the maths, i have the feeling that Paypal has a big loc team compared to their translation spending. #tausuc11 7 Oct

kvashee K Vashee CD: Has instituted a quality improvement process - constant loop of feedback and improvement #tausuc11 7 Oct

JoelAVB Joël Sigling Paypal does a lot of pre and post-editing to reduce linguistic QC effort/rework, 20% improvement achieved on 8/10 languages #mt #tausuc11 7 Oct

kvashee K Vashee CD uses Edit Distance to measure rework effort for 8 LPs = 20% reduction they find that rework is less #tausuc11 7 Oct

ctatwork Chase Tingley PayPal uses MT to catch i18n glitches in the UI earlier in the release cycle. #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel Paypal uses a Review distance score. - this is going to be the hot topic soon! #tausuc11 7 Oct

kvashee K Vashee CD: Website is changing all the time = 1.5 million words on Paypal #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel The Paypal site has 1.5 million words. Amazing! #tausuc11 7 Oct

JoelAVB Joël Sigling Caterpillar finds MT produces more consistent quality than humans, especially in terminology #mt #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel Paypal uses Acrolinx for writing - easier to MT that way! #tausuc11 7 Oct

kvashee K Vashee CD: MT used both upstream (authoring corrections) and also downstream and monitor efficiency #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel Paypal is the only company I saw with relatively little content using MT - they do 15000+ words every two weeks. #tausuc11 7 Oct

sergeyrybkin Sergey Rybkin RT @kvashee: CD: Machine-aided human translation = better terminology, more time to focus on style and fluency + FASTER #tausuc11 7 Oct

kvashee K Vashee CD: MT also provided better consistency than human only process - multiple translators = less consistent #tausuc11 7 Oct

JoelAVB Joël Sigling Paypal needs very quick turnaround, hence need for Human-Assisted MT. MT delivers better quality, it is not distracted by eg tags #tausuc11 7 Oct

ctatwork Chase Tingley Some Paypal source segments have 77 tags. Oof. #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel Catherine Dove, Paypal: for segments with 77 tags, MT is better than human. --- and also more eager to work on it :) #tausuc11 7 Oct

kvashee K Vashee CD: MT does tag handling better than HT - some segments have 77 tags = tough for humans #tausuc11 7 Oct

kvashee K Vashee CD: Machine-aided human translation = better terminology, more time to focus on style and fluency + FASTER #tausuc11 7 Oct

kvashee K Vashee CD: PayPal brand is key - voice and tone important - huge amt of rework with vendors #tausuc11 15K words needed back in 3 days 7 Oct

Kilgraymemoq Istvan Lengyel At Paypal each country has a different English content! #tausuc11 7 Oct

JoelAVB Joël Sigling Paypal 2011 localization priorities: Time to market due to simship policy & Customer quality, should be better than human quality #tausuc11 7 Oct

T21Century Rahzeb Choudhury RT @kvashee: Catherine Dove from Paypal > 25 languages and priority = Time to Market and Customer Quality for localized sites #tausuc11 7 Oct

kvashee K Vashee CD: "Human quality was not good enough for Paypal" - too much rework required and deadlines that were too tight #tausuc11 15K new words/week 7 Oct

sergeyrybkin Sergey Rybkin RT @Kilgraymemoq: Don Johnson, Caterpillar: MT quality was on average higher than human translation quality for Russian & PT-BR - #tausuc11 7 Oct

Kilgraymemoq Istvan Lengyel For Paypal, human quality was not enough. #tausuc11 (For tagged content with variables.) 7 Oct

kvashee K Vashee Catherine Dove from Paypal > 25 languages and priority = Time to Market and Customer Quality for localized sites #tausuc11 7 Oct

JoelAVB Joël Sigling MT to human translation quality comparison for Brazilian and Russian shows better quality for both with MT #tausuc11 #mt 7 Oct

Kilgraymemoq Istvan Lengyel Don Johnson, Caterpillar: MT quality was on average higher than human translation quality for Russian and PT-BR - wow! #tausuc11 7 Oct

kvashee K Vashee DJ: Will expand to new languages and new kinds of content after process is in place #tausuc11 7 Oct

sergeyrybkin Sergey Rybkin RT @kvashee: Andy Way : "The quality of MT will never be perfect - it's just a tool" #tausuc11 7 Oct

kvashee K Vashee DJ: Caterpillar actually got better results on quality measure with MT than with HT in Br.PT and RU #tausuc11 7 Oct

JoelAVB Joël Sigling Caterpillar putting a lot of effort into structuring authoring effort. Demand for translation is rising sharply #tausuc11 7 Oct

kvashee K Vashee DJ: Focus on Quality, Turnaround time and cost - initially MT is more expensive because 100% review #tausuc11 7 Oct

kvashee K Vashee DJ: Technical documents are the core of our business #tausuc11 Authors write in "Caterpillar English" 7 Oct

kvashee K Vashee Don Johnson from Caterpillar talking about their use of MT #tausuc11 Volumes increasing too fast so need to outsource 7 Oct

kvashee K Vashee Way: MT can significantly reduce the cost of large scale translation #tausuc11 7 Oct

kvashee K Vashee Andy Way : "The quality of MT will never be perfect - it's just a tool" #tausuc11 7 Oct

sergeyrybkin Sergey Rybkin RT @ABBYY_USA: Are you attending TAUS User Conference 2011 today? Check us out: ABBYY to Present on Machine Translation Breakthrough at TAUS User Conference 2011 -- MILPITAS, Calif., Oct. 6, 2011 /PRNewswire/ -#tausuc11 7 Oct

sergeyrybkin Sergey Rybkin RT @kvashee: Marcu: MT needs "highly qualified" people to use things like Moses effectively #tausuc11 7 Oct

mrkpl Mirko Plitt "@kvashee: Marcu: MT needs "highly qualified" people to use things like Moses effectively #tausuc11"< Oh, I like that :-) 7 Oct

kvashee K Vashee Jaap: Instant MT training (SMT) is getting hot #tausuc11 7 Oct

JoelAVB Joël Sigling Very important to SDL and their customers to decide if it's worthwhile to create an engine or not. #tausuc11 7 Oct

kvashee K Vashee Marcu focused on easy training but what if data is not ready - what is the point? GIGO efficiency? #tausuc11 7 Oct

kvashee K Vashee Marcu: has learnt that efficiency is important and wants on-button to no-button training but what if data is not good? #tausuc11 7 Oct

JoelAVB Joël Sigling SDL have established some rules for predicting if it is useful to develop engines based on language combination & volume #tausuc11 7 Oct

kvashee K Vashee Marcu: Some language combinations are easier than others #tausuc11 DE is tough 7 Oct

kvashee K Vashee Marcu: MT can accelerate HT documentation production processes but TU, HU and AR not so successful #tausuc11 7 Oct

JoelAVB Joël Sigling SDL has a very easy to use inhouse engine building system, based on LanguageWeaver apparently... #tausuc11 7 Oct

JoelAVB Joël Sigling SDL have already developed over 10,000 MT engines this year, both for production and research platforms #tausuc11 7 Oct

JoelAVB Joël Sigling Daniel Marcu from SDL says it is necessary to develop MT engines which are very easy to train. #tausuc11 7 Oct

kvashee K Vashee Marcu: SDL built 9800+ engines in 2011 so far -- #tausuc11 but very few actually used = "not put into production" 7 Oct

kvashee K Vashee Marcu: The number of people who know how to customize MT (SMT) is minuscule #tausuc11 7 Oct

kvashee K Vashee Marcu: MT needs "highly qualified" people to use things like Moses effectively #tausuc11 7 Oct

kvashee K Vashee Marcu drawing comparisons of MT with speech technology evolution #tausuc11 7 Oct

JoelAVB Joël Sigling ABBYY presentation on their Compreno engine seems to have impressed many in the expert audience #tausuc11 7 Oct

kvashee K Vashee Daniel Marcu on Letting a 1000 MT systems bloom - (copied from Mao?) #tausuc11 7 Oct

kvashee K Vashee Alon Lavie says many ideas are similar to RbMT and SMT research but sees challenges in semantic approach #tausuc11 Good luck 7 Oct

kvashee K Vashee Rose asking if it is not really a modified rule based approach #tausuc11 7 Oct

kvashee K Vashee Abbyy spent 15 years developing this with 200 people and are conducting pilots #tausuc11 7 Oct

JoelAVB Joël Sigling Big advantage of ABBYY Compreno is that it can process natural language and does not need simplified language for good results #tausuc11 7 Oct

alexpysaryuk Oleksandr Pysaryuk RT @kvashee: Abbyy overviewing different MT approaches says their approach is different = "model based" MT #tausuc11 MBMT 7 Oct

sergeyrybkin Sergey Rybkin RT @kvashee: Abbyy overviewing different MT approaches says their approach is different = "model based" MT #tausuc11 MBMT 7 Oct

kvashee K Vashee Abbyy showing how they analyze "I saw my saw" in their parser #tausuc11 7 Oct

kvashee K Vashee Abbyy claims deep semantic knowledge and integration #tausuc11 7 Oct

kvashee K Vashee Abbyy approach involves morphological and syntactic analysis and claims semantic knowledge #tausuc11 7 Oct

kvashee K Vashee Abbyy overviewing different MT approaches says their approach is different = "model based" MT #tausuc11 MBMT 7 Oct

kvashee K Vashee RT @i18n_mug: ABBYY to Present on Machine #Translation at TAUS User Conference 2011 #tausuc11 7 Oct

JoelAVB Joël Sigling Second day of TAUS User Conference 2011 on its way... #tausuc11 7 Oct

i18n_mug IMUG ABBYY to Present on Machine #Translation Breakthrough at TAUS User Conference 2011 today at 9:15 AM ABBYY to Present on Machine Translation Breakthrough at TAUS User Conference 2011 - MarketWatch #tausuc11 @ABBYY_USA 7 Oct

Tesstranslates Tess Whitty RT @multilingualmag: Follow the TAUS User Conference at #tausuc11 TAUS User Conference 2011 | Conferences | TAUS - Enabling better translation #xl8 #machinetranslation 7 Oct

ctaube2011 Christian Taube @Kilgraymemoq Try the Logrus TMX tool and let me know how it works: TMX Editor #tausuc11 <Worth a very close look, thnks, #Logrus> 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11: Greg Oxton from CSI - "open space" on interoperability. CSI focuses on providing exceptional customer experiences 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Fred Hollowood - presented SymEval (115 users) that measures translation quality both MT & Human 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Moses doesn't handle in-line formatting, so he collaborated with Openoffice and OKAPI framework 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Achim Ruopp: Moses for L10n on real-world MT integration 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Logrus supports TMX and RegEx 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Logrus tool splits file by LP, splits files into smaller fragments and strong search & replace capability 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Logrus created tool to make TDA data usable for their own Moses use 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Logrus is offering a TMX Editor to help with Moses use as current market tools not good 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Adobe will make these tools available to community 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Adobe tools help simplify Moses engine training runs and BLEU scoring tool 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11: Jeff Rueppel - Adobe presenting tools for supporting Moses use: Corpus cleaning, Engine training, Engine testing 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: TAUS labs on interoperability to develop a reference use case 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: BW: How do we pull standards together? OAXAL helps by encouraging Open standards and provides SOA for XML 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Bob Willans on OAXAL. Too many standards in the L10n business makes life complicated 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: AP: XLIFF is too complicated and does not have clarity on "processing expectations" 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: This is almost the opposite of the previous presentation where simplicity was the goal 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: AP: commented on lack of clear syntax and understanding on key issues 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: AP: The goal - Can a simple implementor use it? 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Andrew Pimlott of WeLocalize on XLIFF 2.0 - commented on initial lack of success 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: BM: Release early and release often 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11: BM advocates a simple REST oriented interface - API only defines how systems communicate - only request and fetch tr-s 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: BM: Start with CGI, use JSON not XML in responses - don't over-design - ask "Can this be simpler?" 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: BM: Twitter API started very simply = 1000's of applications were developed quickly. A model for xl8 API 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: BM: Don't let perfect be the enemy of good. time spent on envisioning every translation use scenario 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: BM advocates a minimalist ad-hoc approach to interoperability similar to Twitter API 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11: BM: Average developer isn't interested in finding out about tr. black box technology (SDL) which hamper interoperability 7 Oct
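The BM tweets above argue for a minimal interface with only two operations (request a translation, fetch it) and JSON rather than XML in responses. No concrete API was published in this feed, so the sketch below is purely illustrative: an in-memory stand-in with no HTTP layer, where the class `TranslationService`, its methods, and the JSON field names `sl`, `tl`, `text`, `id`, `status` and `translation` are all assumptions.

```python
import json


class TranslationService:
    """In-memory stand-in for a two-call translation API: request, then fetch."""

    def __init__(self):
        self._jobs = {}
        self._next_id = 1

    def request(self, body: str) -> str:
        """Accept a JSON request and return a JSON receipt with a job id."""
        req = json.loads(body)
        job_id = str(self._next_id)
        self._next_id += 1
        self._jobs[job_id] = {
            "sl": req["sl"],        # source language code (hypothetical field)
            "tl": req["tl"],        # target language code (hypothetical field)
            "text": req["text"],
            "translation": None,    # filled in once a translator or engine delivers
        }
        return json.dumps({"id": job_id, "status": "pending"})

    def deliver(self, job_id: str, translation: str) -> None:
        """Back-office hook: store the finished translation for a job."""
        self._jobs[job_id]["translation"] = translation

    def fetch(self, job_id: str) -> str:
        """Return the job state as JSON, or an error object for unknown ids."""
        job = self._jobs.get(job_id)
        if job is None:
            return json.dumps({"error": "unknown id"})
        status = "pending" if job["translation"] is None else "done"
        return json.dumps({"id": job_id, "status": status, "translation": job["translation"]})
```

Keeping the contract this small means any HTTP framework, or plain CGI as the tweets suggest, could expose `request` and `fetch` as two endpoints without changing the payloads.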

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Brian McConnell: on simple web API to enable translation everywhere on the web 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Jaap: 3 areas of concern: Interoperability, Dynamic Quality Measurement and definition, Open-source MT 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Jaap: Members can influence change in translation landscape 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: TAUS now partnering with academic world and standards agencies, ProZ and CNGL like research organizations 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Jaap: TAUS brings buyers, institutions, LSPs and tr. technology providers together. What about translators? 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Jaap: After LISA died we wanted to help interoperability related issues and this is ongoing 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Jaap: TAUS working on dynamic quality measurement initiative. Still evolving 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Jaap: How do we measure translation quality that can scale across translation usage and current needs? 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Jaap talked about evolution of TAUS on how ideas have evolved and have been implemented 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11: The community that cares about solving language translation problems'll grow beyond the professional tr. industry 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Phillips: The computer does not know meaning but can process data to make "intelligent" deductions 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Dolan: Big Data will help us to solve many linguistic problems that involve semantics 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Alon Lavie: In post-editing we get lots of feedback and this can be useful for improving MT 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11: Dolan: Present choices to user and see what they choose - we need this for MT to improve technology 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Phillips: More translation will be of words and small phrases where MT + crowdsourcing can outperform HT 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Dolan: HT will focus on high ground - MT is far from duplicating what a HT does 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11: Phillips: More content'll go to automation. Kindle = 2000 words UI and MT + crowdsourcing does better than professional 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Phillips: Some kind of content needs professional translation but social media content can work with MT 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Phillips: As technical barriers fall minority languages can create capabilities to make content visible 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Dolan: This data availability situation creates a digital divide for people of smaller languages 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Dolan: MT will get better where there is a lot of data 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Phillips: How do you help people find content that is relevant? Beyond search 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Ruiz: Engagement around content happens in communities = culture + language + values 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Ruiz: Talking about TED video mentioned in eMpTy Pages: The Future of Translation Memory (TM) to show how language analysis is evolving 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Ruiz: Link text, location, topic to find cultural meaning as this is a larger translation perspective 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Ruiz: There are 20 billion likes daily, so we need to ask: is text the key thing that needs translation? 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Link words to concepts to make search and translation more useful 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Dolan: Multilingual search data can provide many clues on how to do MT better - we need to mine this 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Dolan: Modeling the variability of language is similar whether we go EN to EN or to another language 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Dolan: Semantics has not been necessary to make progress with MT contrary to what I originally thought 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Dolan: MT does not like markup - I (MT) want stripped down text and standards 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points #TAUSUC11 RT @kvashee: Ruiz: HTML5 enables publishers to be much freer in creation and transformation processes 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: Phillips: This new infrastructure can deliver publication quality = ready for DTP 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11: HTML5 is translation process friendly + CSS3 + Javascript can handle translation of flowing data across platforms 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: Literary translation not likely to change. Small nuggets that need to be done instantly will increase 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: Phillips: Need to build one application that can go to many platforms = more widget style applications 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: Dolan: I most agree with the huge increase in DATA and the impact it has on so many things 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: Ruiz: Cloud needs to be integrated and tools + processes need to change with this floating content 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: Ruiz: Mega Trends = Big Data and Cloud must fit into language perspectives of future 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: Bill Dolan = Bing Translate, NLP efforts at Microsoft. New UK-US En translator coming soon 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: Jaap about the "non-sponsored" nature of TAUS = more open dialogue and idea sharing 7 Oct

Yulia_TAUS Yulia Korobova, TAUS Key points of #TAUSUC11 RT @kvashee: TAUS User conference started up in Santa Clara with 100+ people who ARE using MT regularly. 7 Oct

sustarling Susan Starling Let myself get distracted last night by highly interesting timeline from #tausuc11, need to make up for it now before things start up again! 7 Oct

Dig_IT_Global Andrew Lawless RT @kvashee: Ruiz: Talking about TED video mentioned in eMpTy Pages: The Future of Translation Memory (TM) to show how language analysis is evolving #tausuc11 6 Oct
