2013년 10월 20일 일요일

Google's robot army learns Spanish

"Our method can translate missing word and phrase entries by learning language structures based on large monolingual data and mapping between languages from small bilingual data," they write. "This method makes little assumption about the languages, so it can be used to extend and refine dictionaries and translation tables for any language pairs.But only a handful of unmanned ground systems were shown,kitchen knives and they were based on technology half a decade old."The system works by visualizing the vectors of individual words, then projecting the vector from the source language to the target language and swapping in the word with that vector representation in that dictionary. 

It is able to work because, the researchers explain, "all common languages share concepts that are grounded in the real world such as that cat is an animal smaller than a dog, there is often a strong similarity between the vector spaces.We'll do this again next month in Europe and one more time for Asia-Pacific at the beginning of November,diamond core bit and then on Nov. 7 the standard will be officially released."Google's technology relies on the Skip-gram or Continuous Bag-of-Words models proposed by Googlers in another, earlier paper, which found that word vectors could be used to infer other words. "For example, vector operations 'king' - 'man' + 'woman' results in a vector that is close to 'queen'."Now, the team has been able to put these models to use to train them to figure out the relationship between different words, and infer the vector representations of a word's counter in another language.And as we get more money and more members, we'll be able to replace donated core barrel with higher-grade, contractor-grade tools. 

These models let Google create neural network models that learn high-quality word vectors from vast datasets, and do so in a less compute-intensive way than ever before. This lets the company scale up the model far beyond previous limits."Using the DistBelief distributed framework, it should be possible to train the CBOW and Skip-gram models even on corpora with one trillion words, for basically unlimited size of the vocabulary,Reynolds and his committee hoped that at least $60,000 remained in the fund,diagnosisexpert and also wanted approval to bid for construction of the dock, another estimated $20,000." they wrote at the time. "That is several orders of magnitude larger than the best previously published results for similar models.After total silence, they finally told me that his knife sets was revoked for reasons they could not legally address."

댓글 없음:

댓글 쓰기