博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
Alibaba AI Model Tops Humans in Reading Comprehension
阅读量:6535 次
发布时间:2019-06-24

本文共 2643 字,大约阅读时间需要 8 分钟。

_

Score one for machines in the battle of man versus machine, with an Alibaba deep-learning model this month topping humans for the first time in one of the world’s most-challenging reading comprehension tests.

Alibaba’s Institute of Data Science and Technologies (iDST) said Monday its deep neural network model scored 82.44 in the Stanford Question Answering Dataset (SQuAD) on Jan. 11, beating the human score of 82.304 for Exact Match, i.e. providing exact answers to questions. The SQuAD is a large-scale reading comprehension dataset comprised of over 100,000 question-answer pairs based on over 500 Wikipedia articles.

“It is our great honor to witness the milestone where machines surpass humans in reading comprehension,” said Luo Si, iDST’s chief scientist for Natural Language Processing. “We are thrilled to see NLP research has achieved significant progress over the year. We look forward to sharing our model-building methodology with the wider community and exporting the technology to our clients in the near future.”

Teams competing in the challenge need to build machine-learning models that can provide answers to the questions in the dataset, such as “what causes rain?” The Alibaba model’s accuracy was tied to its ability to read from paragraphs to sentences to words, locating precise phrases that contain potential answers. That model, which leverages the Hierarchical Attention Network, is viewed as having strong commercial value. Alibaba has used the underlying technology in its 11.11 Global Shopping Festival for several years, with machines answering large amounts of inbound customer inquiries.

Other potential customer-service uses included tutorials for visitors to museums and online responses to inquiries from some medical patients.

The SQuAD is perceived as the world’s top machine reading-comprehension test and attracts universities and institutes ranging from Google, Facebook, IBM, Microsoft to Carnegie Mellon University, Stanford University and the Allen Research Institute.

While its SQuAD performance is a milestone, it’s just one of the proof points made by the iDST’s Natural Language Processing Team recently. Other successes include the best scores and prizes in the ACM CIKM Cup, which focuses on personalized e-commerce searches, Chinese Grammar Error Diagnosis and English-named entity classifications tasks at the Text Analysis Conference, a series of workshops arranged by the U.S. National Institute of Standards and Technology.

The iDST is Alibaba’s primary research arm focusing on . It’s heavily into Natural Language Processing and solving problems that lead to real-world applications.

转载地址:http://mdkdo.baihongyu.com/

你可能感兴趣的文章
设计模式学习---UML常见关系的实现
查看>>
图解openssl实现私有CA
查看>>
BZOJ2213 : [Poi2011]Difference
查看>>
c++ Constructor FAQ 继续
查看>>
事务之六:spring 嵌套事务
查看>>
C#:路径
查看>>
js表单计算金额问题
查看>>
iOS图片加载速度极限优化—FastImageCache解析
查看>>
PHP中的一些新特性
查看>>
Jmockit使用
查看>>
I.MX6 Android mmm convenient to use
查看>>
[CareerCup] 13.9 Aligned Malloc and Free Function 写一对申请和释放内存函数
查看>>
Stack and Heap 堆和栈的区别
查看>>
什么是 A 轮融资?有 B轮 C轮么?
查看>>
55、Android网络图片 加载缓存处理库的使用
查看>>
svn文件提交时强制写注释
查看>>
【转载】千万级规模高性能、高并发的网络架构经验分享
查看>>
jsp字段判空
查看>>
OC基础--OC中的类方法和对象方法
查看>>
ubuntu samba服务器多用户配置【转】
查看>>