Alibaba and Microsoft AI best humans in reading test

Artificial intelligence models developed by Microsoft and Alibaba have, for the first time, outperformed humans in a reading comprehension challenge.

The Stanford Question Answering Dataset (SQuAD) is a set of questions whose answers can be found within more than 500 Wikipedia articles.

Alibaba’s deep neural network model scored 82.440 on the ‘exact match’ part of the test, besting the scores achieved by humans (82.304). Microsoft’s similar model achieved a score of 82.650.
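The article does not spell out how the ‘exact match’ figure is calculated. As a rough illustration, the sketch below assumes the convention commonly described for SQuAD: a predicted answer counts as correct if, after light normalisation (lowercasing, removing punctuation and English articles), it is identical to any of the human reference answers, and the score is the percentage of questions answered correctly. The function names and the sample data are hypothetical and are not taken from Alibaba's or Microsoft's systems.

```python
import re
import string


def normalize_answer(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match_score(predictions, references):
    """Return the percentage of questions whose prediction matches a reference.

    predictions: dict mapping question id -> predicted answer string
    references:  dict mapping question id -> list of acceptable answer strings
    """
    hits = 0
    for qid, golds in references.items():
        # A missing prediction is treated as an empty (wrong) answer.
        pred = normalize_answer(predictions.get(qid, ""))
        if any(pred == normalize_answer(gold) for gold in golds):
            hits += 1
    return 100.0 * hits / len(references)


# Hypothetical toy data: one correct answer out of two questions.
predictions = {"q1": "The Eiffel Tower", "q2": "1999"}
references = {"q1": ["Eiffel Tower"], "q2": ["2001", "in 2001"]}
print(exact_match_score(predictions, references))  # 50.0
```

On this reading, a score of 82.440 would mean that roughly 82 in every 100 predicted answers matched a reference answer exactly after normalisation.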

The scoreboard is a who’s who of corporates carrying out artificial intelligence research, featuring the likes of Google, IBM Research, Facebook AI Research, Salesforce Research, Tencent and Samsung.

Alibaba and Microsoft have been placed joint first in the ranking, although both companies claim to have reached the better-than-human milestone first.

While Microsoft is listed as having registered its score on 3rd January and Alibaba two days later, Alibaba said those dates were when the companies submitted their models, not when test results were registered.

“It is our great honour to witness the milestone where machines surpass humans in reading comprehension,” said Luo Si, chief scientist for natural language processing at Alibaba’s Institute of Data Science and Technologies (iDST) in a statement. “We are thrilled to see NLP research has achieved significant progress over the year. We look forward to sharing our model-building methodology with the wider community and exporting the technology to our clients in the near future.”

Ming Zhou, assistant managing director of Microsoft Research Asia, said that despite the milestone, people are still much better overall than machines at comprehending the complexity and nuance of language.

“Natural language processing is still an area with lots of challenges that we all need to keep investing in and pushing forward,” he said. “This milestone is just a start.”

The big AI players are investing heavily in reading comprehension and response models.

Alibaba said it had been using the underlying technology during its ‘Global Shopping Festival’ for a number of years to answer customer inquiries.

Microsoft said it was applying earlier versions of the model to its Bing search engine.

“These tools also could let doctors, lawyers and other experts more quickly get through the drudgery of things like reading through large documents for specific medical findings or rarified legal precedent. The technology would augment their work and leave them with more time to apply the knowledge to focus on treating patients or formulating legal opinions,” the company wrote in a blogpost.

Microsoft is also working on models that can answer likely follow-up questions.

“For example, let’s say you asked a system, ‘What year was the prime minister of Germany born?’ You might want it to also understand you were still talking about the same thing when you asked the follow-up question, ‘What city was she born in?’

“It’s also looking at ways that computers can generate natural answers when that requires information from several sentences. For example, if the computer is asked, ‘Is John Smith a US citizen?’ that information may be based on a paragraph such as, ‘John Smith was born in Hawaii. That state is in the US’,” Microsoft explained.
