{"id":806026,"date":"2021-12-20T11:08:11","date_gmt":"2021-12-20T19:08:11","guid":{"rendered":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/?p=806026"},"modified":"2021-12-20T11:08:13","modified_gmt":"2021-12-20T19:08:13","slug":"azure-ai-milestone-microsoft-kear-surpasses-human-performance-on-commonsenseqa-benchmark","status":"publish","type":"post","link":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/azure-ai-milestone-microsoft-kear-surpasses-human-performance-on-commonsenseqa-benchmark\/","title":{"rendered":"Azure AI milestone: Microsoft KEAR surpasses human performance on CommonsenseQA benchmark"},"content":{"rendered":"\n<figure class=\"wp-block-image alignwide size-large\"><img decoding=\"async\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_no_logo.gif\" alt=\"An illustration of the KEAR architecture represented by five panels side by side. The first contains an input question\u2014\u201cWhat is a treat that your dog will enjoy?\u201d\u2014and the answer choices \u201csalad,\u201d \u201cpetted,\u201d \u201caffection,\u201d \u201cbone,\u201d and \u201clots of attention.\u201d The second panel has three boxes, each representing retrieval from a specific knowledge source. A box labeled \u201cKnowledge Graph\u201d has a silhouette of a dog and underneath it and labeled \u201cdesires\u201d a silhouette of a dog being petted; a heart representing \u201caffection\u201d; a bone; and clapping hands representing \u201clots of attention.\u201d A box labeled \u201crelevant questions\u201d has the question \u201cWhat do dogs like to eat?\u201d and the accompanying answer \u201cBones.\u201d A boxed labeled \u201cdictionary\u201d contains the definition of \u201cbone\u201d: \u201ca composite material making up the skeleton of most vertebrates.\u201d The third panel, labeled \u201cconcatenation with input,\u201d contains the input question followed by \u201cDog, desires, bone. 
Dog, desires, lots of attention\u201d followed by the relevant question and finally the dictionary definition of bone. In between each is a separation token [SEP]. The fourth panel is labeled \u201clanguage model\u201d and contains a quote box labeled \u201clanguage services,\u201d a cube labeled \u201cmodel,\u201d and left and right braces punctuation within a circle labeled \u201clanguage understanding.\u201d The fifth panel is labeled \u201coutput\u201d and includes silhouettes of each of the five answer choices. The silhouette of the bone is highlighted in blue, representing the appropriate response.  \"\/><\/figure>\n\n\n\n<p><em><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/human-parity-on-commonsenseqa-augmenting-self-attention-with-external-attention\/\">KEAR (Knowledgeable External Attention for commonsense Reasoning)<\/a>\u2014along with recent milestones in\u00a0<a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/azure-ai-milestone-new-foundation-model-florence-v1-0-pushing-vision-and-vision-language-state-of-the-art\/\">computer vision<\/a> and\u00a0<a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/azure-ai-milestone-new-neural-text-to-speech-models-more-closely-mirror-natural-speech\/\">neural\u00a0text-to-speech<\/a>\u2014is part of a larger <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/azure.microsoft.com\/en-us\/overview\/ai-platform\/\">Azure AI<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> mission to provide relevant, meaningful AI solutions and services that work better for people\u00a0because they better capture how people learn and work\u2014with improved vision, knowledge understanding, and speech capabilities. 
At the center of these efforts is XYZ-code, a joint\u00a0representation\u00a0of three cognitive attributes: monolingual text (X), audio or visual sensory signals (Y), and multilingual (Z). For more information about these efforts, read the\u00a0<\/em><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/a-holistic-representation-toward-integrative-ai\/\" target=\"_blank\" rel=\"noreferrer noopener\"><em>XYZ-code blog post<\/em><\/a><em>.<\/em><\/p>\n\n\n\n<p>Last month, our <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/azure.microsoft.com\/en-us\/services\/cognitive-services\/\" target=\"_blank\" rel=\"noopener noreferrer\">Azure Cognitive Services<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> team, comprising researchers and engineers with expertise in AI, achieved a groundbreaking milestone by advancing commonsense language understanding. When given a question that requires drawing on prior knowledge and five answer choices, our latest model\u2014 <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/human-parity-on-commonsenseqa-augmenting-self-attention-with-external-attention\/\" target=\"_blank\" rel=\"noreferrer noopener\">KEAR, Knowledgeable External Attention for commonsense Reasoning<\/a>\u2014performs better than people answering the same question, calculated as the majority vote among five individuals. KEAR reaches an accuracy of 89.4 percent on the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.tau-nlp.org\/commonsenseqa\" target=\"_blank\" rel=\"noopener noreferrer\">CommonsenseQA<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> leaderboard compared with 88.9 percent human accuracy. 
While the CommonsenseQA benchmark is in English, we follow a similar technique for multilingual commonsense reasoning and topped the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/inklab.usc.edu\/XCSR\/\" target=\"_blank\" rel=\"noopener noreferrer\">X-CSR<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> leaderboard.<\/p>\n\n\n\n<p>Although recent large deep learning models trained with big data have made significant breakthroughs in natural language understanding, they still struggle with commonsense knowledge about the world, information that we, as people, have gathered in our day-to-day lives over time. Commonsense knowledge is often absent from task input but is crucial for language understanding. For example, take the question \u201cWhat is a treat that your dog will enjoy?\u201d To select an answer from the choices <em>salad<\/em>, <em>petted<\/em>, <em>affection<\/em>, <em>bone<\/em>, and <em>lots of attention<\/em>, we need to know that dogs generally enjoy food such as bones for a treat. Thus, the best answer would be \u201cbone.\u201d Without this external knowledge, even large-scale models may generate incorrect answers. For example, the <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/deberta-decoding-enhanced-bert-with-disentangled-attention-2\/\" target=\"_blank\" rel=\"noreferrer noopener\">DeBERTa language model<\/a> selects \u201clots of attention,\u201d which is not as good an answer as &#8220;bone.&#8221; <\/p>\n\n\n\n<p>On the other hand, expert systems with lots of rules and domain knowledge and little data have failed to deliver their promise of AI that understands and reasons more like people do. We revisit the rules and knowledge approach and find that deep learning models and knowledge can be organically combined via an external attention mechanism to achieve breakthroughs in AI. 
With KEAR, we specifically equip language models with commonsense knowledge from a knowledge graph, dictionary, and publicly available machine learning data.<\/p>\n\n\n\n<p>Given a question and five candidate answers, for the CommonsenseQA task, the KEAR model first retrieves related knowledge from a knowledge graph via entity linking, from a dictionary via word matching, and from related QA datasets via text retrieval. Then, the retrieved knowledge is concatenated with the input question and candidate answer and fed into a language model to produce a score. The candidate answer with the highest score is chosen as the output. The final submission is generated by an ensemble of 39 language models, such as DeBERTa and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/arxiv.org\/abs\/2003.10555\" target=\"_blank\" rel=\"noopener noreferrer\">ELECTRA<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, with majority voting. In this way, the KEAR model can attend to related external knowledge for effective commonsense understanding.<\/p>\n\n\n\n<p>For example, for the aforementioned question\u2014\u201cWhat is a treat that your dog will enjoy?\u201d\u2014KEAR retrieves \u201cDog \u2014 desires \u2014 petted, affection, bone, lots of attention\u201d from the knowledge graph ConceptNet (note that the choice \u201csalad,\u201d offered as one of the five options, doesn\u2019t appear in the retrieved results); \u201cBone: a composite material making up the skeleton of most vertebrates\u201d from the dictionary Wiktionary; and \u201cWhat do dogs like to eat? bones\u201d from the training data in the<strong> <\/strong>CommonsenseQA dataset. 
After concatenating the retrieved knowledge with the input, KEAR feeds it into the DeBERTa model, which selects the answer \u201cbone.\u201d<\/p>\n\n\n\n<p>In applying external attention to multilingual commonsense reasoning, we translate a non-English question into English, retrieve the knowledge from various sources, and translate the knowledge text into the source language for external attention. The proposed model, <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/leveraging-knowledge-in-multilingual-commonsense-reasoning\/\" target=\"_blank\" rel=\"noreferrer noopener\">Translate-Retrieve-Translate (TRT)<\/a>, achieved first place on both the X-CODAH and X-CSQA datasets on the X-CSR benchmark.<\/p>\n\n\n\n<h3 id=\"external-attention-the-benefits-of-looking-outward\">External attention: The benefits of looking outward<\/h3>\n\n\n\n<p>External attention<strong> <\/strong>is complementary to self-attention, which has been widely adopted by many of today\u2019s AI systems, such as those using <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/arxiv.org\/pdf\/1810.04805.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">Transformers<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. These systems rely on a large amount of diverse data to achieve impressive AI performance with huge-size models. 
This has prompted the recent boom in very large Transformer models, ranging from <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/arxiv.org\/abs\/1810.04805\" target=\"_blank\" rel=\"noopener noreferrer\">BERT<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> with 110 million parameters to <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/openai.com\/blog\/gpt-3-apps\/\" target=\"_blank\" rel=\"noopener noreferrer\">GPT-3<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> with 175 billion parameters. Nevertheless, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/arxiv.org\/abs\/2108.07258\" target=\"_blank\" rel=\"noopener noreferrer\">numerous studies<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> have shown that the general understanding and generation capabilities of these models still fall short of those of people, especially on tasks requiring external knowledge. 
Moreover, the sheer size of these models makes them difficult for much of the AI community to use, study, and deploy, not to mention the significant carbon footprint created during computation.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"509\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/CommonSenseQAFig2-1024x509.png\" alt=\"Figure 2.2: External Attention to various knowledge sources.\" class=\"wp-image-806614\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/CommonSenseQAFig2-1024x509.png 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/CommonSenseQAFig2-300x149.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/CommonSenseQAFig2-768x382.png 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/CommonSenseQAFig2-240x119.png 240w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/CommonSenseQAFig2.png 1440w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption>Figure 2: External attention to various knowledge sources. The KEAR model first retrieves relevant knowledge from various sources and then uses a language model to conduct self-attention to the input and external attention to the knowledge.<\/figcaption><\/figure><\/div>\n\n\n\n<p>While Transformer models process input by looking <em>inward<\/em> via self-attention, external attention makes a model look <em>outward<\/em> by providing it with related context and knowledge from various sources, including knowledge graphs, dictionaries, corpora, and other language models\u2019 output, and then letting the model conduct both self-attention to the input and external attention to the knowledge. 
The external information is stored in a symbolic way (for example, in plain text or knowledge graph entries) and thus enables a moderately sized Transformer model to excel in language understanding. Moreover, the text-level concatenation of input and knowledge used by KEAR requires no change to the Transformer model architecture, enabling existing systems to be easily adapted to external attention.<\/p>\n\n\n\n<p>Another benefit of external attention is that one can easily update the knowledge source to change the model behavior. The latest world knowledge can be fed into the model by updating the knowledge graph using recent online sources. By incorporating explicit world knowledge, the decision process of the model also becomes more transparent and explainable. These benefits can greatly facilitate the application of external attention technology to various natural language processing research projects and products. This opens the door for us to better understand the meaning of text, associate it with related knowledge, and generate more accurate output.\u00a0<\/p>\n\n\n\n<p>For more information on <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/github.com\/microsoft\/KEAR\">KEAR<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, check out this\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/j6IFOTdnJMg\" target=\"_blank\" rel=\"noopener noreferrer\">Tech Minutes video<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0and our <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/github.com\/microsoft\/KEAR\" target=\"_blank\" rel=\"noopener noreferrer\">GitHub page<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, and for our team\u2019s latest advancements, visit the <a 
href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/group\/knowledge-and-language\/\" target=\"_blank\" rel=\"noreferrer noopener\">Knowledge and Language Team<\/a>\u00a0page.\u00a0<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"yt-consent-placeholder\" role=\"region\" aria-label=\"Video playback requires cookie consent\" data-video-id=\"j6IFOTdnJMg\" data-poster=\"https:\/\/img.youtube.com\/vi\/j6IFOTdnJMg\/maxresdefault.jpg\"><iframe aria-hidden=\"true\" tabindex=\"-1\" title=\"Developer Tech Minutes: Human Parity on CommonsenseQA\" width=\"500\" height=\"281\" data-src=\"https:\/\/www.youtube-nocookie.com\/embed\/j6IFOTdnJMg?feature=oembed&rel=0&enablejsapi=1\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><div class=\"yt-consent-placeholder__overlay\"><button class=\"yt-consent-placeholder__play\"><svg width=\"42\" height=\"42\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\"><g fill=\"none\" fill-rule=\"evenodd\"><circle fill=\"#000\" opacity=\".556\" cx=\"21\" cy=\"21\" r=\"21\"\/><path stroke=\"#FFF\" d=\"M27.5 22l-12 8.5v-17z\"\/><\/g><\/svg><span class=\"yt-consent-placeholder__label\">Video playback requires cookie consent<\/span><\/button><\/div><\/div>\n<\/div><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>KEAR (Knowledgeable External Attention for commonsense Reasoning)\u2014along with recent milestones in\u00a0computer vision and\u00a0neural\u00a0text-to-speech\u2014is part of a larger Azure AI (opens in new tab) mission to provide relevant, meaningful AI solutions and services that work better for people\u00a0because they better capture how people learn and work\u2014with improved vision, knowledge 
understanding, and speech capabilities. At the center [&hellip;]<\/p>\n","protected":false},"author":40306,"featured_media":806671,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[{"type":"user_nicename","value":"Yichong Xu","user_id":"40279"},{"type":"user_nicename","value":"Chenguang Zhu","user_id":"35600"},{"type":"user_nicename","value":"Shuohang Wang","user_id":"39678"},{"type":"user_nicename","value":"Michael Zeng","user_id":"33141"},{"type":"user_nicename","value":"Xuedong Huang","user_id":"34869"}],"msr_hide_image_in_river":0,"footnotes":""},"categories":[1],"tags":[],"research-area":[13556],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-806026","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research-blog","msr-research-area-artificial-intelligence","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[741481],"related-projects":[715045],"related-events":[],"related-researchers":[{"type":"user_nicename","value":"Shuohang Wang","user_id":39678,"display_name":"Shuohang Wang","author_link":"<a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/shuowa\/\" aria-label=\"Visit the profile page for Shuohang Wang\">Shuohang Wang<\/a>","is_active":false,"last_first":"Wang, Shuohang","people_section":0,"alias":"shuowa"}],"msr_type":"Post","featured_image_thumbnail":"<img width=\"960\" height=\"540\" 
src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-scaled-960x540.jpg\" class=\"img-object-cover\" alt=\"An illustration of the KEAR architecture represented by five panels side by side. The first contains an input question\u2014\u201cWhat is a treat that your dog will enjoy?\u201d\u2014and the answer choices \u201csalad,\u201d \u201cpetted,\u201d \u201caffection,\u201d \u201cbone,\u201d and \u201clots of attention.\u201d The second panel has three boxes, each representing retrieval from a specific knowledge source. A box labeled \u201cKnowledge Graph\u201d has a silhouette of a dog and underneath it and labeled \u201cdesires\u201d a silhouette of a dog being petted; a heart representing \u201caffection\u201d; a bone; and clapping hands representing \u201clots of attention.\u201d A box labeled \u201crelevant questions\u201d has the question \u201cWhat do dogs like to eat?\u201d and the accompanying answer \u201cBones.\u201d A boxed labeled \u201cdictionary\u201d contains the definition of \u201cbone\u201d: \u201ca composite material making up the skeleton of most vertebrates.\u201d The third panel, labeled \u201cconcatenation with input,\u201d contains the input question followed by \u201cDog, desires, bone. Dog, desires, lots of attention\u201d followed by the relevant question and finally the dictionary definition of bone. In between each is a separation token [SEP]. The fourth panel is labeled \u201clanguage model\u201d and contains a quote box labeled \u201clanguage services,\u201d a cube labeled \u201cmodel,\u201d and left and right braces punctuation within a circle labeled \u201clanguage understanding.\u201d The fifth panel is labeled \u201coutput\u201d and includes silhouettes of each of the five answer choices. 
The silhouette of the bone is highlighted in blue, representing the appropriate response.\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-scaled-960x540.jpg 960w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-300x169.jpg 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-1024x576.jpg 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-768x432.jpg 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-1536x865.jpg 1536w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-2048x1153.jpg 2048w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-1066x600.jpg 1066w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-655x368.jpg 655w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-343x193.jpg 343w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-240x135.jpg 240w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-scaled-640x360.jpg 640w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-1280x720.jpg 1280w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/12\/1400x788_Common_Sense_still_no_logo-1-1920x1080.jpg 1920w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" 
\/>","byline":"","formattedDate":"December 20, 2021","formattedExcerpt":"KEAR (Knowledgeable External Attention for commonsense Reasoning)\u2014along with recent milestones in\u00a0computer vision and\u00a0neural\u00a0text-to-speech\u2014is part of a larger Azure AI (opens in new tab) mission to provide relevant, meaningful AI solutions and services that work better for people\u00a0because they better capture how people learn and work\u2014with&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/806026","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/users\/40306"}],"replies":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/comments?post=806026"}],"version-history":[{"count":12,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/806026\/revisions"}],"predecessor-version":[{"id":809389,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/806026\/revisions\/809389"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media\/806671"}],"wp:attachment":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media?parent=806026"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/categories?post=806026"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/tags?post=806026"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/cm-e
dgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=806026"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=806026"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=806026"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=806026"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=806026"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=806026"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=806026"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=806026"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}