{"id":742696,"date":"2021-05-03T11:51:37","date_gmt":"2021-05-03T18:51:37","guid":{"rendered":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/?p=742696"},"modified":"2023-03-14T15:41:08","modified_gmt":"2023-03-14T22:41:08","slug":"conversations-with-data-advancing-the-state-of-the-art-in-language-driven-data-exploration","status":"publish","type":"post","link":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/conversations-with-data-advancing-the-state-of-the-art-in-language-driven-data-exploration\/","title":{"rendered":"Conversations with data: Advancing the state of the art in language-driven data exploration"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-scaled.jpg\" alt=\"Flow chart\"\/><\/figure>\n\n\n\n<p>One key aspiration of AI is to develop natural and effective task-oriented conversational systems. Task-oriented conversational systems use a natural language interface to collaborate with and support people in accomplishing specific goals and activities. They go beyond chitchat conversation. For example, as personal digital assistants, they ease the stress of trip planning or reduce the expertise required to generate a sales report from a database. While natural language understanding (NLU) technology and research have achieved remarkable recent progress, task-oriented assistance requires tackling additional challenges in practical NLU.<\/p>\n\n\n\n<p>Consider a prime application of task-oriented conversations: <em>language-driven data<\/em> <em>exploration<\/em>. Data scientists, analysts, and information workers routinely spend more than half of their time exploring, visualizing, and reformatting datasets, according to Anaconda\u2019s \u201c<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.anaconda.com\/state-of-data-science-2020\">The State of Data Science 2020<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.\u201d Not only time-consuming, this process is also error prone and typically requires data science programming skills, such as knowledge in Python, R, or SQL. Augmenting interactive data science environments like Microsoft <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/microsoft-365\/excel\">Excel <\/a>or <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/jupyter.org\/\">Jupyter<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> with language-driven assistance would not only save time but also democratize data exploration. For instance, an analyst could ask a system in natural language to plot last month\u2019s sales metrics from her database rather than program a filtered visualization in SQL+R. Importantly, such a system should still allow analysts to inspect and edit program snippets after assisting with the most laborious parts of exploratory data science. This transparency and ability to edit will empower analysts and allow them to have confidence in the outcome of the work.<\/p>\n\n\n\n<p>Data exploration highlights a core NLU challenge that plagues all task-oriented conversational systems. Understanding people\u2019s language\u2014and thus, their <em>task intent<\/em>\u2014must be <em>grounded<\/em>. That is, what has been said must be interpreted relative to its context. Task-oriented systems deal with two kinds of relevant context. First, every task acts upon a structured ontology such as a database, a spreadsheet, or an API. The ontology provides data context, which influences language understanding. For example, the analyst question \u201c<em>Which departments have unfinished projects?<\/em>\u201d refers to \u201c<em>departments<\/em>\u201d and \u201c<em>projects<\/em>\u201d in her database and \u201c<em>unfinished<\/em>\u201d likely refers to a project\u2019s status column in that database, all of which the system must emit as column references in the desired SQL program. Second, conversational systems must consider multi-turn dynamics of an interaction, which create <em>conversation context.<\/em> For example, the analyst might follow up her exploration with \u201c<em>What is their total budget,<\/em>\u201d implicitly referring to \u201cunfinished projects\u201d from the previous turn.<\/p>\n\n\n\n<div class=\"annotations \" data-bi-aN=\"margin-callout\">\n\t<article class=\"annotations__list card depth-16 bg-body p-4 annotations__list--right\">\n\t\t<div class=\"annotations__list-item\">\n\t\t\t\t\t\t<span class=\"annotations__type d-block text-uppercase font-weight-semibold text-neutral-300 small\">Publication<\/span>\n\t\t\t<a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/score-pre-training-for-context-representation-in-conversational-semantic-parsing\/\" data-bi-cN=\"SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing\" data-external-link=\"false\" data-bi-aN=\"margin-callout\" data-bi-type=\"annotated-link\" class=\"annotations__link font-weight-semibold text-decoration-none\"><span>SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing<\/span>&nbsp;<span class=\"glyph-in-link glyph-append glyph-append-chevron-right\" aria-hidden=\"true\"><\/span><\/a>\t\t\t\t\t<\/div>\n\t<\/article>\n<\/div>\n\n\n\n<p>Recently, we\u2019ve made <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/project\/conversations-with-data\/#!publications\">several fundamental contributions<\/a> to these challenges. In the <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/event\/iclr-2021\/#!sessions\">2021 International Conference on Learning Representations (ICLR)<\/a> publication <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/score-pre-training-for-context-representation-in-conversational-semantic-parsing\/\">\u201cSCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing,\u201d<\/a> we introduce SCoRe, a task-oriented conversational system with multiple applications. SCoRe achieves new state-of-the-art performance in interactive data exploration (on <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/yale-lily.github.io\/sparc#:~:text=SParC%20is%20a%20dataset%20for%20cross-domain%20S%20emantic,Spider%20task,%20a%20complex%20and%20cross-domain%20text-to-SQL%20challenge.\">SParC<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/yale-lily.github.io\/cosql\">CoSQL<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> benchmarks) and task-oriented dialogue (<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/github.com\/budzianowski\/multiwoz\">MultiWOZ<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>), improving upon previous best techniques by up to 12 percent. SCoRe addresses the conversation context challenge through its task-oriented pretraining methodology, which learns language representations that link multiple conversation turns. To address the data context challenge, SCoRe builds upon our previous work in <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/rat-sql-relation-aware-schema-encoding-and-linking-for-text-to-sql-parsers\/\">RAT-SQL<\/a> and <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/structure-grounded-pretraining-for-text-to-sql\/\">StruG<\/a>. These two publications introduce a unified framework for language understanding in the context of a structured database. It has since been leveraged in numerous applications in addition to SCoRe. We\u2019re presenting SCoRe at ICLR on Monday, May 3, from 5 PM to 7 PM Pacific Time and 7 PM to 9 PM Pacific Time. The&nbsp;SCoRe&nbsp;code&nbsp;will&nbsp;be&nbsp;<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/github.com\/microsoft\/SCoRe\" target=\"_blank\" rel=\"noopener noreferrer\">published&nbsp;on GitHub<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>;&nbsp;please follow&nbsp;the repository&nbsp;for updates.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-large\"><a data-bi-bhvr=\"14\"  data-bi-cn=\"Two examples of user dialogue and their corresponding formal programs and databases. The first example is a multi-turn text-to-SQL task. The user query \u201cFind the names of the top 3 highest sales books\u201d corresponds to the formal program \u201cSELECT title FROM book ORDER BY sale_amount DESC LIMIT 3\u201d. The follow-up user query, \u201cWho are their authors,\u201d corresponds to the formal program \u201cSELECT t1.title, t1.name FROM author AS t1 JOIN book AS t2 ON t1.id = t2.author_id ORDER BY t2.sale_amount DESC LIMIT 3\u201d. The next turn, \u201cAlso show the names of their publishers,\u201d corresponds to the formal program \u201cSELECT t1.title, t1.name, t3.name FROM author AS t1 JOIN book AS t2 ON t1.id = t2.author_id JOIN press AS t3 ON t2.press_id = t3.id ORDER BY t2.sale_amount DESC LIMIT 3\u201d. In the corresponding database, there is an \u201cAuthor\u201d table with an \u201cid\u201d column, a \u201cname\u201d column, a \u201ccountry\u201d column, and an ellipsis signifying additional columns; a \u201cPress\u201d table with an \u201cid\u201d column, a \u201cname\u201d column, an \u201caddress\u201d column, and an ellipsis signifying additional columns; and a \u201cBook\u201d table with an \u201cid\u201d column, a \u201ctitle\u201d column, an \u201cauthor id\u201d column, a \u201csale_amount\u201d column, and an ellipsis signifying additional columns. The second example is a dialogue state tracking task. The user query \u201cI am looking for a cheap restaurant in the centre of the city\u201d corresponds to the formal program \u201cRestaurant(Price=cheap, area=center)\u201d. The system replies, \u201cThere is a cheap Chinese restaurant called Dojo Noodle Bar,\u201d to which the user answers, \u201cYes please, for 8 people at 18:30 on Thursday,\u201d which corresponds to the formal program \u201cRestaurant(Price=cheap, area=center, name=Dojo Noodle Bar, people=8, time=18:30, day=Thursday)\u201d. The user notes, \u201cI also need to book a taxi between to the restaurant at 20:30,\u201d which corresponds to the formal program \u201cRestaurant(Price=cheap, area=center, name=Dojo Noodle Bar, people=8, time=18:30, day=Thursday) Taxi(leaveAt=20:30, destination=Dojo Noodle Bar)\u201d. In the corresponding database, there is a \u201cRestaurant\u201d table with a \u201cname\u201d column, a \u201cprice\u201d column, an \u201carea\u201d column, a \u201ctime\u201d column, and an ellipsis signifying additional columns; and a \u201cTaxi\u201d table with a \u201cleaveAt\u201d column, a \u201cdestination\u201d column, and an ellipsis signifying additional columns.  \" href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Score_Figure1_PRFeedback_edited-2.jpg\"><img loading=\"lazy\" decoding=\"async\" width=\"624\" height=\"200\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Score_Figure1_PRFeedback_edited-2.jpg\" alt=\"Two examples of user dialogue and their corresponding formal programs and databases. The first example is a multi-turn text-to-SQL task. The user query \u201cFind the names of the top 3 highest sales books\u201d corresponds to the formal program \u201cSELECT title FROM book ORDER BY sale_amount DESC LIMIT 3\u201d. The follow-up user query, \u201cWho are their authors,\u201d corresponds to the formal program \u201cSELECT t1.title, t1.name FROM author AS t1 JOIN book AS t2 ON t1.id = t2.author_id ORDER BY t2.sale_amount DESC LIMIT 3\u201d. The next turn, \u201cAlso show the names of their publishers,\u201d corresponds to the formal program \u201cSELECT t1.title, t1.name, t3.name FROM author AS t1 JOIN book AS t2 ON t1.id = t2.author_id JOIN press AS t3 ON t2.press_id = t3.id ORDER BY t2.sale_amount DESC LIMIT 3\u201d. In the corresponding database, there is an \u201cAuthor\u201d table with an \u201cid\u201d column, a \u201cname\u201d column, a \u201ccountry\u201d column, and an ellipsis signifying additional columns; a \u201cPress\u201d table with an \u201cid\u201d column, a \u201cname\u201d column, an \u201caddress\u201d column, and an ellipsis signifying additional columns; and a \u201cBook\u201d table with an \u201cid\u201d column, a \u201ctitle\u201d column, an \u201cauthor id\u201d column, a \u201csale_amount\u201d column, and an ellipsis signifying additional columns. The second example is a dialogue state tracking task. The user query \u201cI am looking for a cheap restaurant in the centre of the city\u201d corresponds to the formal program \u201cRestaurant(Price=cheap, area=center)\u201d. The system replies, \u201cThere is a cheap Chinese restaurant called Dojo Noodle Bar,\u201d to which the user answers, \u201cYes please, for 8 people at 18:30 on Thursday,\u201d which corresponds to the formal program \u201cRestaurant(Price=cheap, area=center, name=Dojo Noodle Bar, people=8, time=18:30, day=Thursday)\u201d. The user notes, \u201cI also need to book a taxi between to the restaurant at 20:30,\u201d which corresponds to the formal program \u201cRestaurant(Price=cheap, area=center, name=Dojo Noodle Bar, people=8, time=18:30, day=Thursday) Taxi(leaveAt=20:30, destination=Dojo Noodle Bar)\u201d. In the corresponding database, there is a \u201cRestaurant\u201d table with a \u201cname\u201d column, a \u201cprice\u201d column, an \u201carea\u201d column, a \u201ctime\u201d column, and an ellipsis signifying additional columns; and a \u201cTaxi\u201d table with a \u201cleaveAt\u201d column, a \u201cdestination\u201d column, and an ellipsis signifying additional columns.  \" class=\"wp-image-743203\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Score_Figure1_PRFeedback_edited-2.jpg 624w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Score_Figure1_PRFeedback_edited-2-300x96.jpg 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Score_Figure1_PRFeedback_edited-2-16x5.jpg 16w\" sizes=\"auto, (max-width: 624px) 100vw, 624px\" \/><\/a><figcaption class=\"wp-element-caption\">Figure 1: Achieving natural and effective task-oriented conversational systems requires the ability to interpret natural language based on data context\u2014how a query relates to the ontology over which it is being made\u2014and conversation context\u2014how a query relates to previous conversational turns. Above are examples of conversational tasks from the text-to-SQL dataset SParC and the dialogue modeling dataset MultiWOZ, which exhibit different forms of such context for the system to learn.<\/figcaption><\/figure>\n\n\n\n<h2 id=\"data-context-representation\">Data context representation<\/h2>\n\n\n\n<p>The grounding challenges associated with data context and conversation context are distinct yet interconnected, and progress on both is critical to build effective task-oriented conversational systems. Here, we first address data context grounding by focusing on a single-turn version of the data exploration problem known as <em>database question answering (DBQA).<\/em> As you\u2019ll see, techniques developed for DBQA facilitate data context grounding in broader applications of task-oriented conversational systems.<\/p>\n\n\n\n<h3 id=\"rat-sql-joint-representation-of-question-and-data-context\">RAT-SQL: Joint representation of question and data context<\/h3>\n\n\n\n<p>When an analyst asks a question over her database, that question and its associated data context\u2014the <em>database schema<\/em>\u2014are both embedded into distributed <em>intent representations<\/em> by a neural encoder network. The key to addressing the data context challenge is <em>jointly <\/em>contextualizing intent representations\u2014the question and the schema provide important context to each other. For example, in Figure 1, \u201c<em>Find the names of the top 3 highest sales books\u201d<\/em> refers to the <code>title <\/code>column even though the question doesn\u2019t mention \u201c<em>titles<\/em>.\u201d From question language alone, <code>author.name<\/code> could superficially seem a better match.<\/p>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/arxiv.org\/abs\/1706.03762\">Transformers<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> are the most effective approach for contextualized representation learning in modern NLU. They\u2019re based on <em>self-attention<\/em>, which, in one interpretation, learns latent relations between the inputs\u2014in this case, question words and column\/table names in the schema. While effective in <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/arxiv.org\/abs\/2103.05247\">multiple fields<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, self-attention requires large training corpora, and human-authored DBQA datasets reach only up to 10,000 training instances. With limited data, even simple natural language relations, easily discovered in machine translation and other NLU systems, can be challenging. For instance, in our DBQA experiments, standard Transformers struggled to reliably link <em>\u201csales\u201d<\/em> to the column <code>sale_amount<\/code>. In the <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/event\/acl-2020\/\">2020 Meeting of the Association for Computational Linguistics (ACL)<\/a> publication <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/rat-sql-relation-aware-schema-encoding-and-linking-for-text-to-sql-parsers\/\">\u201cRAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers,\u201d<\/a> we introduced <em>relation-aware Transformers (RAT)<\/em>, which reduce training data requirements by incorporating known relational information about the database into self-attention. They allow the encoder to consider, for instance, database foreign keys without either rediscovering them (as in standard Transformers) or hard-coding the network\u2019s relational structure to follow them (as in graph neural networks). As such, RAT augments the learning efficacy of Transformers with rich background knowledge about relational structure.<\/p>\n\n\n\n<p>We combine a RAT encoder with a grammar-driven program decoder into an end-to-end DBQA model called RAT-SQL. On <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/arxiv.org\/abs\/1809.08887\">Spider<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, currently the most challenging text-to-SQL dataset, RAT-SQL achieved a new state of the art of 65.6 percent exact-match accuracy at the time of its publication. Relation-aware data contextualization added a more than 5 percent margin over previous best techniques. Since then, numerous researchers from other institutions have built upon RAT-SQL in their DBQA models, achieving <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/yale-lily.github.io\/spider\">even more impressive performance<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n\n\n\n<h3 id=\"strug-structure-grounded-pretraining-for-robust-language-understanding\">StruG: Structure-grounded pretraining for robust language understanding<\/h3>\n\n\n\n<p>Joint representation of the question and its data context fundamentally requires solving an <em>alignment problem<\/em>\u2014that is, linking words in the question to the data columns they reference\u200b. Relation-aware Transformers effectively incorporate known relations between the question and the data, but alignment often requires additional background knowledge either from the database content or from NLU at large. For example, in Figure&nbsp;2, linking <em>\u201cHistory\u201d<\/em> to <code>department_name<\/code> is challenging without looking at the database content even though people\u2019s real-world experience intuitively aligns these two phrases.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"912\" height=\"713\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE-Fig-2_Updated.jpg\" alt=\"Two illustrations of alignment between natural language (NL) utterances and tables and column names or content of these tables. The top example shows a DBQA task. The database (DB) contains two tables: \u201cstudent\u201d and \u201cdepartment\u201d. The table \u201cstudent\u201d contains columns \u201cid\u201d, \u201cname\u201d, \u201cdepartment_name\u201d, \u201ctotal_credits\u201d, and an ellipsis signifying additional columns. The table \u201cdepartment\u201d contains columns \u201cid\u201d, \u201cname\u201d, \u201cbuilding\u201d, \u201cbudget\u201d, and an ellipsis signifying additional columns. The NL utterance is \u201cWhat is the name of the student who has the highest total credits in the History department.\u201d The corresponding SQL program is \u201cSELECT name FROM student WHERE department_name = \u2018History\u2019 ORDER BY total_credits DESC LIMIT 1\u201d. The words \u201cHistory department\u201d in the NL utterance, the column name \u201cdepartment_name\u201d in the DB, and the clause \u201cdepartment_name = \u2018History\u2019\u201d in the SQL program are highlighted to indicate alignment. The words \u201ctotal credits\u201d in the NL utterance, the column name \u201ctotal_credits\u201d in the DB, and the clause \u201ctotal_credits\u201d in the SQL program are highlighted differently to indicate another alignment. The bottom example shows an instance of parallel text-table annotation from the web. It\u2019s a table with the columns \u201ctrain number\u201d, \u201cdeparture station\u201d, \u201cdeparture time\u201d, \u201cdeparture day\u201d, \u201carrival station\u201d, and an ellipsis signifying additional columns. It shows two rows of content. The first row has values \u201c11417\u201d for \u201ctrain number\u201d, \u201cPune Junction\u201d for \u201cdeparture station\u201d, \u201c22:00 PM\u201d for \u201cdeparture time\u201d, \u201cThu\u201d for \u201cdeparture day\u201d, and \u201cNagpur Junction\u201d for \u201carrival station\u201d. The second row has values \u201c11418\u201d for \u201ctrain number\u201d, \u201cNagpur Junction\u201d for \u201cdeparture station\u201d, \u201c15:00 PM\u201d for \u201cdeparture time\u201d, \u201cFri\u201d for \u201cdeparture day\u201d, and \u201cPune Junction\u201d for \u201carrival station\u201d. The NL utterance about the table is \u201cThe 11417 Pune-Nagpur Humsafar Express runs between Pune Junction and Nagpur Junction.\u201d The word \u201c11417\u201d in the NL utterance, the column name \u201ctrain number\u201d, and its first-row value \u201c11417\u201d in the table are highlighted to indicate alignment. The words \u201cPune Junction\u201d in the NL utterance, the column name \u201cdeparture station\u201d, and its first-row value \u201cPune Junction\u201d in the table are highlighted differently to indicate a second alignment. The words \u201cNagpur Junction\u201d in the NL utterance, the column name \u201carrival station\u201d, and its first-row value \u201cNagpur Junction\u201d in the table are highlighted differently to indicate a third alignment.  \" class=\"wp-image-743218\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE-Fig-2_Updated.jpg 912w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE-Fig-2_Updated-300x235.jpg 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE-Fig-2_Updated-768x600.jpg 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE-Fig-2_Updated-16x12.jpg 16w\" sizes=\"auto, (max-width: 912px) 100vw, 912px\" \/><figcaption class=\"wp-element-caption\">Figure 2: For a task-oriented conversational system to account for data context, it must be able to link words in a query to their corresponding data columns based on background knowledge about the relations between the question and the data. The relations align words in a query to column names or table content, as illustrated here in database question answering (top) and in parallel text-table corpora from the web (bottom).<\/figcaption><\/figure>\n\n\n\n<p>Background knowledge on question-table alignment naturally occurs in parallel text-table corpora such as <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/github.com\/google-research-datasets\/ToTTo\">ToTTo<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. They pair data tables with relevant utterances about them, such as table summaries and data references. While such utterances are not typically in question form, they exhibit the same alignment patterns as questions in DBQA. In an upcoming <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/2021.naacl.org\/\">2021 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> publication <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/structure-grounded-pretraining-for-text-to-sql\/\">\u201cStructure-Grounded Pretraining for Text-to-SQL,\u201d<\/a> we propose a method to leverage this data for pretraining contextual representation models. StruG, short for \u201c<strong>Stru<\/strong>cture-<strong>G<\/strong>rounded pretraining,\u201d introduces three critical pretraining tasks that use text-table alignment annotations as weak supervision:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Column grounding:<\/strong> given a column\u2014for example, <code>train number<\/code> in Figure 2\u2014predict whether it\u2019s relevant to the utterance<\/li>\n\n\n\n<li><strong>Value grounding:<\/strong> given a word in the utterance\u2014for example, <em>\u201c11417\u201d<\/em>\u2014predict whether it refers to a cell in some column<\/li>\n\n\n\n<li><strong>Column-value mapping:<\/strong> given a word\u2014such as <em>\u201c11417<\/em>\u201d\u2014and a column name\u2014<code>train number<\/code>\u2014predict whether they align<\/li>\n<\/ul>\n\n\n\n<p>Importantly, the StruG model can\u2019t observe database content when predicting these tasks. It learns contextualized alignment solely from the utterance and the data schema. After pretraining, we apply StruG in DBQA to emit initial question\/table representations that communicate information about the context structure to the downstream RAT-SQL model.<\/p>\n\n\n\n<p>On the Spider dataset, StruG-augmented RAT-SQL performs competitively with all state-of-the-art models even without using database content. More importantly, its learned text-table alignment makes database question answering more robust. When development-set questions in Spider are rephrased in a more realistic, fluid natural language, execution accuracy of state-of-the-art models drops by 11 to 20 percent, but StruG-augmented RAT-SQL never suffers more than a 10 percent loss.<\/p>\n\n\n\n<h2 id=\"score-pretraining-for-conversation-context-representation\">SCoRe: Pretraining for conversation context representation<\/h2>\n\n\n\n<p>The solutions to the data contextualization challenge are also relevant for the conversation contextualization challenge. To account for multi-turn dynamics of dialogue, a conversational system must ground an individual\u2019s question in both its data context\u2014that is, its ontology\u2014and in the questions from previous turns. SCoRe introduces a task-oriented pretraining methodology to encode both.<\/p>\n\n\n\n<p>SCoRe pretrains a task-oriented language model contextualized by the conversational flow and the underlying ontology. In pretraining, the SCoRe model is self-supervised by two novel task-oriented objectives in addition to the established <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/arxiv.org\/abs\/1810.04805\">masked language model (MLM) objective<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. These objectives facilitate the accurate representation of the conversational flow between dialogue turns and how this flow maps to the desired columns in the ontology. For example, in the first question of Figure 1, <em>\u201cFind the names of the top 3 highest sales books,\u201d<\/em> the model needs to apply the <em>order by<\/em> operation to the column <code>sale_amount<\/code> to find the books with the highest sales. In the follow-up question \u201c<em>Who are their authors,\u201d<\/em> the model needs to understand that it should maintain the context of the previous question while also selecting a new column, <code>name<\/code>, from the <code>author <\/code>table<code>.<\/code><\/p>\n\n\n\n<p>The first pretraining objective of SCoRe, <em>Column Contextual Semantics (CCS),<\/em> aligns the question with the ontology. For each column in the ontology, CCS trains the model to predict the operations that should be performed on this column in each conversational turn. Specifically, SCoRe uses the encoding of each column or table name to predict its corresponding operation. The second pretraining objective, <em>Turn Contextual Switch (TCS)<\/em>, captures the conversational flow and how it\u2019s grounded in the model\u2019s expected output programs. It aims to predict the difference in programs from different dialogue turns based on the corresponding user questions. For example, the current turn may differ from the previous one by adding an additional filtering condition or changing the order of the results.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><a data-bi-bhvr=\"14\"  data-bi-cn=\"A diagram of the pretraining of a SCoRe encoder. It shows a \u201cTransformer Encoder\u201d block that takes three blocks of input\u2014\u200b\u201cCurrent Question,\u201d \u201cDialogue History,\u201d and \u201cDatabase Schema\u201d\u2014\u200band produces multiple outputs. The \u201cCurrent Question\u201d block contains the sentence \u201calso show the names of their publishers\u201d preceded by a special separator token. The \u201cDialogue History\u201d block contains the two sentences \u201cwho are \u2026 authors\u201d and \u201cfind <mask> \u2026 books\u201d delimited by separator tokens with ellipsis signifying additional words in the middle and \u201c<mask>\u201d signifying a special masked-word token. The \u201cDatabase Schema\u201d block contains the column names \u201cauthor id\u201d, \u201cauthor name\u201d, and \u201csale amount\u201d, each surrounded by separator tokens, and an ellipsis signifying additional columns. The first two outputs are for the TCS objective. They\u2019re aligned with the separator token from the current question and from the first sentence of the dialogue history, respectively, and show the target output \u201cINS(SELECT.column)\u201d and \u201cINS(SELECT.column)\u201d, respectively. The third output is for the MLM objective. It\u2019s aligned with the masked-word token in the dialogue history and shows the target output \u201cthe\u201d, which is the word that was masked. The final three outputs are for the CCS objective with an ellipsis signifying additional CCS outputs. They\u2019re aligned with column names \u201cauthor id\u201d, \u201cauthor name\u201d, and \u201csale amount\u201d, respectively, in the database schema. They show the target outputs \u201cNone\u201d, \u201cSELECT\u201d, and \u201cORDER BY DESC LIMIT\u201d, respectively. \" href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/04\/Fig-3-SCORE-blog.jpg\"><img decoding=\"async\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/04\/Fig-3-SCORE-blog-1024x315.jpg\" alt=\"A diagram of the pretraining of a SCoRe encoder. It shows a \u201cTransformer Encoder\u201d block that takes three blocks of input\u2014\u200b\u201cCurrent Question,\u201d \u201cDialogue History,\u201d and \u201cDatabase Schema\u201d\u2014\u200band produces multiple outputs. The \u201cCurrent Question\u201d block contains the sentence \u201calso show the names of their publishers\u201d preceded by a special separator token. The \u201cDialogue History\u201d block contains the two sentences \u201cwho are \u2026 authors\u201d and \u201cfind <mask> \u2026 books\u201d delimited by separator tokens with ellipsis signifying additional words in the middle and \u201c<mask>\u201d signifying a special masked-word token. The \u201cDatabase Schema\u201d block contains the column names \u201cauthor id\u201d, \u201cauthor name\u201d, and \u201csale amount\u201d, each surrounded by separator tokens, and an ellipsis signifying additional columns. The first two outputs are for the TCS objective. They\u2019re aligned with the separator token from the current question and from the first sentence of the dialogue history, respectively, and show the target output \u201cINS(SELECT.column)\u201d and \u201cINS(SELECT.column)\u201d, respectively. The third output is for the MLM objective. It\u2019s aligned with the masked-word token in the dialogue history and shows the target output \u201cthe\u201d, which is the word that was masked. The final three outputs are for the CCS objective with an ellipsis signifying additional CCS outputs. They\u2019re aligned with column names \u201cauthor id\u201d, \u201cauthor name\u201d, and \u201csale amount\u201d, respectively, in the database schema. They show the target outputs \u201cNone\u201d, \u201cSELECT\u201d, and \u201cORDER BY DESC LIMIT\u201d, respectively. \" class=\"wp-image-742771\"\/><\/a><figcaption class=\"wp-element-caption\">Figure 3: The SCoRe model is a Transformer-based encoder that takes as input the current natural language question along with its two kinds of context: <em>conversation context, <\/em>or its dialogue history of previous questions, and <em>data context,<\/em> or its database schema or other ontology. To encode this joint input into informative distributed representations, SCoRe is trained to predict three objectives. For each dialogue question, the Turn Contextual Switch (TCS) objective predicts the expected difference between the desired formal program and the program for the previous turn\u2019s question. For each column in the data context, the Column Contextual Semantics (CCS) objective predicts a desired operation that should be performed upon this column in the desired formal program, if any. Finally, the standard MLM objective reconstructs words in the input that were masked with a special token.<\/figcaption><\/figure>\n\n\n\n<h2 id=\"state-of-the-art-results\">State-of-the-art results<\/h2>\n\n\n\n<p>Our empirical results show that SCoRe can be effectively used as a feature representation encoder with strong baseline models for a wide range of tasks and can significantly improve the performance of existing strong baseline models by simply replacing an existing pretrained language model with our SCoRe pretrained model.<\/p>\n\n\n\n<p>SCoRe achieves state-of-the-art results when evaluated on three popular benchmarks for task-oriented conversational systems: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/yale-lily.github.io\/sparc\">SParC<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (sequential text-to-SQL), <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/yale-lily.github.io\/cosql\">CoSQL<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (conversational text-to-SQL), and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/github.com\/budzianowski\/multiwoz\">MultiWOZ<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (dialogue state tracking). It also performs competitively with state-of-the-art techniques on <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/aka.ms\/sqa\">SQA<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (sequential question answering). Moreover, SCoRe delivers even larger improvements when in-domain data is limited\u2014for example, in a low-resource setting where only 10 percent of the training data is available. This wide range of applications demonstrates the effectiveness of addressing both context grounding challenges jointly.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE_TABLE1_UPDATED-1024x489.jpg\" alt=\"table\" class=\"wp-image-743278\" width=\"872\" height=\"416\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE_TABLE1_UPDATED-1024x489.jpg 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE_TABLE1_UPDATED-300x143.jpg 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE_TABLE1_UPDATED-768x367.jpg 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE_TABLE1_UPDATED-16x8.jpg 16w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/SCORE_TABLE1_UPDATED.jpg 1235w\" sizes=\"auto, (max-width: 872px) 100vw, 872px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Table_SCORE-3_UPDATED.jpg\" alt=\"table\" class=\"wp-image-743284\" width=\"747\" height=\"547\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Table_SCORE-3_UPDATED.jpg 820w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Table_SCORE-3_UPDATED-300x220.jpg 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Table_SCORE-3_UPDATED-768x562.jpg 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Table_SCORE-3_UPDATED-16x12.jpg 16w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/Table_SCORE-3_UPDATED-80x60.jpg 80w\" sizes=\"auto, (max-width: 747px) 100vw, 747px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"459\" height=\"582\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/TABLES2_SCORE-UPDATED.jpg\" alt=\"table\" class=\"wp-image-743281\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/TABLES2_SCORE-UPDATED.jpg 459w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/TABLES2_SCORE-UPDATED-237x300.jpg 237w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/TABLES2_SCORE-UPDATED-9x12.jpg 9w\" sizes=\"auto, (max-width: 459px) 100vw, 459px\" \/><figcaption class=\"wp-element-caption\">SCoRe was analyzed on the SParC (sequential text-to-SQL), CoSQL (conversational text-to-SQL), MultiWOZ (dialogue state tracking), and SQA (sequential question answering) tasks, performing competitively or achieving state-of-the-art results. The first table shows the SParC and CoSQL accuracy over all questions (QM stands for \u201cquestion match\u201d) and all interactions (IM stands for \u201cinteraction match\u201d). The second table shows QM and IM accuracies on the SQA test set. The third table shows joint goal accuracies on the MultiWOZ 2.1 test set.<\/figcaption><\/figure>\n\n\n\n<h2 id=\"revolutionizing-interaction-with-data\">Revolutionizing interaction with data<\/h2>\n\n\n\n<p>Task-oriented conversational systems can revolutionize people\u2019s natural interaction with structured data and APIs. Using natural language as a universal interface has been a major goal of human-computer interaction and knowledge management fields for decades. Early attempts have faced challenges because of limitations in language understanding capability, extensibility, and transparency, among other areas. However, recent years have seen a major resurgence powered by interest in impactful applications such as personal digital assistants, question answering systems, automatic reporting, and AI-assisted data science.<\/p>\n\n\n\n<p>Many challenges remain. Conversational interfaces also require us to make systems reasoning and results explainable and trustworthy to those using them. Language-driven exploration must be supported by interactive interfaces for debugging and correcting generated programs or the underlying dataset. Creating systems that interact with those using them to resolve knowledge gaps and continue to learn to reduce human intervention over time remains an open research challenge. We\u2019ve studied additional forms of interaction to address these challenges in language-driven data exploration, including <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/nl-edit-correcting-semantic-parse-errors-through-natural-language-interaction\/\">natural language feedback<\/a> for correcting misinterpretations and <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/diy-assessing-the-correctness-of-natural-language-to-sql-systems\/\">Debug-It-Yourself (DIY)<\/a> multimodal feedback for assessing a system\u2019s responses and fixing errors in an interactive user interface. Both improve systems\u2019 accuracy and transparency, yet more research is needed to integrate them into task-oriented conversational systems more broadly. We hope that <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/speak-to-your-parser-interactive-text-to-sql-with-natural-language-feedback\/\">SPLASH<\/a>, a dataset of utterances, misinterpretations, and corrections we created from our feedback studies, will prove useful to facilitate that research.<\/p>\n\n\n\n<p>Finally, many NLU challenges stem from the limitations of current benchmarks and datasets of task-oriented conversational systems. As we scale our techniques and integrate them into real-world applications, we\u2019ll encounter more realistic scenarios and workflows, which will undoubtedly expose new research challenges. As such, language-driven data exploration will not only be one of the most impactful applications for the field, but also the catalyst to its further progress.<\/p>\n\n\n\n<p><strong><em>Acknowledgment<\/em><\/strong><\/p>\n\n\n\n<p><em>The development of SCoRe is a result of the collaborative efforts of <\/em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/taoyds.github.io\/\"><em>Tao Yu<\/em><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><em> of Yale University, <\/em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/ryanzhumich.github.io\"><em>Rui Zhang<\/em><span class=\"sr-only\"> (opens in new tab)<\/span><\/a> <em>of The Pennsylvania State University, Microsoft researchers <\/em><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/polozov\/\"><em>Alex Polozov<\/em><\/a><em> and <\/em><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/meek\/\"><em>Chris Meek<\/em><\/a><em>, and Microsoft Senior Principal Research Manager <\/em><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/hassanam\/\"><em>Ahmed H. Awadallah<\/em><\/a><em>. RAT-SQL was led by <\/em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/berlino.github.io\/\"><em>Bailin Wang<\/em><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><em> of The University of Edinburgh in collaboration with Microsoft researchers <\/em><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/eush\/\"><em>Richard Shin<\/em><\/a><em>, <\/em><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/xiaodl\/\"><em>Xiaodong Liu<\/em><\/a><em>, and Polozov and former Microsoft researcher <\/em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.linkedin.com\/in\/matt-richardson-0696295\"><em>Matthew Richardson<\/em><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><em>. StruG was led by <\/em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/xiang-deng.github.io\/\"><em>Xiang Deng<\/em><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><em> of The Ohio State University in collaboration with Awadallah, Meek, Polozov, Richardson, and assistant professor <\/em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/web.cse.ohio-state.edu\/~sun.397\/index.html\"><em>Huan Sun<\/em><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><em>. Yu, Wang, and Deng conducted the work during their Microsoft Research internships.<\/em><\/p>\n\n\n\n<div class=\"wp-block-group is-layout-flow wp-block-group-is-layout-flow\">\n<hr class=\"wp-block-separator has-css-opacity\"\/>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>One key aspiration of AI is to develop natural and effective task-oriented conversational systems. Task-oriented conversational systems use a natural language interface to collaborate with and support people in accomplishing specific goals and activities. They go beyond chitchat conversation. For example, as personal digital assistants, they ease the stress of trip planning or reduce the [&hellip;]<\/p>\n","protected":false},"author":38838,"featured_media":743173,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[{"type":"user_nicename","value":"Alex Polozov","user_id":"36711"},{"type":"user_nicename","value":"Chris Meek","user_id":"32868"},{"type":"user_nicename","value":"Ahmed H. Awadallah","user_id":"31979"}],"msr_hide_image_in_river":0,"footnotes":""},"categories":[1],"tags":[],"research-area":[13556,13545],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-742696","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research-blog","msr-research-area-artificial-intelligence","msr-research-area-human-language-technologies","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[392600,702211],"related-projects":[724078],"related-events":[725710],"related-researchers":[{"type":"user_nicename","value":"Ahmed H. Awadallah","user_id":31979,"display_name":"Ahmed Awadallah","author_link":"<a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/hassanam\/\" aria-label=\"Visit the profile page for Ahmed Awadallah\">Ahmed Awadallah<\/a>","is_active":false,"last_first":"Awadallah, Ahmed","people_section":0,"alias":"hassanam"}],"msr_type":"Post","featured_image_thumbnail":"<img width=\"960\" height=\"540\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-960x540.jpg\" class=\"img-object-cover\" alt=\"An example of a multi-turn text-to-SQL task. The user query \u201cFind the names of the top 3 highest sales books\u201d corresponds to the formal program \u201cSELECT title FROM book ORDER BY sale_amount DESC LIMIT 3\u201d. The follow-up user query, \u201cWho are their authors,\u201d corresponds to the formal program \u201cSELECT t1.title, t1.name FROM author AS t1 JOIN book AS t2 ON t1.id = t2.author_id ORDER BY t2.sale_amount DESC LIMIT 3\u201d. In the corresponding database, there is an \u201cAuthor\u201d table with an \u201cid\u201d column, a \u201cname\u201d column, a \u201ccountry\u201d column, and an ellipsis signifying additional columns; a \u201cPress\u201d table with an \u201cid\u201d column, a \u201cname\u201d column, an \u201caddress\u201d column, and an ellipsis signifying additional columns; and a \u201cBook\u201d table with an \u201cid\u201d column, a \u201ctitle\u201d column, an \u201cauthor id\u201d column, a \u201csale_amount\u201d column, and an ellipsis signifying additional columns.\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-960x540.jpg 960w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-300x169.jpg 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-1024x577.jpg 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-768x432.jpg 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-1536x865.jpg 1536w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-2048x1153.jpg 2048w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-16x9.jpg 16w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-1066x600.jpg 1066w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-655x368.jpg 655w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-343x193.jpg 343w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-640x360.jpg 640w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-1280x720.jpg 1280w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2021\/05\/1400x788_Score_blog_still_nologo-3-1920x1080.jpg 1920w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" \/>","byline":"Alex Polozov, Chris Meek, and <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/hassanam\/\" title=\"Go to researcher profile for Ahmed Awadallah\" aria-label=\"Go to researcher profile for Ahmed Awadallah\" data-bi-type=\"byline author\" data-bi-cN=\"Ahmed Awadallah\">Ahmed Awadallah<\/a>","formattedDate":"May 3, 2021","formattedExcerpt":"One key aspiration of AI is to develop natural and effective task-oriented conversational systems. Task-oriented conversational systems use a natural language interface to collaborate with and support people in accomplishing specific goals and activities. They go beyond chitchat conversation. For example, as personal digital assistants,&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/742696","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/users\/38838"}],"replies":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/comments?post=742696"}],"version-history":[{"count":44,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/742696\/revisions"}],"predecessor-version":[{"id":927219,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/742696\/revisions\/927219"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media\/743173"}],"wp:attachment":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media?parent=742696"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/categories?post=742696"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/tags?post=742696"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=742696"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=742696"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=742696"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=742696"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=742696"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=742696"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=742696"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=742696"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}