{"id":555504,"date":"2018-12-11T09:09:46","date_gmt":"2018-12-11T17:09:46","guid":{"rendered":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/?p=555504"},"modified":"2018-12-12T11:32:47","modified_gmt":"2018-12-12T19:32:47","slug":"first-textworld-problems-microsoft-research-montreals-latest-ai-competition-is-really-cooking","status":"publish","type":"post","link":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/first-textworld-problems-microsoft-research-montreals-latest-ai-competition-is-really-cooking\/","title":{"rendered":"First TextWorld Problems\u2014Microsoft Research Montreal\u2019s latest AI competition is really cooking"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"size-large wp-image-555507 aligncenter\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-1024x576.png\" alt=\"textworld at neurips 2018\" width=\"1024\" height=\"576\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-1024x576.png 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-300x169.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-768x432.png 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-1066x600.png 1066w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-655x368.png 655w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-343x193.png 343w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788.png 1400w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p>This week, Microsoft Research threw down the gauntlet with the launch of a competition challenging researchers around the world to develop AI agents that can solve text-based games. Conceived by the Machine Reading Comprehension team at Microsoft Research Montreal, the competition\u2014<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/aka.ms\/textworld\">First TextWorld Problems: A Reinforcement and Language Learning Challenge<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u2014runs from December 8, 2018 through May 31, 2019.<\/p>\n<p>First TextWorld Problems is built on the TextWorld framework. TextWorld was released to the public in July 2018 at <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/aka.ms\/textworld\">aka.ms\/textworld<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. TextWorld is an extensible, sandbox learning environment for reinforcement learning in text-based games. Beyond game simulation, it has the capacity to generate games stochastically from a user-specified distribution. Such a distribution of games opens new possibilities for the study of generalization and continual or meta-learning in a reinforcement learning setting, by enabling researchers to train and test agents on distinct but related games. TextWorld\u2019s generator gives fine control over game parameters like the size of the game world, the branching factor and length of quests, the density of rewards, and the stochasticity of transitions. Game vocabulary can also be controlled; this directly affects the action and observation spaces. Researchers can also use TextWorld to handcraft games that test for specific knowledge and skills.<\/p>\n<p>The theme for First TextWorld Problems is gathering ingredients to cook a recipe. Agents must determine the necessary ingredients from a recipe book, explore the house to gather ingredients, and return to the kitchen to cook up a delicious meal. Additionally, agents will need to use tools like knives and frying pans. Locked doors and other obstacles along the way must be overcome. The necessary ingredients and their locations change from game to game, as does the layout of the house itself; agents cannot simply memorize a procedure in order to succeed.<\/p>\n<div id=\"attachment_556203\" style=\"width: 522px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-556203\" class=\"size-large wp-image-556203\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/graph_hang-on-did-someone-change-the-floorplan_zoom-512x1024.png\" alt=\"Hang on \u2026 did someone change the floorplan in this house? Example house layouts generated by TextWorld.\" width=\"512\" height=\"1024\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/graph_hang-on-did-someone-change-the-floorplan_zoom-512x1024.png 512w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/graph_hang-on-did-someone-change-the-floorplan_zoom-150x300.png 150w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/graph_hang-on-did-someone-change-the-floorplan_zoom-768x1536.png 768w\" sizes=\"auto, (max-width: 512px) 100vw, 512px\" \/><p id=\"caption-attachment-556203\" class=\"wp-caption-text\">Hang on \u2026 did someone change the floorplan in this house? Example house layouts generated by TextWorld.<\/p><\/div>\n<p>While a simple cooking task may seem quotidian by human standards, it is still very difficult for AI. Observations and actions are all text-based (see the example below), so a successful agent must learn to understand and manipulate its environment through language, as well as to ground its language in the environmental dynamics. It must also deal with classic, open reinforcement learning problems like partial observability and sparse rewards.<\/p>\n<div id=\"attachment_555516\" style=\"width: 712px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-555516\" class=\"wp-image-555516 size-full\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/An-example-of-a-text-based-cooking-game-whipped-up-in-the-TextWorld-framework-kitchen.png\" alt=\"\" width=\"702\" height=\"416\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/An-example-of-a-text-based-cooking-game-whipped-up-in-the-TextWorld-framework-kitchen.png 702w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/An-example-of-a-text-based-cooking-game-whipped-up-in-the-TextWorld-framework-kitchen-300x178.png 300w\" sizes=\"auto, (max-width: 702px) 100vw, 702px\" \/><p id=\"caption-attachment-555516\" class=\"wp-caption-text\">An example of a text-based cooking game whipped up in the TextWorld framework kitchen.<\/p><\/div>\n<p>We hope this competition fosters research into generalization across tasks, meta-learning, zero-shot language understanding, common-sense reasoning, efficient exploration, and effective handling of combinatorial action spaces.\u00a0The winning team will be awarded a prize of $2000 USD, plus an exclusive one-hour discussion session with a Microsoft Research researcher, as well as being featured in a <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/\">Microsoft Research blog<\/a> post and in an accompanying article in the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/na01.safelinks.protection.outlook.com\/?url=https%3A%2F%2Fnote.microsoft.com%2Fww-registration-microsoft-research-newsletter-s.html%3Fwt.mc_id%3DS-webpage_msr-homepage&data=02%7C01%7Cv-emmary%40microsoft.com%7C7e523ff170e24261208c08d65ac7cb74%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C636796211482244204&sdata=MysvuF0GryqfjmGmukPo%2BvpBGxilFR0YdytRrEfjqfk%3D&reserved=0\">Microsoft Research Newsletter<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (some restrictions apply, please check competition rules and regulations for details.)<\/p>\n<p>Did we pique your interest? We encourage everyone to put their reinforcement learning prowess\u2014and culinary talents\u2014to the test in First TextWorld Problems. Go to <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/aka.ms\/textworld-challenge\">aka.ms\/textworld-challenge<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and sign up today!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This week, Microsoft Research threw down the gauntlet with the launch of a competition challenging researchers around the world to develop AI agents that can solve text-based games. Conceived by the Machine Reading Comprehension team at Microsoft Research Montreal, the competition\u2014First TextWorld Problems: A Reinforcement and Language Learning Challenge\u2014runs from December 8, 2018 through May [&hellip;]<\/p>\n","protected":false},"author":37074,"featured_media":555507,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[{"type":"user_nicename","value":"Wendy Tay","user_id":"37200"},{"type":"user_nicename","value":"Adam Trischler","user_id":"37143"}],"msr_hide_image_in_river":0,"footnotes":""},"categories":[241770],"tags":[],"research-area":[13556],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-555504","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence","msr-research-area-artificial-intelligence","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[863034],"related-projects":[442191],"related-events":[508112],"related-researchers":[],"msr_type":"Post","featured_image_thumbnail":"<img width=\"960\" height=\"540\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788.png\" class=\"img-object-cover\" alt=\"textworld at neurips 2018\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788.png 1400w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-300x169.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-768x432.png 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-1024x576.png 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-1066x600.png 1066w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-655x368.png 655w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/12\/Textworld-Part-2_Site_11_2018_1400x788-343x193.png 343w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" \/>","byline":"Wendy Tay and Adam Trischler","formattedDate":"December 11, 2018","formattedExcerpt":"This week, Microsoft Research threw down the gauntlet with the launch of a competition challenging researchers around the world to develop AI agents that can solve text-based games. Conceived by the Machine Reading Comprehension team at Microsoft Research Montreal, the competition\u2014First TextWorld Problems: A Reinforcement&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/555504","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/users\/37074"}],"replies":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/comments?post=555504"}],"version-history":[{"count":9,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/555504\/revisions"}],"predecessor-version":[{"id":556977,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/555504\/revisions\/556977"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media\/555507"}],"wp:attachment":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media?parent=555504"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/categories?post=555504"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/tags?post=555504"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=555504"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=555504"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=555504"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=555504"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=555504"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=555504"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=555504"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=555504"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}