{"id":570525,"date":"2019-03-12T16:58:51","date_gmt":"2019-03-12T23:58:51","guid":{"rendered":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/?p=570525"},"modified":"2019-03-12T16:58:51","modified_gmt":"2019-03-12T23:58:51","slug":"calling-all-aspiring-women-in-data-science","status":"publish","type":"post","link":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/calling-all-aspiring-women-in-data-science\/","title":{"rendered":"Calling all aspiring women in Data Science"},"content":{"rendered":"<div id=\"attachment_570561\" style=\"width: 1410px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-570561\" class=\"wp-image-570561 size-full\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788.png\" alt=\"\" width=\"1400\" height=\"788\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788.png 1400w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-300x169.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-768x432.png 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-1024x576.png 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-1066x600.png 1066w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-655x368.png 655w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-343x193.png 343w\" sizes=\"auto, (max-width: 1400px) 100vw, 1400px\" \/><p id=\"caption-attachment-570561\" class=\"wp-caption-text\">Datathon participants at the Microsoft New England Research and Development center. Photo credit: Dana J. Quigley; @DJQPhotography<\/p><\/div>\n<p>What started as a one-day conference organized by Stanford University in 2015, Women in Data Science (WiDS) has blossomed into a movement bringing together women data scientists and aspiring data scientists via a series of over 150 virtual and in-person events worldwide, ultimately culminating in the March 4, 2019 main event at Stanford. Microsoft is a proud partner of WiDS; in addition to supporting the Datathon via the webinar, Microsoft also provided Xboxes as prizes.<\/p>\n<p>One of the main drivers for engagement is the WiDS Datathon, now in its second year, that kicks off in the weeks preceding the conference, with the winners announced at Stanford during the conference. This year\u2019s Datathon had participants working on a classic image classification problem using computer vision techniques. The challenge to be solved is an environmental one. Rampant deforestation caused by oil palm production (oil palm is a common ingredient across products in everyday use) has led to devastation of the eco habitats of many animal and plant species. One way to get ahead of the problem is to identify where the deforestation is taking place. These are remote regions and satellite imagery is an effective means of smart detection and intervention. <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.planet.com\/pulse\/planet-partners-with-women-in-data-science-datathon\/\">Planet<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> provided a set of hi-res satellite images and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.figure-eight.com\/figure-eight-partners-with-the-women-in-data-science-conference-on-a-datathon-for-global-good\/\">Figure8<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> helped annotate them and created a training, testing and holdout dataset for the Datathon. The Datathon has led to <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/medium.com\/participate-in-the-wids-2019-datathon\/wids-datathon-workshops-worldwide-316b776dc43c\">workshops<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> in several countries with participants coming together to form teams to solve the challenge.<\/p>\n<p>Datathon rules allow for teams of up to four people, with the requirement that at least half of each team be female or identify as female. Within weeks, the Datathon attracted over 200 teams. I took a shot at solving the problem using Microsoft Custom vision, one of the cognitive services available on Azure. Using the custom vision UI, I was able to build a classifier with a handful of training images within minutes. Extending the classifier to include hundreds of images was easy using the Python SDK for Custom vision. Such is the power of cognitive services in Azure; you can build a transfer learning-based powerful image classification algorithm with less than 100 lines of code. The model improved by simply continuing to add more images from the geo-images training dataset to the existing custom vision model, which was a simple and effective demonstration of the importance of increasing training data for higher model accuracy.<\/p>\n<table class=\"aligncenter\" style=\"height: 143px; width: 95%; border-collapse: separate; border-spacing: 0px;\" border=\"1\" cellspacing=\"0\" cellpadding=\"5\">\n<tbody>\n<tr style=\"height: 24px;\">\n<td style=\"width: 24.9057%; padding: 5px; border: 1px solid; height: 24px;\"><strong>Training images count<\/strong><\/td>\n<td style=\"width: 24.6541%; padding: 5px; border: 1px solid; height: 24px;\"><strong>Precision<\/strong><\/td>\n<td style=\"width: 24.7799%; padding: 5px; border: 1px solid; height: 24px;\"><strong>Recall<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 24.9057%; padding: 5px; border: 1px solid; height: 24px;\">60<\/td>\n<td style=\"width: 24.6541%; padding: 5px; border: 1px solid; height: 24px;\">79.60%<\/td>\n<td style=\"width: 24.7799%; padding: 5px; border: 1px solid; height: 24px;\">79.60%<\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 24.9057%; padding: 5px; border: 1px solid; height: 24px;\">1,800<\/td>\n<td style=\"width: 24.6541%; padding: 5px; border: 1px solid; height: 24px;\">97.50%<\/td>\n<td style=\"width: 24.7799%; padding: 5px; border: 1px solid; height: 24px;\">97.10%<\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 24.9057%; padding: 5px; border: 1px solid; height: 24px;\">5,000<\/td>\n<td style=\"width: 24.6541%; padding: 5px; border: 1px solid; height: 24px;\">99.60%<\/td>\n<td style=\"width: 24.7799%; padding: 5px; border: 1px solid; height: 24px;\">99.10%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>We hosted a WiDS webinar that covered basic machine learning concepts and a tutorial with the custom vision solution. The webinar <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.youtube.com\/watch?v=iFoxJDuhfds\">recording<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/drive.google.com\/file\/d\/13b7UutoZnhOf7xYBer02JvZWduN_se75\/view\">slides<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> are available for those who missed it.<\/p>\n<p>This democratization of machine learning tools is an important factor in opening up the field of data science to a wide audience of data science students and practitioners. The other factor, especially relevant to attracting women to data science, is the focus on socially relevant datasets and problems, such as this year\u2019s oil palm classification problem.<\/p>\n<p>Data science for social good is an important sub field within the data science community with efforts such as the annual <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/dssg.uchicago.edu\/kddsocialimpact\/\">Workshop on Social Impact<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> at <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.kdd.org\/kdd2019\/\">KDD<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and efforts such as the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/dssg.uchicago.edu\/\">Data Science for Social Good Summer Fellowship<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> started at University of Chicago and now offered by <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/escience.washington.edu\/get-involved\/incubator-programs\/data-science-for-social-good\/\">University of Washington<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/dsi.ubc.ca\/apply-dssg-program\">University of British Columbia<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and other universities. The emphasis on leveraging data for altruistic goals is also evident in computer science departments across higher education that are currently pivoting to data science education. For example, the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/datascience.berkeley.edu\/\">Data Science program offered at the University of California Berkeley<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, based on real datasets, has been a great catalyst in getting women into computing in unprecedented numbers\u2014<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/data.berkeley.edu\/news\/data-8-thrives-and-campus\">half the enrolled students are women<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, in contrast to traditional computer science courses. Greater numbers of women skilled in data science will help to fill the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.theguardian.com\/lifeandstyle\/2019\/feb\/23\/truth-world-built-for-men-car-crashes\">data gap that has created a pervasive but invisible bias with a profound effect<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> on women&#8217;s lives.<\/p>\n<div id=\"attachment_570834\" style=\"width: 1441px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-570834\" class=\"wp-image-570834 size-full\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/03\/WiDS-Datathon-Collaboration-Day.png\" alt=\"\" width=\"1431\" height=\"1073\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/03\/WiDS-Datathon-Collaboration-Day.png 1431w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/03\/WiDS-Datathon-Collaboration-Day-300x225.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/03\/WiDS-Datathon-Collaboration-Day-768x576.png 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/03\/WiDS-Datathon-Collaboration-Day-1024x768.png 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/03\/WiDS-Datathon-Collaboration-Day-80x60.png 80w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/03\/WiDS-Datathon-Collaboration-Day-240x180.png 240w\" sizes=\"auto, (max-width: 1431px) 100vw, 1431px\" \/><p id=\"caption-attachment-570834\" class=\"wp-caption-text\">Participants at the UC Berkeley WiDS Datathon Collaboration Day. Photo credit WiDS ambassadors Emily Liu and Mariah Rogers<\/p><\/div>\n<p>More broadly than data science, AI has a burgeoning effort of socially relevant subfields that are applicable to a growing demographic of women technologists and students. These include topics such as eliminating bias in AI systems through fairness, accountability and transparency, secure machine learning, privacy, ethics, policy impacting and domain specific machine learning.<\/p>\n<p>This year, the WiDS Datathon has resulted in regional <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/nam06.safelinks.protection.outlook.com\/?url=https%3A%2F%2Fmedium.com%2Fparticipate-in-the-wids-2019-datathon%2Fwids-datathon-workshops-worldwide-316b776dc43c&data=02%7C01%7Cvanim%40exchange.microsoft.com%7C7ef3a55d35fc47d82dde08d69792e70b%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C636863054589935817&sdata=J6FIiY0kiADTeNGVHz%2B9yup2K8fy3ECFsK2mhV95h%2FI%3D&reserved=0\">Datathon workshops<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> around the globe, for example, the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/data.berkeley.edu\/file\/871\">WiDS Data Collaboration Day at UC Berkeley<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, and a meetup at the Microsoft New England Research and Development center.<\/p>\n<p>Congratulations to all participants \u2013 <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.widsconference.org\/datathon.html\">visit the WiDS Datathon page<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0for the full list of winners. We look forward to continuing our engagement with the growing community of data scientists as they tackle challenges that will have positive lasting impact on research and technology!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>What started as a one-day conference organized by Stanford University in 2015, Women in Data Science (WiDS) has blossomed into a movement bringing together women data scientists and aspiring data scientists via a series of over 150 virtual and in-person events worldwide, ultimately culminating in the March 4, 2019 main event at Stanford. Microsoft is [&hellip;]<\/p>\n","protected":false},"author":38022,"featured_media":570561,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[{"type":"user_nicename","value":"Vani Mandava","user_id":"34489"}],"msr_hide_image_in_river":0,"footnotes":""},"categories":[194453],"tags":[],"research-area":[13556,13563],"msr-region":[197900],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-570525","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-science","msr-research-area-artificial-intelligence","msr-research-area-data-platform-analytics","msr-region-north-america","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","featured_image_thumbnail":"<img width=\"960\" height=\"540\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788.png\" class=\"img-object-cover\" alt=\"Datathon participants at the Microsoft New England Research and Development center. Photo Credit: Dana Quigley\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788.png 1400w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-300x169.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-768x432.png 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-1024x576.png 1024w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-1066x600.png 1066w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-655x368.png 655w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2019\/02\/WiDS_Blog_Site_02_2019_1400x788-343x193.png 343w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" \/>","byline":"Vani Mandava","formattedDate":"March 12, 2019","formattedExcerpt":"What started as a one-day conference organized by Stanford University in 2015, Women in Data Science (WiDS) has blossomed into a movement bringing together women data scientists and aspiring data scientists via a series of over 150 virtual and in-person events worldwide, ultimately culminating in&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/570525","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/users\/38022"}],"replies":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/comments?post=570525"}],"version-history":[{"count":30,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/570525\/revisions"}],"predecessor-version":[{"id":573063,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/570525\/revisions\/573063"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media\/570561"}],"wp:attachment":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media?parent=570525"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/categories?post=570525"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/tags?post=570525"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=570525"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=570525"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=570525"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=570525"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=570525"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=570525"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=570525"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=570525"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}