{"id":356072,"date":"2017-01-20T11:15:07","date_gmt":"2017-01-20T19:15:07","guid":{"rendered":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/?p=356072"},"modified":"2017-01-20T11:15:07","modified_gmt":"2017-01-20T19:15:07","slug":"project-privtree-blurring-location-privacy","status":"publish","type":"post","link":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/blog\/project-privtree-blurring-location-privacy\/","title":{"rendered":"Project PrivTree: Blurring your \u201cwhere\u201d for location privacy"},"content":{"rendered":"<p><em>By Winnie Cui, Senior Research Manager, Microsoft Research Asia<\/em><\/p>\n<p>Data scientist, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"https:\/\/www.linkedin.com\/in\/anthony-tockar-474a7252\/\" target=\"_blank\">Anthony Tockar<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, used publicly available location data to show how <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"https:\/\/research.neustar.biz\/author\/atockar\/\" target=\"_blank\">celebrities can be tracked throughout New York City<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, while working on his Master\u2019s Degree at Northwestern University. By cross-referencing public news and photos about celebrities hailing cabs in NYC, Tockar found out exactly where celebrities climbed into cabs, where they traveled and even how much they paid!<\/p>\n<p>As this example shows, location-based services, pulling an individual\u2019s location data from GPS, IP addresses and Wi-Fi network mapping, can be a privacy nightmare. But they can also be incredibly valuable, offering real-time navigation, local weather, geographically targeted search engine results, and other useful functions.<\/p>\n<p>A 2011 Microsoft survey, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/thelbma.com\/research\/3\/microsoft-location-usage-and-perceptions\/\" target=\"_blank\">Location Usage & Perceptions<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, found that 94 percent of customers considered location-based services valuable. However, the same survey found that 52 percent were concerned about the privacy issues related to the use of geolocation data.<\/p>\n<p>The privacy issue is now a focus of attention in the research community. \u201cToday\u2019s computing power and scale of publicly available data makes it easier to identify individuals from the data,&#8221; said <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www3.ntu.edu.sg\/home\/xkxiao\/\" target=\"_blank\">Professor Xiaokui Xiao<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> at Nanyang Technological University (NTU).<\/p>\n<p>Recently, the collaboration between Professor Xiaokui Xiao\u2019s team and <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/group\/social-computing-beijing\/\">Dr. Xing Xie\u2019s group<\/a> at Microsoft Research Asia in Beijing has found a way that might alleviate the privacy concerns. The team proposes a data manipulation technique, called PrivTree, which pre-processes geolocation data to protect individual privacy. Subsequently, the privatized data can be safely used in any prospective analysis, or even made publicly available, without further risk to an individual\u2019s privacy.<\/p>\n<p>PrivTree works by mathematically \u201cblurring\u201d the geolocation information of a specific individual, while maintaining overall accuracy for the dataset as a whole. In the example below, individuals in the dataset are projected onto a map by their geolocation coordinates.<\/p>\n<div id=\"attachment_356075\" style=\"width: 807px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-356075\" class=\"wp-image-356075 size-full\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map1.png\" alt=\"PrivTree geolocation example\" width=\"797\" height=\"407\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map1.png 797w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map1-300x153.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map1-768x392.png 768w\" sizes=\"auto, (max-width: 797px) 100vw, 797px\" \/><p id=\"caption-attachment-356075\" class=\"wp-caption-text\">Each marker represents an individual in the geolocation database.<\/p><\/div>\n<p>Next, PrivTree goes through two phases to \u201cblur out\u201d the geolocation information of each individual.<\/p>\n<p><strong>Phase 1: Map Partitioning<\/strong><\/p>\n<div id=\"attachment_356078\" style=\"width: 812px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-356078\" class=\"size-full wp-image-356078\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map2.png\" alt=\"\" width=\"802\" height=\"412\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map2.png 802w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map2-300x154.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map2-768x395.png 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map2-800x412.png 800w\" sizes=\"auto, (max-width: 802px) 100vw, 802px\" \/><p id=\"caption-attachment-356078\" class=\"wp-caption-text\">The map is partitioned into a few sub-regions, based on the density of the data points.<\/p><\/div>\n<p><strong>Phase 2:\u00a0Location Perturbation<\/strong><\/p>\n<div id=\"attachment_356081\" style=\"width: 812px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-356081\" class=\"size-full wp-image-356081\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map3.png\" alt=\"\" width=\"802\" height=\"412\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map3.png 802w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map3-300x154.png 300w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map3-768x395.png 768w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2017\/01\/Map3-800x412.png 800w\" sizes=\"auto, (max-width: 802px) 100vw, 802px\" \/><p id=\"caption-attachment-356081\" class=\"wp-caption-text\">Using statistical analysis, individuals are subjected to a perturbation scheme where they are randomly removed, added or shuffled to guarantee privacy while maintaining statistical accuracy. A new geolocation database is ready to use, after applying location perturbation to each sub-region.<\/p><\/div>\n<p>This ends up with a new set of data points that follows a similar distribution to the original data, but the real location of each participant has been masked. The privatized data is then released as the output of PrivTree. PrivTree can be extended to support all kinds of location data \u2013 for example, your daily jogging route uploaded to a health app. The research paper, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/dl.acm.org\/citation.cfm?id=2882928\" target=\"_blank\">PrivTree: A Differentially Private Algorithm for Hierarchical Decompositions<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> was accepted by\u00a0ACM SIGMOD 2016, the world\u2019s top data management conference.<\/p>\n<p>Professor Xiao said this about collaborating with Microsoft researchers, \u201cMicrosoft Research Asia\u2019s expertise in managing large sets of geolocation data, such as Beijing taxi data, played a crucial role to the success of this project. It helped us develop and test our model.\u201d<\/p>\n<p>Professor Xiao plans to further integrate PrivTree techniques into Microsoft\u2019s location-based services to provide privacy protection. <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/xingx\/\">Dr. Xing Xie<\/a>, Senior Researcher at Microsoft Research Asia, and a collaborator on this project, observed \u201cData privacy is a critical challenge in the cloud computing era, especially for user-generated location data that contains a lot of private knowledge about individuals. We hope this joint work can contribute to&#8211;and eventually lead to&#8211;a safer world for everyone.\u201d<\/p>\n<p><strong>Learn more:<\/strong><\/p>\n<ul>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/dl.acm.org\/citation.cfm?id=2882928\" target=\"_blank\">PrivTree: A Differentially Private Algorithm for Hierarchical Decompositions<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/project\/t-drive-driving-directions-based-on-taxi-traces\/\" target=\"_blank\">T-Drive: Driving Directions based on Taxi Traces<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>By Winnie Cui, Senior Research Manager, Microsoft Research Asia Data scientist, Anthony Tockar, used publicly available location data to show how celebrities can be tracked throughout New York City, while working on his Master\u2019s Degree at Northwestern University. By cross-referencing public news and photos about celebrities hailing cabs in NYC, Tockar found out exactly where [&hellip;]<\/p>\n","protected":false},"author":34645,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[],"msr_hide_image_in_river":0,"footnotes":""},"categories":[194453,194474,194460],"tags":[186854,186857,186454,230918,230921,186698,186415],"research-area":[13563],"msr-region":[197903],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-356072","post","type-post","status-publish","format-standard","hentry","category-data-science","category-data-visulalization","category-search-and-information-retrieval","tag-data-mining","tag-data-privacy","tag-data-visualization","tag-geolocation","tag-geolocation-data","tag-location-based-services","tag-privacy","msr-research-area-data-platform-analytics","msr-region-asia-pacific","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[199560],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","byline":"","formattedDate":"January 20, 2017","formattedExcerpt":"By Winnie Cui, Senior Research Manager, Microsoft Research Asia Data scientist, Anthony Tockar, used publicly available location data to show how celebrities can be tracked throughout New York City, while working on his Master\u2019s Degree at Northwestern University. By cross-referencing public news and photos about&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/356072","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/users\/34645"}],"replies":[{"embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/comments?post=356072"}],"version-history":[{"count":1,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/356072\/revisions"}],"predecessor-version":[{"id":356090,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/posts\/356072\/revisions\/356090"}],"wp:attachment":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media?parent=356072"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/categories?post=356072"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/tags?post=356072"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=356072"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=356072"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=356072"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=356072"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=356072"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=356072"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=356072"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=356072"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}