{"id":170241,"date":"2009-03-26T07:55:31","date_gmt":"2009-03-26T07:55:31","guid":{"rendered":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/project\/visualizing-the-text-of-childrens-book-series\/"},"modified":"2017-06-01T07:30:54","modified_gmt":"2017-06-01T14:30:54","slug":"visualizing-the-text-of-childrens-book-series","status":"publish","type":"msr-project","link":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/project\/visualizing-the-text-of-childrens-book-series\/","title":{"rendered":"Visualizing the text of (children&#8217;s) book series"},"content":{"rendered":"<div class=\"asset-content\">A stream of dancing lights, for all the world like the shimmering curtains of the aurora, blazed across the screen. They took up patterns that were held for a moment only to break apart and form again, in different shapes, or different colours; they looped and swayed, they sprayed apart, they burst into showers of radiance that suddenly swerved this way or that like a flock of birds changing direction in the sky. And as Lyra watched, she felt the same sense, as of trembling on the brink of understanding, that she remembered from the time when she was beginning to read the alethiometer.<\/div>\n<div id=\"en-usprojectsbookvizdefault\" class=\"page-content\">\n<p>&nbsp;<\/p>\n<p class=\"smallText\" dir=\"ltr\">\u2014 Philip Pullman, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.philip-pullman.com\/pages\/content\/index.asp?PageID=61\" target=\"_blank\">The Subtle Knife<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0(Scholastic, 1997; USA, Knopf, 1997)<\/p>\n<\/div>\n<div id=\"en-usprojectsbookvizdefault\" class=\"page-content\">\n<h1>Introduction<\/h1>\n<\/div>\n<div id=\"en-usprojectsbookvizdefault\" class=\"page-content\">\n<p><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/01\/visualizing-the-text-of-childrens-book-series.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-170242 alignleft\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/01\/visualizing-the-text-of-childrens-book-series.jpg\" alt=\"Visualizing the text of (children\u2019s) book series\" width=\"240\" height=\"240\" srcset=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/01\/visualizing-the-text-of-childrens-book-series.jpg 240w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/01\/visualizing-the-text-of-childrens-book-series-150x150.jpg 150w, https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/01\/visualizing-the-text-of-childrens-book-series-180x180.jpg 180w\" sizes=\"auto, (max-width: 240px) 100vw, 240px\" \/><\/a>Digital technologies have repeatedly redefined the paper world of books. Digital printing has overhauled the publishing processes, and the internet has revolutionised the way audiences and authors connect to share their enthusiasm and criticism. Now the digitization of books themselves, either for searching, browsing, and reading on a computer screen through services like Google Books, or for reading on dedicated devices like Amazon&#8217;s Kindle or the Sony Reader are threatening the established order.<\/p>\n<p>But for this project we side-step these issues and concentrate instead on how the analytical power and display capabilities of computers may be used to enhance our understanding of book texts. We use the term &#8220;book texts&#8221; rather than the word &#8220;books&#8221; as we are not trying to build computer systems that might understand books, but rather we use the computer&#8217;s ability to treat books as an abstract sequence of words as the starting point for new analytical tools.<\/p>\n<p>Who would use such tools? Anyone with an interest in books, be they authors, readers, publishers, agents, critics, academics, etc may find such tools useful, but we have designed our visualizations with fans and academic readers in mind. These readers form theories about the books that stand alongside the author&#8217;s own understanding and we hope that the abstract visualizations provided may help such an endeavour.<\/p>\n<\/div>\n<div id=\"en-usprojectsbookvizdefault\" class=\"page-content\">\n<h1>Background and Related Work<\/h1>\n<p><span id=\"9070eafc-30c0-4849-85af-1bc5d8d35bd1\" class=\"ImageBlock fr\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.itsbeenreal.co.uk\/index.php?\/wwwords\/\/about-this-project\/\" target=\"_blank\"><img decoding=\"async\" id=\"Image9070eafc-30c0-4849-85af-1bc5d8d35bd1\" class=\"alignright\" title=\"Fragment of Posavec Sentence Diagram of 'On the Road'\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-posavecdiagramthumbnail.jpg\" alt=\"Fragment of Posavec Sentence Diagram of 'On the Road'\" \/><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><span id=\"ImageCaption9070eafc-30c0-4849-85af-1bc5d8d35bd1\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>The statistical analysis of texts is an important area of work and is used widely in information retrieval (e.g. web search). It is also a mature area of research in its own right, and has been used in the past for things from author attribution to the ordering of works through time. For example in a letter published in 1882 Augustus De Morgan speculated about using statistical techniques to explore authorship questions around St Paul&#8217;s Epistles and the Epistle to the Hebrews [Lea76], while more recently <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/llc.oxfordjournals.org\/cgi\/content\/abstract\/23\/4\/465\" target=\"_blank\">Jockers, Witten, and Criddle <span class=\"sr-only\"> (opens in new tab)<\/span><\/a>used sophisticated statistical techniques to reassess the authorship of the Book of Mormon.<\/p>\n<p>In contrast, the abstract visualization of book texts is not a large or a mature field of study, but there are notable and inspirational examples. The following sections list some of these (more on separate tab)<\/p>\n<\/div>\n<div id=\"en-usprojectsbookvizdefault\" class=\"page-content\">\n<h1>Visualizations<\/h1>\n<p><span id=\"c72aa292-0b2a-4d70-9ec8-5c492dfdb478\" class=\"ImageBlock fl\"><img decoding=\"async\" id=\"Imagec72aa292-0b2a-4d70-9ec8-5c492dfdb478\" class=\"alignleft\" title=\"Fragment from whole text visualization\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-wholetext_lyra_w1ll_long_crop_thumbnail.jpg\" alt=\"Fragment from whole text visualization\" \/><\/span>Our work focuses on the abstract visualization of children&#8217;s book series, and in particular the trilogy &#8220;His Dark Materials&#8221; by Philip Pullman. Pullman&#8217;s trilogy is made up of the three novels &#8220;The Northern Lights&#8221; (called &#8220;The Golden Compass&#8221; in the USA and in the movie adaptation), &#8220;The Subtle Knife&#8221;, and &#8220;The Amber Spyglass&#8221;. We choose this genre partly through personal passion and partly because of the range of potential enthusiastic readers. The best children&#8217;s book series (especially before they are completed) are read and discussed by child and adult readers and many of these readers develop their own theories which they share with their friends and with other readers online. Similarly academic interest is piqued and there are conferences and journals dedicated to the study of children&#8217;s literature (more on separate tab)<\/p>\n<\/div>\n<div id=\"en-usprojectsbookvizdefault\" class=\"page-content\">\n<h1><\/h1>\n<h1>Future Work<\/h1>\n<p><span id=\"cd18bca9-ba08-438e-adbd-0c910566ca05\" class=\"ImageBlock fr\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/manyeyes.alphaworks.ibm.com\/manyeyes\/visualizations\/wordle-of-jane-austens-pride-and-pre-4\" target=\"_blank\"><img decoding=\"async\" id=\"Imagecd18bca9-ba08-438e-adbd-0c910566ca05\" class=\"alignright\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-b3_c38_not_speech.jpg\" \/><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><span id=\"ImageCaptioncd18bca9-ba08-438e-adbd-0c910566ca05\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>There are several directions we\u2019d like to take this work in now.<\/p>\n<h2>User studies<\/h2>\n<p>We need to take these visualizations out of the research lab and engage both the fans and the academics who are theorising about Pullman\u2019s works. We should engage them with these tools and establish if the tools are useful, how they might be improved, and what other visualizations may be of value to the community.<\/p>\n<h2>Infer.Net<\/h2>\n<p>Throughout this work we took the view that computers were not adept at understanding books, but should just essentially count words and draw the results for people to interpret. However advances in machine learning, and especially toolkits enabling machine learning techniques to be applied quickly to new domains have led us to seek to apply <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/um\/cambridge\/projects\/\/infernet\" target=\"_blank\">Infer.Net <span class=\"sr-only\"> (opens in new tab)<\/span><\/a>to the analysis phase of the visualization.<\/p>\n<h2>Other visualizations<\/h2>\n<p>Inevitably building visualizations leads to 1,001 other ideas as to how the data may be visualized. We would like to add the ability to pivot (e.g. for one flowers bud to open another flower side-by-side). We would like to add animations so that the dynamic movement between visualizations or as a visualization is formed is part of the semantics of the visualization itself.<\/p>\n<h2>Online version<\/h2>\n<p>The visualizations we made are not available for public use \u2013 either online or through downloading. This is partly because we have not spent time looking at the rights implications and partly because we have not engineered the code to the quality level required for public use. It would be great to get this to a level where people can try the visualizations we built for themselves without us present.<\/p>\n<h2>Other Books<\/h2>\n<p>It would be interesting to apply this work to other children\u2019s book series, to see if the characteristic patterns revealed in the visualizations were different from author to author. We might also move from a reader\u2019s perspective to a learner\u2019s perspective and choose books which often appear on high-school syllabuses. But most intriguing would be to build visualizations that contrast the content and style of different author\u2019s work.<\/p>\n<\/div>\n<div id=\"en-usprojectsbookvizdefault\" class=\"page-content\">\n<h1>References<\/h1>\n<ul>\n<li>[Bec07] Linda Becker, 2007 &#8220;In Translation&#8221; <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/lindabecker.net\/in-translation\/\" target=\"_blank\">http:\/\/lindabecker.net\/in-translation\/<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[Dan05] Anh Dang, 2005 &#8220;Gospel Spectrum&#8221; <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/thirteensquares.com\/gospelspectrum\/\" target=\"_blank\">http:\/\/thirteensquares.com\/gospelspectrum\/<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[Har08] Chris Harrison, 2008 &#8220;Visualizing the Bible&#8221;\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.chrisharrison.net\/projects\/bibleviz\/\" target=\"_blank\">http:\/\/www.chrisharrison.net\/projects\/bibleviz\/<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[JWC08] Jockers, Witten, and Criddle 2008 &#8220;Reassessing authorship of the Book of Mormon using delta and nearest shrunken centroid classification&#8221; in &#8220;Literary and Linguistic Computing&#8221; <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/llc.oxfordjournals.org\/cgi\/content\/abstract\/23\/4\/465\" target=\"_blank\">http:\/\/llc.oxfordjournals.org\/cgi\/content\/abstract\/23\/4\/465<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[Lar14] Clarence Larking 1914 \u2013 1918 &#8220;Dispensational Charts&#8221;\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.preservedwords.com\/charts.htm\" target=\"_blank\">http:\/\/www.preservedwords.com\/charts.htm<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[Lea76] Peter Lea, &#8220;The Style is the Man&#8221;, unpublished lecture notes and slides, University of York, 1976<\/li>\n<li>[Pal02] W Bradford Paley, 2002 &#8220;TextArc&#8221;\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/textarc.org\/\" target=\"_blank\">http:\/\/textarc.org\/<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[Pos06] Stephanie Posavec, 2006 &#8220;Writing Without Words&#8221; <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.itsbeenreal.co.uk\/index.php?\/wwwords\/\/about-this-project\/\" target=\"_blank\">http:\/\/www.itsbeenreal.co.uk\/index.php?\/wwwords\/\/about-this-project\/<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[PRYAKSCL06] Plaisant, Rose, Yu, Auvil, Kirschenbaum, Nell Smith, Clement, and Lord 2006 &#8220;Exploring erotics in Emily Dickinson&#8217;s correspondence with text mining and visual interfaces&#8221; <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/portal.acm.org\/citation.cfm?id=1141753.1141781\" target=\"_blank\">http:\/\/portal.acm.org\/citation.cfm?id=1141753.1141781<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[Sha08] Ebany Spencer, 2008 &#8220;Romancing Dimensions&#8221;\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.ebanyshae.com\/page11.htm\" target=\"_blank\">http:\/\/www.ebanyshae.com\/page11.htm<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[SK07] Philipp Steinweber and Andreas Koller, 2007 &#8220;Similar Diversity&#8221; <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/similardiversity.net\/project\/\" target=\"_blank\">http:\/\/similardiversity.net\/project\/<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[Wal08] Tim Walter, 2008 &#8220;textour&#8221;\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.timwalter.de\/portfolio\/textour\/\" target=\"_blank\">http:\/\/www.timwalter.de\/portfolio\/textour\/<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[Wat02] Martin Wattenburg, 2002 &#8220;Arc Diagrams: Visualizing Structure in Strings&#8221;\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/portal.acm.org\/citation.cfm?id=857733\" target=\"_blank\">http:\/\/portal.acm.org\/citation.cfm?id=857733<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0 <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.research.ibm.com\/visual\/papers\/arc-diagrams.pdf\" target=\"_blank\">http:\/\/www.research.ibm.com\/visual\/papers\/arc-diagrams.pdf<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>[WV08] Martin Wattenberg and Fernanda B. Vi\u00e9gas,2008 &#8220;The Word Tree: An Interactive Visual Concordance&#8221;\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/ieeexplore.ieee.org\/xpl\/freeabs_all.jsp?arnumber=4658133\" target=\"_blank\">http:\/\/ieeexplore.ieee.org\/xpl\/freeabs_all.jsp?arnumber=4658133<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0 <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.research.ibm.com\/visual\/papers\/wordtree_final2.pdf\" target=\"_blank\">http:\/\/www.research.ibm.com\/visual\/papers\/wordtree_final2.pdf<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<\/ul>\n<\/div>\n<div id=\"en-usprojectsbookvizdefault\" class=\"page-content\">\n<h1>Acknowledgements<\/h1>\n<p><span id=\"331a31fa-2f24-4235-95aa-17fe0c3e9bbc\" class=\"ImageBlock fl\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.flickr.com\/photos\/dumbledad\/3348936160\/\" target=\"_blank\"><img decoding=\"async\" id=\"Image331a31fa-2f24-4235-95aa-17fe0c3e9bbc\" class=\"alignleft\" title=\"Workshop Group Photo\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-pullman_visualization_workshop_group_photo.jpg\" alt=\"Workshop Group Photo\" \/><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><span id=\"ImageCaption331a31fa-2f24-4235-95aa-17fe0c3e9bbc\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>This work was done as a collaboration between\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.lindabecker.net\/\" target=\"_blank\">Linda Becker<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/~timregan\/\" target=\"_blank\">Tim Regan <span class=\"sr-only\"> (opens in new tab)<\/span><\/a>during Linda\u2019s internship at <a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/lab\/microsoft-research-cambridge\/\" target=\"_new\">Microsoft Research\u2019s Cambridge Lab <\/a>in the Summer of 2008. The work would not have been possible without the generous, thought provoking, and supportive help of Pullman\u2019s publishers especially <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www5.scholastic.co.uk\/zone\/book_marion-lloyd.htm\" target=\"_blank\">Marion Lloyd <span class=\"sr-only\"> (opens in new tab)<\/span><\/a>and Claire Tagg at <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.scholastic.co.uk\/\" target=\"_blank\">Scholastic<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, Pullman\u2019s agent <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.apwatt.co.uk\/\" target=\"_blank\">Caradoc King<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, and\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.philip-pullman.com\/\" target=\"_blank\">Philip Pullman<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> himself.<\/p>\n<\/div>\n<div id=\"en-usprojectsbookvizvisualization\" class=\"page-content\">\n<p>&nbsp;<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>A stream of dancing lights, for all the world like the shimmering curtains of the aurora, blazed across the screen. They took up patterns that were held for a moment only to break apart and form again, in different shapes, or different colours; they looped and swayed, they sprayed apart, they burst into showers of [&hellip;]<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"research-area":[13554,13555,13559],"msr-locale":[268875],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-170241","msr-project","type-msr-project","status-publish","hentry","msr-research-area-human-computer-interaction","msr-research-area-search-information-retrieval","msr-research-area-social-sciences","msr-locale-en_us","msr-archive-status-active"],"msr_project_start":"2016-08-24","related-publications":[],"related-downloads":[],"related-videos":[],"related-groups":[],"related-events":[],"related-opportunities":[],"related-posts":[],"related-articles":[],"tab-content":[{"id":0,"name":"Background","content":"<!-- .asset-content -->\r\n<div id=\"en-usprojectsbookvizbackground\" class=\"page-content\">\u00a0The statistical analysis of texts is an important area of work and is used widely in information retrieval (e.g. web search). It is also a mature area of research in its own right, and has been used in the past for things from author attribution to the ordering of works through time. For example, in a letter published in 1882\u00a0Augustus De Morgan speculated about using statistical techniques to explore authorship questions around St Paul's Epistles and the Epistle to the Hebrews [<a href=\"[permalink post_id=170241]#references\" target=\"_new\">Lea76<\/a>], while more recently <a href=\"http:\/\/llc.oxfordjournals.org\/cgi\/content\/abstract\/23\/4\/465\" target=\"_new\">Jockers, Witten, and Criddle <\/a>used sophisticated statistical techniques to reassess the authorship of the Book of Mormon.<\/div>\r\n<div id=\"en-usprojectsbookvizbackground\" class=\"page-content\">\r\n\r\nIn contrast, the abstract visualization of book texts is not a large or a mature field of study, but there are notable and inspirational examples. The following sections list some of these\r\n<h3>Clarence Larkin's \"Dispensational Charts\"<\/h3>\r\n<span id=\"eac91d4f-f5a3-4316-925d-fa47c3ec472d\" class=\"ImageBlock fl\"><a href=\"http:\/\/www.preservedwords.com\/charts.htm\" target=\"_new\"><img id=\"Imageeac91d4f-f5a3-4316-925d-fa47c3ec472d\" title=\"Clarence Larkin's Dispensational Truth chart: The Heavens\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-larkins_the_heavens_dispensational_chart.jpg\" alt=\"Clarence Larkin's Dispensational Truth chart: The Heavens\" \/><\/a><span id=\"ImageCaptioneac91d4f-f5a3-4316-925d-fa47c3ec472d\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>Data visualizations fall into two overlapping camps: exploration and communication. Larkin's 1914-1918\u00a0<a href=\"http:\/\/www.preservedwords.com\/charts.htm\" target=\"_new\">Dispensational Charts<\/a> are about communicating scripture and prophesy from The Bible. They diagram the structure of each topic (e.g. \"The Heavens\" or \"The Second Coming\") and use flow, representational images, and references back to Bible passages to illuminate each topic.\r\n<h3>Text Arc<\/h3>\r\n<span id=\"b8f2f0e5-ae01-43fd-a560-6088eae319f9\" class=\"ImageBlock fr\"><a href=\"http:\/\/textarc.org\/\" target=\"_new\"><img id=\"Imageb8f2f0e5-ae01-43fd-a560-6088eae319f9\" title=\"Brad Paley's Text Arc of Alice in Wonderland\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-alicetextarc.jpg\" alt=\"Brad Paley's Text Arc of Alice in Wonderland\" \/><\/a><span id=\"ImageCaptionb8f2f0e5-ae01-43fd-a560-6088eae319f9\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>The seminal work of abstract exploratory visualization of book texts is Brad Paley's \"<a href=\"http:\/\/textarc.org\/\" target=\"_new\">Text Arc<\/a>\". TextArc is a screen based application Paley designed and implemented that takes a text and displays it twice. Firstly, line by line in tiny font around the edge of a giant ellipse. And then secondly word-by-word with each word anchored by invisible springs to the sentences in which it occurs. Common words are removed (so called '<a href=\"http:\/\/en.wikipedia.org\/wiki\/Stop_words\" target=\"_new\">stop words<\/a>') and the remaining words are rendered so that more common words use a larger font and are drawn on top of any less common words sharing the same screen area. Paley's TextArc can be used to explore any text but he often demonstrates it using Alice in Wonderland and then at the centre, in big letters, is the word Alice as that occurs throughout the book. TextArc has many other features, including an elegant dynamic path sweeping through the work as the text is read through.\r\n\r\nText Arc was conceived as a tool to help academics and other readers analyse texts. Another outlet proved to be selling high quality printouts as a beautiful memento of one's favourite texts. The application of book visualization to academic literary studies has been continued in work like Plaisant et al's \"<a href=\"http:\/\/portal.acm.org\/citation.cfm?id=1141753.1141781\" target=\"_new\">Exploring Erotics in Emily Dickinson's Correspondence<\/a>\".\r\n\r\n<\/div>\r\n<div id=\"en-usprojectsbookvizbackground\" class=\"page-content\">\r\n\r\nPartly because of the widespread availability of electronic versions of the text, partly because of its cultural significance, and partly because of the huge numbers of people who care about it The Bible has proved an intriguing source of visualizations.\r\n<h3>Anh Dang's \"Gospel Spectrum\"<\/h3>\r\n<span id=\"5ea32697-a9d5-422a-a2fd-1a3937f8ddcf\" class=\"ImageBlock fl\"><a href=\"http:\/\/thirteensquares.com\/gospelspectrum\/\" target=\"_new\"><img id=\"Image5ea32697-a9d5-422a-a2fd-1a3937f8ddcf\" title=\"Ahn Dang's Gospel Spectrum\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-ahn_dang_gospel_spectrum.jpg\" alt=\"Ahn Dang's Gospel Spectrum\" \/><\/a><span id=\"ImageCaption5ea32697-a9d5-422a-a2fd-1a3937f8ddcf\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>While on NYU's <a href=\"http:\/\/itp.nyu.edu\/\" target=\"_new\">Interactive Telecommunications Programme <\/a>Anh Dang built \"<a href=\"http:\/\/thirteensquares.com\/gospelspectrum\/\" target=\"_new\">Gospel Spectrum<\/a>\", an interactive visualization exploring the gospel accounts of Christ's life. Each episode in Christ's life is represented as a coloured bar with the colours representing the different gospels and their length representing the number of verses spent on that episode. The resulting visualization allows one to see how Christ's life unfolds through the gospels: which gospels concentrate on which parts of his life, and when the gospels come together to record an episode.\r\n<h3>Linda Becker's \"In Translation\"<\/h3>\r\n<span id=\"97f1f7cd-b57e-4eb1-a716-8f3b09783125\" class=\"ImageBlock fr\"><a href=\"http:\/\/lindabecker.net\/in-translation\/\" target=\"_new\"><img id=\"Image97f1f7cd-b57e-4eb1-a716-8f3b09783125\" title=\"Linda Becker's In Translation\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-linda_becker_in_translation.jpg\" alt=\"Linda Becker's In Translation\" \/><\/a><span id=\"ImageCaption97f1f7cd-b57e-4eb1-a716-8f3b09783125\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>Started at <a href=\"http:\/\/www.macommunicationdesign.com\/\" target=\"_new\">Central Saint Martin's School of Art<\/a>, Becker's \"<a href=\"http:\/\/lindabecker.net\/in-translation\/\" target=\"_new\">In Translation<\/a>\" shows visually the structural similarities and differences between different language translations of the Tower of Babel story, for example showing the position allocated to each letter-combination. \"In Translation\" both enforces the message of The Tower of Babel Story by highlighting the differences between human languages, but also cuts across it by showing structural similarities.\r\n<h3>Chris Harrison's \"Bible Visualizations\"<\/h3>\r\n<span id=\"e8b1054c-0b90-4610-bfb4-b20a90bfb855\" class=\"ImageBlock fl\"><a href=\"http:\/\/www.chrisharrison.net\/projects\/bibleviz\/\" target=\"_new\"><img id=\"Imagee8b1054c-0b90-4610-bfb4-b20a90bfb855\" title=\"Chris Harrison's Bible visualizations\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-chris_harrison_bible_visualizations.jpg\" alt=\"Chris Harrison's Bible visualizations\" \/><\/a><span id=\"ImageCaptione8b1054c-0b90-4610-bfb4-b20a90bfb855\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>Chris Harrison's <a href=\"http:\/\/www.chrisharrison.net\/projects\/bibleviz\/\" target=\"_new\">visualizations of The Bible <\/a>follow two paths. Firstly Harrison took a set of textual cross references found in The Bible compiled by Lutheran Pastor Christoph Romhild and displayed the links visually, resulting in a beautiful picture that gives detail about which chapters contain most cross references that also impresses the viewer with the sheer number of cross references. The second set looks at proper nouns through The Bible and overlays them as a tag cloud. But rather than abstracting the positions of the nouns from their occurrence in the text they are placed at their 'centre of mass'.\r\n<h3>Steinweber and Koller's \"Similar Diversity\"<\/h3>\r\n<span id=\"425b1d9f-29b0-4d71-8107-e72e2fe4e030\" class=\"ImageBlock fr\"><a href=\"http:\/\/similardiversity.net\/project\/\" target=\"_new\"><img id=\"Image425b1d9f-29b0-4d71-8107-e72e2fe4e030\" title=\"Steinweber and Koller's Similar Diversity\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-similardiversity.jpg\" alt=\"Steinweber and Koller's Similar Diversity\" \/><\/a><span id=\"ImageCaption425b1d9f-29b0-4d71-8107-e72e2fe4e030\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>The last Bible visualization we'll touch on is Steinweber and Koller's \"<a href=\"http:\/\/similardiversity.net\/project\/\" target=\"_new\">Similar Diversity<\/a>\". Like Harrison's work Steinweber and Koller use <a href=\"http:\/\/portal.acm.org\/citation.cfm?id=857733\" target=\"_new\">arc-diagrams <\/a>and other visual features, but rather than using them to explore the structure within The Bible Similar Diversity shows the similarities and differences between holy books of different religions.\r\n\r\n<\/div>\r\n<div id=\"en-usprojectsbookvizbackground\" class=\"page-content\">\r\n\r\nBefore moving on to describe our own visual explorations of the text of Pullman's His Dark Materials trilogy there are four other interesting book visualization projects that are worth drawing attention to because of other potential features they make use of.\r\n<h3>Ebany Spencer's \"Romancing Dimensions\"<\/h3>\r\n<span id=\"24b167b1-3ed9-4e05-9ab2-9100fc4def0a\" class=\"ImageBlock fl\"><a href=\"http:\/\/www.ebanyshae.com\/page12.htm\" target=\"_new\"><img id=\"Image24b167b1-3ed9-4e05-9ab2-9100fc4def0a\" title=\"Ebany Spencer's Romancing Dimensions\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-ebany_spencer_romancing_dimensions.jpg\" alt=\"Ebany Spencer's Romancing Dimensions\" \/><\/a><span id=\"ImageCaption24b167b1-3ed9-4e05-9ab2-9100fc4def0a\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>In her CSM MACD project \"<a href=\"http:\/\/www.ebanyshae.com\/page12.htm\" target=\"_new\">Romancing Dimensions<\/a>\" Ebany Spencer attempts to use purely visual notations systems to retell Edwin Abbott Abbott's \"Flatlands\" story. Though entirely paper based Spencer's work uses three dimensions by using paper cut-outs to move some of her time-line representations of the work out from the background plane.\r\n<h3>Tim Walter's \"textour\"<\/h3>\r\nTim Walter's <a href=\"http:\/\/www.timwalter.de\/portfolio\/textour\/\" target=\"_new\">textour<\/a> (in German) has uses time and animation to show the structural elements of the book accruing as data is added or filtered.\r\n<h3>Stephanie Posavec's \"Writing Without Words\"<\/h3>\r\n<span id=\"f163c8b2-ad33-486d-805f-8ba3886e55ad\" class=\"ImageBlock fr\"><a href=\"http:\/\/www.itsbeenreal.co.uk\/index.php?\/wwwords\/\/about-this-project\/\" target=\"_new\"><img id=\"Imagef163c8b2-ad33-486d-805f-8ba3886e55ad\" title=\"Stephanie Posavec's Literary Organism\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-posavec_literary_organism.jpg\" alt=\"Stephanie Posavec's Literary Organism\" \/><\/a><span id=\"ImageCaptionf163c8b2-ad33-486d-805f-8ba3886e55ad\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>Stephanie Posavec's <a href=\"http:\/\/www.itsbeenreal.co.uk\/index.php?\/wwwords\/\/about-this-project\/\" target=\"_new\">beautiful visualizations<\/a>\u00a0of Jack Kerouac's \"On the Road\" (and some other contrasting novels)\u00a0are not the result of a computer analysis of the work but the result of careful, loving, and painstaking analysis by-hand of the text itself. Posavec produces several visualizations, from the spider-like Posavec diagrams which map the sentence lengths authors' use (a line continues for the length of the first sentence, then turns ninety degrees and continues for the length of the second sentence, etc) through to the elegant 'literary organism' flower like structures.\r\n<h3>IBM Research's Visual Communications Lab's \"Many Eyes\"<\/h3>\r\n<a href=\"http:\/\/manyeyes.alphaworks.ibm.com\/\" target=\"_new\">Many Eyes<\/a> is a social visualization site. It is social in many ways: users upload data sets that are immediately shared with all the other Many Eyes members; anyone can use any of the provided visualization tools to visualize the data sets; these visualizations can be shared and discussed on the Many Eyes sites, or embedded into blog posts to foster conversation and analysis beyond the site. Many Eyes was conceived, designed, and built by <a href=\"http:\/\/www.research.ibm.com\/visual\/\" target=\"_new\">IBM Research's Visual Communications Lab<\/a>. It was originally thought that most of the datasets and visualizations would be based on numeric data, and so the visualizations were tailored towards quantitative data. In fact the inventors were taken aback by the amount of textual data sets uploaded, including notably The Bible and political speeches, and they have written about the text based visualizations designed and added in response [<a href=\"[permalink post_id=170241]#references\" target=\"_new\">WV08<\/a>].\r\n\r\n<\/div>"},{"id":1,"name":"Visualizations","content":"<div id=\"en-usprojectsbookvizvisualization\" class=\"page-content\">\r\n\r\nOur work focuses on the abstract visualization of children's book series, and in particular the trilogy \"His Dark Materials\" by Philip Pullman. Pullman's trilogy is made up of the three novels \"The Northern Lights\" (called \"The Golden Compass\" in the USA and in the movie adaptation), \"The Subtle Knife\", and \"The Amber Spyglass\". We choose this genre partly through personal passion and partly because of the range of potential enthusiastic readers. The best children's book series (especially before they are completed) are read and discussed by both child and adult readers and many of these readers develop their own theories which they share with their friends and with other readers online. Similarly academic interest is piqued leading to conferences and journals dedicated to the study of children's literature.\r\n\r\n<\/div>\r\n<div id=\"en-usprojectsbookvizvisualization\" class=\"page-content\">\r\n<h1>Design Ideas<\/h1>\r\n<span id=\"a9c1297c-6671-4757-9a58-199146acd32c\" class=\"ImageBlock fl\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-part_1_distribution.pdf\" target=\"_new\"><img id=\"Imagea9c1297c-6671-4757-9a58-199146acd32c\" class=\"alignleft\" title=\"Design Idea for Whole Text Visualization\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-wholetextdesignthumbnail.jpg\" alt=\"Design Idea for Whole Text Visualization\" \/><\/a><\/span>In order to narrow the design space Linda Becker and I decided to focus on two questions:\r\n<ol>\r\n \t<li>How the language used about different characters contrasts and how it changes through the series;<\/li>\r\n \t<li>How linguistic themes (like religious language) are used through the series.<\/li>\r\n<\/ol>\r\nWe started with Linda sketching out how some visualizations might look (without using actual) data.\r\n\r\nIn the first of these Linda looked at the distribution of words (e.g. characters names) throughout the text, using connecting arcs (among other ideas) to give a sense of the rhythm of related characters through the text.\r\n\r\n<\/div>\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n&nbsp;\r\n<div id=\"en-usprojectsbookvizvisualization\" class=\"page-content\">\r\n\r\n<span id=\"5cd4ca58-006a-45fa-89f3-b7a7e644295f\" class=\"ImageBlock fr\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-part_2_wordcluster.pdf\" target=\"_new\"><img id=\"Image5cd4ca58-006a-45fa-89f3-b7a7e644295f\" class=\"alignright\" title=\"Character Analysis Design Thumbnail\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-characteranalysisdesignthumbnail.jpg\" alt=\"Character Analysis Design Thumbnail\" \/><\/a><\/span>\r\n\r\nThe second set of sketches looks at the character word plots, what form they might take and what visual dimensions this would give us to plot differing data or to reinforce existing data.\r\n\r\n<span id=\"1705c94f-e208-43bf-b299-1397aac5bfae\" class=\"ImageBlock fl\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-part_3_charactersandthemes.pdf\" target=\"_new\"><img id=\"Image1705c94f-e208-43bf-b299-1397aac5bfae\" class=\"alignleft\" title=\"Theme Design Thumbnail\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-themesdesignthumbnail.jpg\" alt=\"Theme Design Thumbnail\" \/><\/a><\/span>Thirdly Linda tackled the notion of themes, and the sketches she produced show how we might plot themes progression through the books. These are the sketches that we have had least success moving into functioning visualizations since they rely on a more sophisticated notion of theme than looking at individual word positions may provide.\r\n\r\n<span id=\"fd3244f4-b5de-43df-ac26-7dd062550c00\" class=\"ImageBlock fr\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-part_4_textonly.pdf\" target=\"_new\"><img id=\"Imagefd3244f4-b5de-43df-ac26-7dd062550c00\" class=\"alignright\" title=\"Text Only Visualization Design Thumbnail\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-textonlyvisualization2designthumbnail.jpg\" alt=\"Text Only Visualization Design Thumbnail\" \/><\/a><\/span>The last series of visualization sketches Linda produced looked at text. Instead of drawing structures based on the relationships between words we looked at drawing the structures with the words themselves. This proved quite playful. I had wanted the visualisations to be legible themselves as text, but some of the sketches jump to the opposite pole, for example rendering only the words of interest and leaving the surrounding text as measured space.\r\n\r\nThe two ideas that we built up into working visualizations are the flower-like structures showing the words occurring near the characters names (or other given words) and renderings of the whole text with the character names of interest highlighted with colours and arcs.\r\n\r\n<\/div>\r\n<div id=\"en-usprojectsbookvizvisualization\" class=\"page-content\">\r\n<h1>Character Flowers<\/h1>\r\nThe first of the visualization ideas that we implemented were the character flowers. Figure 1 shows the character flower for the word Lyra. Central to the flower is the word \"lyra\" itself, surrounded by a 'lifebelt' which shows, starting from the 12 O'clock position, the occurrences of the word \"lyra\" through the series, with each occurrence resulting in a thin red line.\r\n<p align=\"center\"><span id=\"b2a51463-be00-4aca-96ae-e224d0b9a884\" class=\"ImageBlock fn\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-characterflower_lyra.jpg\" target=\"_new\"><img id=\"Imageb2a51463-be00-4aca-96ae-e224d0b9a884\" class=\"aligncenter\" title=\"Character Flower: &quot;Lyra&quot;\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-characterflower_lyra_thumbnail.jpg\" alt=\"Character Flower: &quot;Lyra&quot;\" \/><\/a><span id=\"ImageCaptionb2a51463-be00-4aca-96ae-e224d0b9a884\" class=\"ImageCaptionCoreCss ImageCaption\">Figure 1: Character Flower of the word \"Lyra\" (click for larger version)<\/span><\/span><\/p>\r\nWe can see from the number of crowded red lines that \"lyra\" is a frequently occurring word, as we would expect, but that the second and third books contain episodes where she is not mentioned. Moving out from that each \u2018bud\u2019 represents a word. Here we are looking at all the words which immediately follow the word \"lyra\" in a sentence. Those words are arranged in order of the frequency with which they appear after \"lyra\", and the size of the bud reflects the frequency of the word overall (i.e. the number of chapters it occurs in, regardless of whether it occurs after the word \"lyra\"). The final measure is the distance from the centre that the bud is drawn. This reflects the probability that when the word occurs it occurs after \"lyra\". So we see two buds placed near the centre at the start are two words that occur frequently after the word \"lyra\" and are unlikely to occur elsewhere. Indeed the two words are Lyra\u2019s surnames: Silvertounge and Belacqua. Other words drawn towards the centre are evocative of Lyra's personality: \"joyfully\", \"quelled\", \"exulted\",\u00a0\"definitely\", \"judged\", \"raided\", ... but two stand out as anomalous: \"blushed\" and \"obediently\". Clicking on the bud brings up the sentences in which the word follows the word \"lyra\". From these sentences we find that the terms are used when Lyra is in disguise. In some respects this shows that the visualization works \u2013 the anomalies are indeed anomalies, but they are ones consciously placed by Pullman, rather than subconscious ones.\r\n<p align=\"center\"><span id=\"296f8ba3-02e3-4d2b-af42-da992bd044a1\" class=\"ImageBlock fn\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-characterflower_lyras_heart.jpg\" target=\"_new\"><img id=\"Image296f8ba3-02e3-4d2b-af42-da992bd044a1\" class=\"aligncenter\" title=\"Character Flower for the word &quot;Lyra's&quot;\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-characterflower_lyras_heart_thumbnail.jpg\" alt=\"Character Flower for the word &quot;Lyra's&quot;\" \/><\/a><span id=\"ImageCaption296f8ba3-02e3-4d2b-af42-da992bd044a1\" class=\"ImageCaptionCoreCss ImageCaption\">Figure 2: Character Flower of the word \"Lyra's\" (click for larger version)<\/span><\/span><\/p>\r\nCharacters names can also be used in their possessive sense, e.g. \"lyra's\" and the character flower in Figure 2 shows the diagram for the words after \"lyra's\". These are mostly body parts (les, arms, hair, etc) and this style is born out in Pullman's writing about the other characters.\r\n\r\n<\/div>\r\n<div id=\"en-usprojectsbookvizvisualization\" class=\"page-content\">\r\n<h1>Whole Text<\/h1>\r\n<span id=\"f2d2db25-ad94-46e4-8a88-a59a46f3ffb1\" class=\"ImageBlock fl\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-wholetext_lyra_w1ll_1536.jpg\" target=\"_new\"><img id=\"Imagef2d2db25-ad94-46e4-8a88-a59a46f3ffb1\" class=\"alignleft\" title=\"Whole Text Visualization Thumbnail\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-wholetext_lyra_w1ll_thumbnail.jpg\" alt=\"Whole Text Visualization Thumbnail\" \/><\/a><\/span>These visualizations show the entire text of the three volumes that make up the trilogy. We were interested to see the rhythm of the characters occurrences in the whole text, especially two related characters. Figure\u00a03 shows a fragment of the entire trilogy, with linked coloured disks over occurrences of Lyra and Will's names. We can quickly see simple facts like Will's absence from the first book, and more curious aspects like the periods of the second book where neither of them are mentioned (presumably the sections focussed on Mary Malone, Lord Asreil, or Mrs Coulter). Printed out this diagram is many feet long, and the text itself is (just) readable. This combination of text level detail and global pattern is particularly interesting. I was hoping that this visualization would highlight a poetic choice across the trilogy. Tolstoy starts and ends \"Anna Karenina\" at a railway station, and Pullman purposefully opens the first book with the word \"Lyra\" and ends the last book with the word \"Lyra\". This should stand out as the visualization should start and end with a coloured disk. But it does not. In fact Pullman precedes the start of his book with a quote from Milton's \"Paradise Lost\", which stops the poetic symmetry coming out in the visualization.\r\n\r\n<\/div>\r\n&nbsp;\r\n\r\n&nbsp;\r\n<div id=\"en-usprojectsbookvizvisualization\" class=\"page-content\">\r\n<p align=\"center\"><span id=\"11655486-20d1-4527-9500-1a064abea89b\" class=\"ImageBlock fn\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-wholetext_lyra_w1ll_long_crop.jpg\" target=\"_new\"><img id=\"Image11655486-20d1-4527-9500-1a064abea89b\" class=\"aligncenter\" title=\"Cropped Whole Text Visualization Highlighting Lyra and Will's names\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-wholetext_lyra_w1ll_long_crop.jpg\" alt=\"Cropped Whole Text Visualization Highlighting Lyra and Will's names\" \/><\/a><span id=\"ImageCaption11655486-20d1-4527-9500-1a064abea89b\" class=\"ImageCaptionCoreCss ImageCaption\">Figure 3: Fragment of Whole Text Visualization Highlighting Lyra and Will's names (click for larger version)<\/span><\/span><\/p>\r\n\r\n<\/div>\r\n<div id=\"en-usprojectsbookvizvisualization\" class=\"page-content\">\r\n<h1>Implementation Detail<\/h1>\r\n<span id=\"fe7f7742-cf18-48b1-a852-5c1238631c63\" class=\"ImageBlock fl\"><a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-sql.jpg\" target=\"_new\"><img id=\"Imagefe7f7742-cf18-48b1-a852-5c1238631c63\" class=\"alignleft\" title=\"Screenshot of a SQL Query of Pullman Text Database\" src=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2016\/02\/bookviz-sql_thumbnail.jpg\" alt=\"Screenshot of a SQL Query of Pullman Text Database\" \/><\/a><span id=\"ImageCaptionfe7f7742-cf18-48b1-a852-5c1238631c63\" class=\"ImageCaptionCoreCss ImageCaption\"><\/span><\/span>The initial sketches were built in Adobe Illustrator. Having chosen our two initial candidates for implementation these were prototyped in Processing, a language aimed at designers new to programming. Later these prototypes were re-worked into C# and WPF. The texts themselves were drawn from the publishers Quark documents, saved to plain text, broken down into chapters, sentences, and words in C# and stored in a SQL Server 2008 database.\r\n\r\n<\/div>"}],"slides":[],"related-researchers":[{"type":"user_nicename","value":"krw","display_name":"Ken Woodberry","author_link":"<a href=\"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/people\/krw\/\" aria-label=\"Visit the profile page for Ken Woodberry\">Ken Woodberry<\/a>","is_active":false,"user_id":32585,"last_first":"Woodberry, Ken","people_section":0,"alias":"krw"}],"msr_research_lab":[],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/170241","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-project"}],"about":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-project"}],"version-history":[{"count":2,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/170241\/revisions"}],"predecessor-version":[{"id":388013,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/170241\/revisions\/388013"}],"wp:attachment":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media?parent=170241"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=170241"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=170241"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=170241"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=170241"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}