`James Allan
`College of Computer and Information Sciences
`140 Governors Drive
`University of Massachusetts
`Amherst, Massachusetts 01003-4610
`
`Date of Birth: June 19, 1961
`Citizenship: United States
`
`413/545-2742
`413/545-1789
`allan@cs.umass.edu
`http://cs.umass.edu/~allan
`
`Last updated: June 13, 2017
`
`Principal fields of interest
`Information retrieval, organization, and visualization; Topic/event Detection and
`Tracking; Evaluation of retrieval; Information access and capture
`
`Education
`Ph.D.
`in Computer Science, Cornell University, 1995 (Jan 18). Department of
`Computer Science. Thesis: Automatic Hypertext Construction, under Professor
`Gerard Salton. Minor: Theatre Arts (directing).
`
`M.S.
`
`in Computer Science, Cornell University, 1992 (Jan 15).
`
`A.B.
`
`in Mathematics, Grinnell College, 1983 (May 23).
`
`Employment
`2015-
`Chair of the Faculty, College of Information and Computer Sciences
`2008-
`Professor, Computer Science, University of Massachusetts Amherst
`2003-
`Co-Director, Center for Intelligent Information Retrieval, UMass Amherst
`
`2012
`Visiting Professor, University of Melbourne, Australia (January to June)
`2003-08 Associate Professor, Computer Science, University of Massachusetts Amherst
`
`1998-2003 Assistant Professor, Computer Science, University of Massachusetts Amherst.
`1996-2003 Assistant Director, Center for Intelligent Information Retrieval, University of
`Massachusetts at Amherst (Acting Director, 1999-2000).
`
`1996-98 Research Assistant Professor, Computer Science, University of Massachusetts
`Amherst.
`1994-96 Senior Post-doctoral Research Associate, Computer Science, University of
`Massachusetts at Amherst. Working with the Center for Intelligent
`Information Retrieval, under the direction of Professor Bruce Croft.
`
`1989-94 Research Assistant, Computer Science, Cornell University. Working with
`Professor Gerard Salton on Information Retrieval research using the Smart
`system. Part of the Information Access and Capture research group.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.1
`
`
`
`Curriculum Vitae, James Allan
`
`2 of 23
`
`Publications (journals)
`1. E.K.F. Dang, R.W.P. Luk, and J. Allan, “Beyond Bag-of-Words: Bigram-
`Enhanced Context-Dependent Term Weights.” Journal of the American Society for
`Information Science and Technology, 65(6):1134-1148 (2014)
`2. E.K.F. Dang, R.W.P. Luk, J. Allan, K.S. Ho, S.C.F. Chan, K.F-L. Chung, and D.L.
`Lee, “A new context-dependent term weight computed by boost and discount using
`relevance information.” Journal of the American Society for Information Science
`and Technology, 61(12): 2514-2530 (2010)
`3. G. Kumaran and J. Allan, “Adapting Information Retrieval Systems to User
`Queries.” Information Processing and Management, special issue on Adaptive
`Information Retrieval, 44(6):1838-1862, 2008.
`4. E.M. Kornfield, R. Manmatha, and J. Allan, “Further Explorations in Text
`Alignment with Handwritten Documents.” International Journal of Document
`Analysis and Recognition, Springer Berlin, Heidelberg, 10(1):39-52, June 2007.
`5. J. Allan, V. Lavrenko, and M. Connell, “A Month to Topic Detection and Tracking
`in Hindi.” ACM Transactions on Asian Language Information Processing,
`2(2):85-10, June 2003 (actually published early 2004).
`6. A. Leuski and J. Allan, “Interactive information retrieval using clustering and
`spatial proximity.” Journal of User Modeling and user-Adapted Interaction,
`Kluwer Academic Publishers, 14(2-3):259-288, June 2004.
`7. A. Arnt, S. Zilberstein, J. Allan, and A-I Mouaddib, “Dynamic composition of
`information retrieval techniques.” Journal of Intelligent Information Systems,
`Kluwer Academic Publishers, 23(1):67-97, 2004.
`8. J. Allan, “Robust Techniques for Organizing and Retrieving Spoken Documents.”
`Eurasip Journal on Applied Signal Processing. Hindawi publishers, 2003(2):103-
`114, February 2003.
`9. J. Allan, “Knowledge Management and Speech Recognition” in IEEE
`Computer. April 2002, pages 46-47.
`10. J. Allan, “Detection as multi-topic tracking.” In Information Retrieval, Kluwer
`Academic Publishers, 5(2/3):139-157, 2002.
`11. R.W.P. Luk, A.T.S. Chan, T.S. Dillon, H.V. Leong, W.B. Croft, and J. Allan, “A
`Survey on Searching and Indexing of XML Documents”. Journal of the
`American Society of Information Science and Technology, 53(6):415-437, 2002.
`12. J. Allan, A. Leuski, R. Swan, and D. Byrd, “Evaluating combinations of ranked
`lists and visualizations of inter-document similarity.” Information Processing and
`Management, 37(3):435-458, 2001.
`13. A. Leuski and J. Allan, “Strategy-based interactive cluster visualization for
`information.” International Journal of Digital Libraries, 3(2): 170-184, 2000.
`14. D. Rus, J. Allan. “Structural queries in electronic corpora”. Multimedia Tools and
`Applications on Electronic Publishing, 6(2):153-170, March 1998.
`15. J. Allan. “Building Hypertext Using Information Retrieval”. Information
`Processing and Management , 33(2):145-159, 1997.
`16. G. Salton, J. Allan, and A. Singhal. “Automatic Text Decomposition and
`Structuring”. Information Processing and Management, 32(2):127-138, 1996.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.2
`
`
`
`Curriculum Vitae, James Allan
`
`3 of 23
`
`17. G. Salton and J. Allan. “Selective Text Utilization and Text Traversal”.
`International Journal of Human-Computer Studies, 43:483-497, 1995.
`18. C. Buckley, J. Allan, and G. Salton. “Automatic Routing and Retrieval
`using Smart: TREC-2”. Information Processing and Management, 31(3):313-324,
`1995.
`19. G. Salton, J. Allan, C. Buckley, and A. Singhal. “Automatic Analysis, Theme
`Generation, and Summarization of Machine-Readable Texts”. Science 264 (3 June
`1994), pp. 1421-1426. Also in Information Retrieval and Hypertext, Agosti and
`Smeaton (Kluwer: 1996), pp. 51-72. Also in Readings in Information
`Visualization, S. Card, J. Mackinlay, and B. Schneiderman, eds., San Francisco:
`Morgan Kaufmann Publishers, 1999, pp. 413-418.
`20. G. Salton, C. Buckley, and J. Allan. “Automatic Structuring and Retrieval
`of Large Text Files”. Communications of the ACM, February, 1994.
`21. G. Salton, J. Allan and C. Buckley. “Automatic Structuring of Text Files”.
`Electronic Publishing, 5(1), March 1992, pp. 1-52.
`Publications (refereed conferences)
`The following conference abbreviations are used frequently below: SIGIR is the ACM
`SIGIR International Conference on Research and Development in Information Retrieval;
`CIKM is the ACM Conference on Information and Knowledge Management; ECIR is the
`European Conference on Information Retrieval; ICTIR is the ACM SIGIR International
`Conference on the Theory of Information Retrieval; CHIIR is the ACM SIGIR
`Conference on Computer Human Interaction in Information Retrieval; WSDM is the
`ACM Conference on Web Search and Data Mining.
`
`1. J. Jiang, D. He, and J. Allan, “Comparing In Situ and Multidimensional Relevance
`Judgments.” To appear in Proceedings of SIGIR 2017.
`2. J. Jiang, D. He, D. Kelly, and J. Allan, “Understanding Ephemeral State of
`Relevance.” Proceedings of CHIIR 2017, pp. 137-146. Best student paper award.
`3. M. Jang, J. Foley, S. Dori-Hacohen, and J. Allan, “Probabilistic Approaches to
`Controversy Detection.” Proceedings of CIKM 2016, pp. 2069-2072.
`4. J. Foley, B. O’Connor, and J. Allan, “Improving Entity Ranking for Keyword
`Queries.” Proceedings of CIKM 2016. Pp. 865-868.
`5. W. Kong and J. Allan, “Precision-Oriented Query Facet Extraction.” Proceedings
`of CIKM 2016, pp. 1433-1442.
`6. J. Jiang and J. Allan, “Correlation between System and User Metrics in a Session.”
`Proceedings of CHIIR (Conference on Human Information Interaction and
`Retrieval) 2016, pp. 285-288.
`7. S. Dori-Hacohen, D. Jensen, and J. Allan, “Controversy Detection in Wikipedia
`Using Collective Classification.” Proceedings of SIGIR 2016, pp. 797-800.
`8. M. Jang and J. Allan, “Improving Automated Controversy Detection on the Web.”
`Proceedings of SIGIR 2016, pp. 865-868.
`9. J. Jiang and J. Allan, “Adaptive Effort for Search Evaluation Metrics.” Proceedings
`of ECIR 2016, pp. 187-199.
`10. J. Foley and J. Allan, “Classifying exam questions into a subject-specific concept
`hierarchy.” Proceedings of ECIR 2016, pp. 575-586.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.3
`
`
`
`Curriculum Vitae, James Allan
`
`4 of 23
`
`11. N. Naji and J. Allan, “On Cross-script Information Retrieval.” Proceedings of
`ECIR 2016, pp. 796-802.
`12. J. Jiang and J. Allan, “Reducing click and skip errors in search result ranking.”
`Proceedings of ACM WSDM (Web Search and Data Mining) 2016, pp. 183-192.
`13. D. Wemhoener, J. Allan, “Balancing Aspects in Retrieved Search Results.”
`Proceedings of ACM ICTIR (Theory of Information Retrieval) 2015, pp. 305-308.
`14. W. Kong, R. Li, L. Jie, A. Zhang, Y. Chang, and J. Allan, “Predicting Search Intent
`Based on Pre-Search Context.” Proceedings of SIGIR 2015, pp. 503-512.
`15. S. Dori-Hacohen and J. Allan, “Automated Controversy Detection on the Web.”
`Proceedings of ECIR 2015, pp. 423-434.
`16. J. Foley and J. Allan, “Retrieving Time from Scanned Books.” Proceedings of
`ECIR 2015, pp. 221-232.
`17. W. Kong and J. Allan, “Extending Faceted Search to the Web.” Proceedings of
`CIKM 2014, pp. 839-848.
`18. J. Jiang and J. Allan, “Necessary and Frequent Terms in Queries.” Proceedings of
`SIGIR 2014, pp. 1167=1170.
`19. J. Jiang, D. He, and J. Allan, “Searching, Browsing, and Clicking in a Search
`Session; Changes of User Behaviors by Tasks and Over Time.” Proceedings of
`SIGIR 2014, pp. 607-616.
`20. J. Dalton, L. Dietz, and J. Allan, “Entity Query Feature Expansion using
`Knowledge Base Links.” Proceedings of SIGIR 2014, pp. 365-364.
`21. W. Kong, E. Aktolga, and J. Allan, “Improving Passage Ranging with User
`Behavior Information.” Proceedings of CIKM 2013, pp. 1999-2008.
`22. S. Dori-Hacohen and J. Allan, “Detecting Controversy on the Web,” Proceedings
`of CIKM 2013, pp. 1845-1848.
`23. J. Dalton, J. Allan, and P. Mirajkar, “Zero-Shot Video Retrieval Using Content and
`Concepts.” Proceedings of ACM CIKM 2013, pp. 1857-1860.
`24. E. Aktolga and J. Allan, “Sentiment Diversification with Difference Biases.”
`Proceedings of SIGIR 2013, pp. 593-602.
`25. H. Field and J. Allan, “Task-aware Query Recommendation.” Proceedings of
`SIGIR 2013, pp. 83-92.
`26. W. Kong and J. Allan, “Extracting Query Facets from Search Results.”
`Proceedings of SIGIR 2013, pp. 93-102.
`27. M. Smucker, J. Allan, and B. Dachev, “Human Question Answering Performance
`Using an Interactive Document Retrieval System.” Proceedings of the 4th
`Information Interaction in Context Symposium (IIiX) 2012, pp. 35-44.
`28. M. Cartright and J. Allan, “Efficiency Optimizations for Interpolating Subqueries.”
`Proceedings of CIKM 2011, pp. 297-306.
`29. J. Dalton, J. Allan, and D. Smith, “Passage Retrieval for Incorporating Global
`Evidence in Sequence Labeling.” Proceedings of CIKM 2011, pp. 355-364.
`30. E. Aktolga and J. Allan, “Reranking Search Results for Sparse Queries.”
`Proceedings of CIKM 2011, pp. 173-183.
`31. X. Yi and J. Allan, “Discovering Missing Click-through Query Language
`Information for Web Search.” Proceedings of CIKM 2011, pp. 153-162.
`32. H. Feild, J. Allan, and J. Glatt, “CrowdLogging: Distributed, Private, and
`Anonymous Search Logs.” Proceedings of SIGIR 2011, pp. 375-384.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.4
`
`
`
`Curriculum Vitae, James Allan
`
`5 of 23
`
`33. E. Aktolga, J. Allan, and D. Smith, “Passage reranking for question answering
`using syntactic structures and answer types.” Proceedings of ECIR 2011, pp. 617-
`628.
`34. M. Cartright, J. Allan, V. Lavrenko, and A. McGregor, “Fast query expansion
`using approximations of relevance models.” Proceedings of CIKM 2010, pp.
`1573-1576.
`35. H. Feild, J. Allan, and R. Jones, “Predicting searcher frustration.” Proceedings of
`SIGIR 2010, pp. 34-41.
`36. X. Yi and J. Allan, “A content based approach for discovering missing anchor text
`for web search.” Proceedings of SIGIR 2010, pp. 427-434.
`37. A. Feng and J. Allan, “Incident threading for news passages.” Proceedings of
`CIKM 2009, pp. 1307-1316.
`38. M. Smucker and J. Allan, “A new measure of the cluster hypothesis.” Proceedings
`of the 2nd Int’l conference on the Theory of Information Retrieval (ICTIR 2009),
`2009, pp.281-288.
`39. N. Balasubramanian and J. Allan, “Syntactic query models for restatement
`retrieval.” Proceedings of String Processing and Information Retrieval Symposium
`(SPIRE) 2009, pp. 143-155.
`40. X. Yi and J. Allan, “A Comparative Study of Utilizing Topic Models for
`Information Retrieval.” Proceedings of ECIR 2009, pp. 29-41.
`41. B. Carterette, V. Pavlu, E. Kanoulas, J. Aslam, and J. Allan, “If I had a Million
`Queries”. Proceedings of ECIR 2009, pp. 288-300.
`42. M. Lease, J. Allan, and W.B. Croft, “RegressionRank: Learning to Meet the
`Opportunity of Descritive Queries.” Proceedings of ECIR 2009, pp. 90-101.
`43. L. Friedland and J. Allan, “Joke Retrieval: Recognizing the same joke told
`differently.” Proceedings of CIKM 2008, pp. 883-892.
`44. K. Parton, K. McKeown, J. Allan, and E. Henestroza, “Simultaneous Multilingual
`Search for Translingual Information Retrieval.” Proceedings of CIKM 2008, pp.
`719-728.
`45. K. Lee, W.B. Croft, and J. Allan, “A Cluster-Based Resampling Method for
`Pseudo-Relevance Feedback.” Proceedings of ACM SIGIR 2008, pp. 235-242.
`46. B. Carterette, V. Pavlu, E. Kanoulas, J. Aslam, and J. Allan, “Evaluation over
`Thousands of Queries.” Proceedings of ACM SIGIR 2008, pp. 651-658.
`47. G. Kumaran and J. Allan, “Effective and Efficient User Interaction for Long
`Queries.” Proceedings of ACM SIGIR 2008, pp. 11-18.
`48. A. Feng and J. Allan, “Finding and Linking Incidents in News.” Proceedings of
`CIKM 2007, pp. 821-829.
`49. M. Smucker, J. Allan, and B. Carterette, “A Comparison of Statistical Significance
`Tests for Information Retrieval Evaluation.” Proceedings of CIKM 2007, pp. 623-
`632.
`50. V. Lavrenko, X. Yi, and J. Allan, “Information Retrieval on Empty Fields.”
`Proceedings of the NAACL HLT Conference, 2007, pp. 89-96.
`51. G. Kumaran and J. Allan, “A Case for Shorter Queries, and Helping Users Create
`Them.” Proceedings of the NAACL HLT Conference, 2007, pp. 220-227.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.5
`
`
`
`Curriculum Vitae, James Allan
`
`6 of 23
`
`52. B. Schiffman, K. McKeown, R. Grishman, and J. Allan, “Question Answering
`Using Integrated Information Retrieval and Information Extraction.” Proceedings
`of the NAACL HLT Conference, 2007, pp. 532-539.
`53. R. Bekkerman, H. Raghavan, J. Allan, and K.Eguchi, “Interactive Clustering of
`Text Collections According to a User-Specified Criterion.” Proceedings of the
`International Joint Conference on Artificial Intelligence (IJCAI), pp. 684-689,
`2007.
`54. R. Bekkerman, S. Zilberstein, and J. Allan, “Web Page Clustering using Heuristic
`Search in the Web Graph.” Proceedings of the International Joint Conference on
`Artificial Intelligence (IJCAI), pp. 2280-2285, 2007.
`55. B. Carterette, J. Allan, and R. Sitaraman, “Minimal Test Collections for Retrieval
`Evaluation.” Proceedings of ACM SIGIR, pp. 268-283, 2006. Best paper award
`56. M. Smucker and J. Allan, “Find-Similar: Similarity Browsing as a Search Tool.”
`Proceedings of ACM SIGIR, pp. 461-468, 2006.
`57. B. Carterette and J. Allan, “Incremental Test Collections.” Proceedings of the
`Conference on Information and Knowledge Management (CIKM), pp. 680-687,
`2005.
`58. J. Allan, B. Carterette, and J. Lewis, “When Will Information Retrieval Be Good
`Enough?” Proceedings of SIGIR, pp. 433-440, 2005.
`59. H. Raghavan and J. Allan, “Matching Inconsistently Spelled Names in Automatic
`Speech Recognizer Output for Information Retrieval.” Proceedings of
`HLT/EMNLP, pp. 251-458, 2005.
`60. G. Kumaran and J. Allan, “Using Names and Topics for New Event Detection.”
`Proceedings of HLT/EMNLP, pp. 121-128, 2005.
`61. J. Allan, S. Harding, D. Fisher, A. Bolivar, S. Guzman-Lara, and P. Amstutz,
`“Taking Topic Detection From Evaluation to Practice.” To appear in Proceedings
`of the Hawaii International Conference on System Sciences, January 2005.
`62. R. Nallapati, A. Feng, F. Peng, and J. Allan, “Event Threading Within News
`Topics.” Proceedings of CIKM, November 2004, pp. 446-453.
`63. G. Kumaran and J. Allan, “Text Classification and Named Entities for New Event
`Detection.” Proceedings of SIGIR, pp. 297-304, July 2004.
`64. C.H. Gooi and J. Allan, “Cross-Document Coreference on a Large Scale Corpus.”
`Proceedings of NAACL/HLT, pp. 9-16, 2004.
`65. D. Kelly, F. Diaz, N. Belkin, and J. Allan, “A User-centered Approach to
`Evaluating Topic Models.” Proceedings of the European Conference on
`Information Retrieval, April 2004.
`66. E.M. Kornfield, R. Manmatha, and J. Allan, “Text Alignment with Handwritten
`Documents.” International Workshop on Document Image Analysis for Libraries
`(DIAL 2004), January 2004.
`67. J. Allan, A. Feng, and A. Bolivar, “Flexible Intrinsic Evaluation of Hierarchical
`Clustering for TDT.” Proceedings of CIKM, pp. 263-270, 2003.
`68. J. Allan, C. Wade, and A. Bolivar, “Retrieval and novelty detection at the sentence
`level.” Proceedings of SIGIR, pp. 314-321, 2003.
`69. R. Nallapati and J. Allan, “Capturing term dependencies using a sentence tree
`based language model.” Proceedings of CIKM, pp. 383-390, November 2002.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.6
`
`
`
`Curriculum Vitae, James Allan
`
`7 of 23
`
`70. A. Leuski and J. Allan, “Improving realism of topic tracking evaluation.”
`Proceedings of SIGIR, pp. 89-96, 2002.
`71. J. Allan and H. Raghavan, “Using part-of-speech patterns to reduce query
`ambiguity.” Proceedings of SIGIR, pp. 307-314, 2002.
`72. V. Lavrenko, J. Allan, E. DeGuzman, D. LaFlamme, V. Pollard, and S. Thomas,
`“Relevance Models for Topic Detection and Tracking.” Proceedings of the
`Human Language Technology Conference (HLT), pp. 104-110, 2002.
`73. J. Allan, V. Khandelwal, and R. Gupta, “Time-based Summarization of News”,
`Proceedings of SIGIR, pp. 10-18, 2001. Best paper award.
`74. J. Allan, V. Lavrenko, and H. Jin, “First story detection in TDT is hard”,
`Proceedings of CIKM, pp. 374-181, 2000.
`75. V. Lavrenko, M. Schmill, D. Lawrie, P. Ogilvie, D. Jensen, and J. Allan,
`“Language models for financial news recommendation,” Proceedings of CIKM, pp.
`389-396, 2000.
`76. A. Leuski and J. Allan, “Lighthouse: Showing the Way to Relevant Information,”
`Proceedings of the IEEE Symposium on Information Visualization, IEEE Computer
`Society, pp. 125-130, 2000.
`77. R. Swan and J. Allan, “Automatic Generation of Overview Timelines,”
`Proceedings of SIGIR, pp. 49-56, 2000.
`78. A. Leuski and J. Allan, “Improving Interactive Retrieval by Combining Ranked
`List and Clustering.” Proceedings of RIAO, College de France, pp. 665-681, 2000.
`79. R. Swan and J. Allan, “Extracting Significant Time Varying Features from Text,”
`Proceedings CIKM, ACM Press, pp. 38-45, 1999.
`80. R. Papka and J. Allan, “Document Classification using Multiword Features,”
`Proceedings of CIKM, 1998.
`81. A. Leouski and J. Allan, “Evaluating a Visual Navigation System for a Digital
`Library,” Proceedings of European Digital Libraries Conference, pp. 535-554,
`1998.
`82. J. Allan, R. Papka, and V. Lavrenko, “On-line New Event Detection and
`Tracking.” Proceedings of SIGIR, pp. 37-45, 1998. SIGIR Test of Time Award.
`83. R. Swan and J. Allan, “Aspect Windows, 3-D Visualizations, and Indirect
`Comparisons of Information Retrieval Systems.” Proceedings of SIGIR, pp. 173-
`181, 1998.
`84. M. Hirsch and J. Allan, “A Graphic Interface for User Directed Clustering of
`Retrieved Documents.” Proceedings of the 1997 Spring Congress, American
`Medical Informatics Association, p. 95, 1997.
`85. J. Allan. “Incremental Relevance Feedback for Information Filtering”. Proceedings
`of SIGIR, pp. 270-278, 1996.
`86. J. Allan. “Automatic Hypertext Link Typing”. Proceedings of the ACM
`Conference on Hypertext, pp. 42-52, 1996.
`87. J. Allan. “Relevance Feedback with Too Much Data”. Proceedings of SIGIR, pp.
`337-343, 1995.
`88. J. Allan and D. Rus. “Structural Queries in Electronic Corpora”. Proceedings of
`DAGS, 1995.
`89. G. Salton and J. Allan. “Automatic Text Decomposition and Structuring”.
`Proceedings of RIAO, 1994.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.7
`
`
`
`Curriculum Vitae, James Allan
`
`8 of 23
`
`90. C. Buckley, G. Salton, and J. Allan. “The Effect of Adding Relevance Information
`in a Relevance Feedback Environment”. Proceedings of SIGIR, 1994.
`91.G. Salton and J. Allan. “Text Retrieval Using the Vector Processing Model.”
`Proceedings of the Third Annual Symposium on Document Analysis and
`Information Retrieval (SDAIR), 1994.
`92. G. Salton and J. Allan. “Selective Text Utilization and Text Traversal”.
`Proceedings of ACM Hypertext, pp. 131-144, 1993.
`93. J. Allan and G. Salton. “The Identification of Text Relations Using Automatic
`Hypertext Linking.” Workshop on Intelligent Hypertext, in conjunction with the
`Second International Conference on Information and Knowledge Management,
`1993.
`94. J. Allan, J. Davis, D. Krafft, D. Rus, and D. Subramanian. “Information Agents for
`Building Hyperlinks.” Workshop on Intelligent Hypertext, in conjunction with the
`Second International Conference on Information and Knowledge Management,
`1993.
`95. G. Salton, J. Allan, and C. Buckley. “Approaches to Passage Retrieval in Full
`Text Information Systems”. Proceedings of SIGIR, pp. 49-58, 1993.
`96. G. Salton, J. Allan, and C. Buckley. “Automatic Determination of Content
`Relationships in Natural Language Texts”. EP-92 Proceedings, C. Vanoirbeek and
`G. Coray, eds., Cambridge Univ. Press, Cambridge, England 1992, pp. 165-182.
`Publications (refereed posters and demonstrations)
`97. H. Field and J. Allan, “Task Aware Search Assistant.” Proceedings of ACM
`SIGIR, p. 1015, 2012.
`98. M. Cartright, E. Can, W. Dabney, J. Dalton, K. Krstovski, X. Wu, I. Yalniz, J.
`Allan, R. Manmatha, and D. Smith, “A Framework for Manipulating and Searching
`Multiple Retrieval Types.” Proceedings of ACM SIGIR, p. 1001, 2012.
`99. N. Balasubramanian and J. Allan, “Learning to select rankers.” Proceedings of
`SIGIR 2010.
`100. M. Smucker, J. Allan, and B. Carterette, “Agreement among statistical
`significance tests for information retrieval evaluation at varying sample sizes.”
`Proceedings of ACM SIGIR 2009, pp. 630-631.
`101. E. Aktolga, M.A. Cartright, and J. Allan, “Cross-document Cross-lingual
`Coreference Retrieval.” Poster published in Proceedings of CIKM 2008, pp. 1359-
`1360.
`102. X. Yi and J. Allan, “Evaluating topic models for information retrieval.” Poster
`published in Proceedings of CIKM 2008, pp. 1431-1432.
`103. G. Kumaran and J. Allan, “Selective User Interaction.” Poster published in the
`Proceedings of CIKM, pp. 923-926. 2007.
`104. B. Carterette and J. Allan, “Semiautomatic Evaluation of Retrieval Systems Using
`Document Similarities.” Poster published in the Proceedings of CIKM, pp. 873-
`876. 2007.
`105. N. Balasubramanian, J. Allan, and W.B. Croft, “A Comparison of Sentence
`Retrieval Techniques.” Poster published in Proceedings of the ACM SIGIR, 2007,
`pp. 813-4.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.8
`
`
`
`Curriculum Vitae, James Allan
`
`9 of 23
`
`106. X. Yi, J. Allan, and W.B. Croft, “Matching Resumes and Jobs Based on
`Relevance Models.” Poster published in Proceedings of the ACM SIGIR, 2007,
`pp. 809-10.
`107. M.D. Smucker and J. Allan, “Using Similarity Links as Shortcuts to Relevant
`Web Pages.” Poster published in the Proceedings of ACM SIGIR 2007, p. 863-4.
`108. G. Kumaran and J. Allan, “Information Retrieval Techniques for Templated
`Queries.” Poster published in the Proceedings of RIAO 2007, p. 104.
`109. B. Carterette and J. Allan, “Research Methodologies in Studies of Information
`Retrieval Evaluation.” Poster published in the Proceedings of RIAO 2007, p. 112.
`110. X. Yi, J. Allan, and V. Lavrenko, “Discovering Missing Values in Semi-
`Structured Databases.” Poster published in the Proceedings of RIAO 2007, poster
`number 113, pages 1-15.
`111. M. Smucker and J. Allan, “Lightening the Load of Document Smoothing for
`Better Language Modeling Retrieval.” Poster published in the Proceedings of
`ACM SIGIR 2006, pp. 699-700, 2006.
`112. G. Kumaran and J. Allan, “Simple Questions to Improve Pseudo-Relevance
`Feedback Results.” Poster published in the Proceedings of ACM SIGIR 2006, pp.
`661-662, 2006.
`113. R. Nallapati, W.B. Croft, and J. Allan, “Relevant Query Feedback in Statistical
`Language Modeling.” Poster in Proceedings of CIKM, pp.560-563, 2003.
`114. J. Allan and G. Kumaran, “Stemming in the language modeling framework.”
`Poster in Proceedings of SIGIR, pp. 455-456, 2003.
`115. R. Manmatha, A. Feng, and J. Allan, “A critical examination of TDT’s cost
`function.” Poster in Proceedings of SIGIR, pp. 403-404, 2002.
`116. V. Khandelwal, R. Gupta, and J. Allan, “An Evaluation Scheme for Summarizing
`Topic Shifts in News Streams”, poster appearing in Proceedings of the Human
`Language Technology Conference (HLT), 2001.
`117. A. Leouski and J. Allan, “Visual Interactions with a Multidimensional Ranked
`List.” Poster presentation in Proceedings of SIGIR, 1998.
`Publications (book chapters)
`118. J. Allan, W.B. Croft, and J. Callan, “The University of Massachusetts and a
`Dozen TRECs.” In TREC: Experiment and Evaluation in Information Retrieval,
`E.M. Voorhees and D.K. Harman, eds. MIT Press, chapter 11, pp. 261-286, 2005.
`119. J. Allan, “Modeling Topics for Detection and Tracking” in Pattern Recognition in
`Speech and Language Processing, Wu Chou and Biing Hwang Juang, eds., pp.
`353-376. CRC Press, 2003.
`120. J. Allan, “Perspectives on Information Retrieval and Speech” in Information
`Retrieval Techniques for Speech Applications, A.R. Coden, E.W. Brown, and S.
`Srinivasan, eds. Springer-Verlag Lecture Notes in Computer Science, volume
`2273, pp. 1-10, 2002.
`121. J. Allan, “Introduction to Topic Detection and Tracking” in Topic Detection and
`Tracking: Event-based Information Organization, J. Allan, ed., Kluwer Academic
`Publishers, pp. 1-16, 2002.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.9
`
`
`
`Curriculum Vitae, James Allan
`
`10 of 23
`
`122. J. Allan, V. Lavrenko, and R. Swan, “Explorations Within Topic Tracking and
`Detection” in Topic Detection and Tracking: Event-based Information
`Organization, J. Allan, ed., Kluwer Academic Publishers, pp. 197-224, 2002.
`123. R. Papka and J. Allan, “Topic Detection and Tracking: Event Clustering as a
`Basis for First Story Detection,” in Advances in Information Retrieval, W. B. Croft,
`ed., Kluwer Academic Publishers, pp. 97-126, 2000.
`Publications (as editor)
`124. J. Allan, W.B. Croft, A. Moffat, and M. Sanderson, eds., “Frontiers, Challenges,
`and Opportunities for Information Retrieval: Report from SWIRL 2012.” SIGIR
`Forum, 46(1):2-32, June 2012.
`125. K. Jarvelin, J. Allan, P. Bruza, and M. Sanderson, Proceedings of SIGIR 2004,
`ACM Press, 2004.
`126. J. Allan, ed., Topic Detection and Tracking: Event Based Information Retrieval,
`Kluwer Academic Press, 2002.
`127. J. Allan, ed., Proceedings of the First International Conference on Human
`Language Technology Research (HLT 2001), Morgan Kauffman, 2001.
`Publications (workshops)
`128. S. Dori-Hacohen, E. Yom-Tov, and J. Allan, “Navigating Controversy as a
`Complex Search Task.” Proceedings of the ECIR Workshop on Supporting
`Complex Search Tasks, CEUR Workshop 2015, volume 1338: 5 pages.
`129. J. Allan, J. Dalton, J. Foley, R. Manmatha, Venkatesh Murthy, D. Wemhoener,
`“Short Text Queries for Video Retrieval: Multimedia Event Detection at
`TRECVID 2013.” Proceedings of the TRECVID 2013 workshops, NIST.
`130. J. Liu, H. Cheng, O. Javed, Q. Yu, I. Chakraborty, W. Zhang, A. Divakaran, H.S.
`Sawhney, J. Allan, R. Manmatha, J. Foley, M. Shah, A. Dehghan, M. Witbrock, J.
`
`Curtis, G. Friedland(cid:481)(cid:3)(cid:498)SRI-Sarnoff AURORA System at TRECVID 2013:
`
`Multimedia Event Detection and Recounting”. Proceedings of the TRECVID 2013
`workshop, NIST.
`131. H. Field, M. Cartright, J. Allan, “The University of Massachusetts Amherst’s
`participation in the INEX 2011 Prove It Track.” Proceedings of the Initiative for
`the Evaluation of XML (INEX), 2011.
`132. M. Cartright, H. Feild, and J. Allan, “Evidence Finding Using a Collection of
`Books.” Proceedings of the BooksOnline’11 workshop at CIKM 2011, pp. 11-18.
`133. D.A. Smith, R. Manmatha, and J. Allan, “Mining Relational Structure from
`Millions of Books.” Proceedings of the BooksOnline’11 workshop at CIKM 2011.
`134. N. Balasubramanian, M. Bendersky, and J. Allan, “Cost-effective combination of
`multiple rankers: learning when not query.” New England Student Conference in
`Artificial Intelligence (NESCAI) 2010.
`135. H. Field, J. Allan, “Modeling Searcher Frustration.” Proceedings of the
`Workshop on Human-Computer Interaction and Information Retrieval (HCIR),
`2009, pp. 5-8.
`136. J. Allan, J. Aslam, B. Carterette, V. Pavlu, and E. Kanoulas, “Million Query
`Track 2008 Overview.” Proceedings of TREC. Notebook version, October 2008.
`Final version on-line February 2008, in print forthcoming.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.10
`
`
`
`Curriculum Vitae, James Allan
`
`11 of 23
`
`137. B. Armstrong, X. Yi, and J. Allan, “Indri at TREC 2008: Million Query (1MQ)
`Track.” Proceedings ofTREC 2008. Notebook version, October 2008. Final
`version on-line February 2009.
`138. J. Allan, B. Carterette, J.A. Aslam, V. Pavlu, B. Dachev, and E. Kanoulas,
`“Million Query Track 2007 Overview.” Proceedings of TREC. Notebook version,
`October 2007. Final version online February 2008, in print December 2008, pp.
`85-104.
`139. M.D. Smucker, J. Allan, and B. Dachev, “UMass Complex Interactive Question
`Answering (ciQA) 2007: Human Performance as Question Answerers.”
`Proceedings of TREC. (4 pages) Notebook version, October 2007. On-line
`version, February 2008.
`140. X.Yi and J. Allan, “Indri at TREC 2007: Million Query (1MQ) Track.”
`Proceedings of TREC. (4 pages) Notebook version, October 2007. On-line
`version, February 2008.
`141. M. Smucker and J. Allan, “Measuring the Navigability of Document Networks.”
`SIGIR 2007 workshop on Web Information Seeking and Interaction, 2007.
`142. G. Kumaran and J. Allan, “UMass at TREC ciQA.” Proceedings of the 2006 Text
`Retrieval Conference (TREC 2006). Available on-line at http://trec.nist.gov.
`143. G. Kumaran and J. Allan, “Eliciting Information for Adaptive Retrieval.”
`Proceedings of the First International Workshop on Adaptive Information Retrieval
`(AIR), pp. 18-19, October 2006.
`144. A. Feng and J. Allan, “Combining Evidence from Homologous Datasets.”
`Workshop on New Directions in Multilingual Information Access at SIGIR 2006,
`August 2006.
`145. J. Allan, “HARD Track Overview in TREC 2005: High Accuracy Retrieval From
`Documents.” Proceedings of 2005 Text Retrieval Conference (TREC 2005), NIST
`special publication 500-266, November 2006.
`146. F. Diaz and J. Allan, “When Less is More: Relevance Feedback Falls Short and
`Term Expansion Succeeds at HARD 2005.” Proceedings of the 2005 Text
`Retrieval Conference (TREC 2005), 2006. Available on-line at http://trec.nist.gov.
`147. N. Abdul-Jaleel, J. Allan, W.B. Croft, F. Diaz, L. Larkey, X. Li, M.D. Smucker,
`and C. Wade, “UMass at TREC 2004: Novelty and HARD.” Proceedings of 2004
`Text Retrieval Conference (TREC 2004), 2005.
`148. J. Allan, “HARD Track Overview in TREC 2004: High Accuracy Retrieval from
`Documents.” Proceedings of TREC 2004, NIST special publication 500-261, pp.
`25-35, 2005.
`149. H. Raghavan, J. Allan, and A. McCallum, “An Exploration of Entity Models,
`Collective Classification and Relation Description.” Proceedings of the Second
`International Workshop on Lnk Analysis and Group Detection (LinkKDD),at ACM
`SIGKDD, pp. 1-10, August 2004.
`150. H. Raghavan and J. Allan, “Using Soundex Codes for Indexing ASR
`Documents.” Proceedings of the Workshop on Interdisciplinary Approaches to
`Speech Indexing and Retrieval, at HLT-NAACL, pp. 22-26, 2004.
`151. J. Allan, “HARD Track Overview in TREC 2003: High Accuracy Retrieval from
`Documents.” Proceedings of TREC 2003, NIST special publication 500-255, pp.
`24-37, 2004.
`
`Petitioner Microsoft Corporation - Ex. 1020, p.11
`
`
`
`Curriculum Vitae, James Allan
`
`12 of 23
`
`152. N. Abdul Jaleel, A. Corrada-Emmanuel, Q. Li, X. Liu, C. Wade, and J.