`Department of Computer Science, Bren Hall 4216
`School of Information and Computer Sciences
`University of California, Irvine
`CA 92697-3435
`telephone: (949) 824 2558
`fax: (949) 824 4056
`email: smyth@ics.uci.edu
`
`Professional Positions
`
`April 1996–present: Professor, Department of Computer Science, University of California, Irvine
`• Full Professor: July 2003 to present
`• Associate Professor: July 1998 to June 2003
`• Assistant Professor: April 1996 to June 1998
`October 1988–March 1996: Member of Technical Staff and Technical Group Leader (from 1992), Jet
`Propulsion Laboratory, California Institute of Technology, Pasadena.
`
`Education
`
`PhD, 1988: California Institute of Technology, Department of Electrical Engineering.
`
`MSEE, 1985: California Institute of Technology, Department of Electrical Engineering.
`
`BE, 1984: National University of Ireland, University College Galway. Bachelor of Engineering (Electronic)
`with First-Class Honors.
`
`Additional Professional Roles and Affiliations
`
`Director, UCI Data Science Initiative, University of California, Irvine, July 2014–present.
`
`Director, Center for Machine Learning and Intelligent Systems, University of California, Irvine, January
`2007–July 2014.
`
`Joint Faculty Appointment with Department of Statistics, UC Irvine, July 2008–present.
`
`Joint Faculty Appointment with Department of Biomedical Engineering, UC Irvine, July 2001–2012.
`
`Faculty Member, Institute for Genomics and Bioinformatics (IGB), UC Irvine, 2001–present.
`
`Faculty Member, Institute for Mathematical Behavioral Sciences (IMBS), UC Irvine, 1999-present.
`
`Faculty Member, Center for Digital Transformation, UC Irvine, 2012–present.
`
`Faculty Member, Program for Mathematical, Computational, and Systems Biology (MCB), UC Irvine,
`2007–present.
`
`Faculty Member, Center for Research on Information Technology and Organizations (CRITO), UC Irvine,
`2008–2012.
`
`Founding Director and Executive Committee Member of the ACM Special Interest Group on Knowledge
`Discovery and Data Mining (SIGKDD), 1998.
`
`Visiting Principal Researcher, Jet Propulsion Laboratory, California Institute of Technology, Pasadena,
`1996–2001.
`
`Member of IEEE (1988–present), American Statistical Association (1997–present), and the Association for
`Computing Machinery (ACM) (1999–present).
`
`1
`
`IPR2017-01039
`Unified EX1009 Page 1
`
`
`
`Honors and Awards
`
`Fellow, Association for Computing Machinery (ACM), 2013
`
`Fellow, Association for the Advancement of Artificial Intelligence (AAAI), 2010
`
`ACM SIGKDD Innovation Award, 2009
`
`ACM SIGKDD Conference best paper awards (1997, 2002), runner-up best paper awards (1998, 2000).
`
`ACM/IEEE Joint Conference on Digital Libraries (JCDL), shortlist for best paper award, 2007.
`
`IBM Faculty Partnership Award, 2001.
`
`National Science Foundation CAREER award, 1997
`
`ACM Teaching Award, UC Irvine, 1997
`
`NASA Group Achievement award, Jet Propulsion Laboratory, 1997.
`
`Lew Allen Award for Excellence in Research, Jet Propulsion Laboratory, 1993
`
`17 NASA Certificates for Technical Innovation (1991–1996)
`
`Consulting and Business Activities
`
`Consultant/Advisor to emnos Inc (2015-2016); Frost Data Capital (2014-2015); AS&T Inc (2013-2015);
`Samsung (2012-2015); SOCCCD (2012-present); DigitalRisk (2010-2012); CoreLogic (2011-2014); Iden-
`tityMetrics (2010-2012); Microsoft (2010-2011); ImageCat (2010); eBay (2009-2011); DataAnalytics
`LLC (2009-2011); Oracle (2008-2011); Netflix (2006-2009); Topicseek LLC (2005-2008); Yahoo! (2005-
`2008); Strativa (2005); IET (2004-2005); JWDirect (2001-2004); Credit Sciences (2000-2004); Nokia
`Research (2000); First Quadrant Financial Services (1998-1999); Smith-Kline Beecham (1998); AT&T
`(1996-1998).
`
`Professional Activities
`
`Journals: Associate/Action Editor
`
`ACM Transactions on Knowledge Discovery from Data, guest editor of special issue on best papers from ACM
`SIGKDD 2011 Conference, TKDD 6(4), 2012.
`
`Journal of the American Statistical Association, 2002 to 2005.
`
`IEEE Transactions on Knowledge and Data Engineering, 2002 to 2004.
`
`Machine Learning Journal, July 1998 to December 2001.
`
`Machine Learning Journal, guest editor of special issue on probabilistic learning, 1997.
`
`Journals, Book Series, Centers: Editorial Board/Advisory Board Member
`
`Journal of Machine Learning Research, 2000-present.
`
`Journal of Data Mining and Knowledge Discovery, 1997-present.
`
`Chapman and Hall: Series in Computer Science and Data Analysis, 2002-2008.
`
`Bayesian Analysis, 2004-2007.
`
`Insight Center for Data Analytics, University College Dublin, Scientific Advisory Member, 2015-present.
`
`Journals: Reviewer
`
`Reviewer for IEEE Transactions on Information Theory, IEEE Trans. on Neural Networks, IEEE Trans.
`on Signal Processing, IEEE Trans. on Circuits and Systems, IEEE Trans. on Pattern Analysis and
`Machine Intelligence, IEEE Trans. on Knowledge and Data Engineering, Statistics and Computing,
`Journal of Artificial Intelligence Research, Pattern Recognition Letters, Neural Networks, Machine
`Learning, Journal of Machine Learning Research, ACM Transactions on Knowledge Discovery from
`Data, Communications of the ACM, Journal of the American Statistical Association, Bayesian Anal-
`ysis.
`
`Conference Program and General Chair Positions
`
`Program Chair for the Uncertainty in Artificial Intelligence (UAI) Conference, 2013.
`
`Program Chair for 17th ACM SIGKDD Conference, San Diego, 2011.
`
`Program Chair for the Symposium on the Interface between Statistics and Computing, Costa Mesa, CA,
`June 2001.
`
`General Chair for the Sixth International Conference on Artificial Intelligence and Statistics, January 1997.
`
`Other Conference and Workshop Organization Roles
`
`Workshop Co-Chair/Organizer for: Workshop on Algorithmic and Statistical Approaches for Large So-
`cial Network Data Sets, NIPS Conference, Lake Tahoe, 2012; Workshop on User-Centered Modeling,
`Institute for Mathematics and its Applications (IMA), University of Minnesota, 2012; Workshop on
`Scientific Data Mining, Institute for Pure and Applied Mathematics (IPAM), UCLA, 2002; Workshop
`on Temporal and Spatial Machine Learning, International Conference on Machine Learning (ICML),
`2001; Massive Datasets workshop at the 1998 Neural Information Processing Conference (NIPS).
`
`Other Conference Organization Roles: Panels chair for ACM SIGKDD Fifth International Conference on
`Knowledge Discovery and Data Mining, 1999; Tutorials co-chair for National Conference on Artificial
`Intelligence, 1998; Tutorials chair for the ACM SIGKDD Conferences on Knowledge Discovery and Data
`Mining, 1997 and 1998; Publicity Chair for the ACM SIGKDD Conferences on Knowledge Discovery
`and Data Mining, 1995 and 1996.
`
`Conference Reviewing and Program Committees
`
`Neural Information Processing Conference (NIPS), International Conference on Machine Learning (ICML),
`Uncertainty in Artificial Intelligence Conference (UAI), Artificial Intelligence and Statistics Conference
`(AI-Stats), European Conference on Machine Learning (ECML/PKDD), ACM Conference on Knowl-
`edge Discovery and Data Mining (SIGKDD), WWW Conference, International Conference on Pattern
`Recognition (ICPR), International Joint Conference on Artificial Intelligence (IJCAI), American As-
`sociation for Artificial Intelligence Conference (AAAI), Pattern Recognition in Practice Workshops.
`
`Postdoctoral Advisees
`
`Michal Rosen-Zvi, 2003-2004; IBM Research, Israel.
`Michael Duff, 2005-2006; Assistant Professor, Genetics/Developmental Biology, University of Connecticut.
`Alex Ihler, 2005-2006; Associate Professor, Department of Computer Science, UC Irvine.
`Romain Thibaux, 2008-2009; Google, Mountain View, CA
`Ralf Krestel, 2011-2013; Senior Researcher, Hasso-Plattner Institute, Potsdam, Germany.
`Tracy Holsclaw, 2011-2014; Consultant, San Jose, CA
`
`Graduate Students
`
`PhD Advisees and Current Positions
`
`Nick Navaroli, PhD 2014: Google, Irvine, CA
`Jimmy Foulds, PhD 2014: Postdoc, UC San Diego
`Chris DuBois, PhD 2013: Software Engineer, Dato, Seattle
`America Chambers, PhD 2013: Assistant Professor, Department of Mathematics and Computer Science,
`University of Puget Sound
`Drew Frank (co-advised with Alex Ihler), PhD 2013: Google, UK
`Arthur Asuncion, PhD 2011: Google, Seattle, WA
`Jon Hutchins (co-advised with Alex Ihler), PhD 2010: Google, Irvine, CA
`Chaitanya Chemudugunta, PhD 2009: Manager, Data Science, Blizzard Inc., Irvine, CA
`Seyoung Kim, PhD 2007: Assistant Professor, Department of Bioinformatics, CMU, Pittsburgh
`Darya Chudova, PhD 2007: Senior Director of Bioinformatics, Guardant Health, Redwood City, CA
`Sergey Kirshner, PhD 2005: Researcher, SkyTree Inc, San Jose, CA
`Scott Gaffney, PhD 2004: VP, Search Engineering, Sunnyvale, CA
`Xianping Ge, PhD 2002: Google, Mountain View, CA
`Igor V. Cadez, PhD 2002: Consultant, Orange County, CA.
`Dimitry Pavlov, PhD 2001: VP, Advertising Technology, Sunnyvale, CA
`
`Current PhD Students
`
`Advanced to Candidacy: Kevin Bache (2014), Moshe Lichman (2014), Eric Nalisnick (2015)
`Pre-Candidacy: Zach Butler, Dimitris Kotzias, Jihyun Park, Chris Galbraith
`
`PhD Thesis Committee Member
`
`UC Irvine, Computer Science:
`Sam Hallman (2015), David Keator (2015), Qiang Liu (2014), Anoop Korattikara (2014), Levi Boyles
`(2014), Yutian Chen (2013), Lars Otten (2013), Chaitanya Desai (2012), Hamed Pirsiavash (2012),
`Behzad Sajadi (2012), David Orendorff (2012), Pinaki Sinha (2011), Chloe Azencott (2010), Vibhav
`Gogate (2009), Radu Marinescu (2008), Robert Mateescu (2007), Bozhena Bidyuk (2005), Stephen
`Bay (2001), Irina Rish (1999), Chris Merz (1998), Pedro Domingos (1997).
`
`UC Irvine, Other Departments:
`Sepide Sarachi (Civil and Environmental Engineering, 2015), Justin Chung (Informatics, 2015), Co-
`lene Haffke (Earth Systems Science, 2015), Kevin Heins (Statistics, 2014), Michael Salmans (Biological
`Chemistry, 2014), Emma Spiro (Sociology, 2013), Zack Almquist (Sociology, 2013), Kim Aeling (Microbiology
`and Molecular Genetics, 2007), Bethany Knapp (Cognitive Science, 2002).
`
`Other Universities (External Committee Member or Examiner):
`Ramnath Balasubramanyan (CMU, 2013), Mindaugus Norkus (National University of Ireland, Galway,
`2013), Xuerei Wang (U Mass Amherst, 2009), Sangmin Oh (Georgia Tech, 2009), Carla Domencioni
`(UC Riverside, 2002), John Lindal (Caltech, 2000), Srinivas Aji (Caltech, 2000), David Babcock (Cal-
`tech, 2000), Gavin Horn (Caltech, 1999), Lonnie Chrisman (CMU, 1996), Michael Burl (Caltech, 1996),
`Barry Ambrose (Caltech, 1995), Zheng Zeng (Caltech, 1995).
`
`PhD Candidacy/Thesis Proposal Committees
`
`UC Irvine: Daniel Quang, 2015 (CS), Bailey Kong, 2015 (CS), Sholeh Fourazan, 2015 (CS), Coral Wheeler,
`2014 (Physics), Raul Diaz, 2014 (CS), Golnaz Ghiasi, 2014 (CS), Wei Ping, 2014 (CS), Sam Hallman,
`2013 (CS), Peter Sadowski, 2013 (CS), William Lam, 2013 (CS), Justin Chung, 2013 (Informatics),
`Kevin Heins, 2012 (Statistics), Zack Almquist, 2012 (Sociology), Ashley Payne, 2012 (Earth System
`Sciences), Ragupathyraj Valluvan, 2012 (EECS), Emma Spiro, 2012 (Sociology), Michael Salmans
`(Biological Chemistry), Colene Haffke, 2011 (Earth System Sciences), Tim Rubin, 2011 (Cognitive
`Sciences), Brendan Rogers, 2011 (Earth System Sciences), Hamed Pirsiavash, 2011 (CS), Behzad Sa-
`jadi, 2011 (CS), Qiang Liu, 2011 (CS), Anoop Korattikara, 2011 (CS), David Keator, 2010 (CS),
`Kenny Daily, 2010 (CS), Yutian Chen, 2009 (CS), Lars Otten, 2009 (CS), David Orendorff, 2009 (CS),
`Chloe Azencott, 2009 (CS), Chaitanya Desai, 2008 (CS), Pinaki Sinha, 2007 (CS), Guy Yosiphon, 2006
`(ICS), Bo Gong, 2006 (ICS), Lin Wu, 2005 (ICS), Yiming Ma, 2004 (ICS), Dawit Seid, 2004 (ICS),
`John Abatzoglu, 2004 (Earth System Sciences), Suman Sundaresh, 2003 (ICS), Mingliang Li, 2002
`(Economics), Ye Sun, 2001 (ICS), Bethany Knapp, 2000 (Cognitive Science), Stephen Bay, 1999 (ICS),
`Daniel Billsus, 1998 (ICS), Pei Suen, 1998 (ECE), Chris Merz, 1997 (ICS).
`
`Other Universities: Ramnath Balasubramanyan, 2012 (CMU), Srinivas Aji, 1999 (Caltech), Gavin Horn,
`1998 (Caltech), John Lindal, 1998 (Caltech).
`
`Masters Students Supervised
`
`UC Irvine, Information and Computer Science: Homer Strong (2016), Scott Crawford (2012), Corey
`Schaninger (2012), Scott Triglia (2011), Ajay Mishra (2008), Scott White (2006), Joshua O’Madadhain
`(2006), Vasanth Kumar (2006), Sridevi Parise (2003), Naval Verma (2002), Wagner Truppel (2001),
`Scott Lundgren (1997).
`
`Royal Institute of Technology: Stefan Edlund (1997), Department of Numerical Analysis and Computing
`Science, Stockholm: Thesis entitled Methods for Cluster Analysis with Applications to Large NASA
`Data Sets.
`
`University of Freiburg: Daniel Henke (2007), Department of Computer Science, MS Diplom Thesis.
`
`Research Grants, Contracts and Gifts
`
`62. Development of Computational Methods for Evaluating Patient-Doctor Communication, PCORI,
`$395,745 (UCI portion), Feb 1 2017 to Jan 31st 2020, co-Investigator (PI: Zac Imel, U Utah).
`
`61. NRT-DESE: Team Science for Integrative Graduate Training in Data Science and Physical Science,
`NSF, award number 1633631, Sep 15 2016 to Aug 31 2021, $2,967,150, Principal Investigator.
`
`60. Learning Individual Predictive Choice Models, Adobe Research Award, $50,000, October 2016,
`Principal Investigator.
`
`59. Transformative Computational Infrastructures for Cell-Based Biomarker Diagnostics, NIH, award number
`U01TR001801-01, 09/01/16 to 08/31/21, $766,000 (UCI portion), co-Investigator (PI: Richard
`Scheuermann, Venter Institute/UCSD).
`
`58. The Big DIPA: Data Image Processing and Analysis, NIH BD2K Program, award number
`1R25EB022366-01, $486,000, Sept 30 2015 to June 30th 2018, co-Investigator (UCI PI: Charless
`Fowlkes).
`
`57. Investigating Virtual Learning Environments, National Science Foundation, award number NSF-
`1535300, $2,500,000, Oct 1 2015 to Sept 30th 2020, co-Investigator (UCI PI: Mark Warschauer).
`
`56. Forensic Science Center of Excellence, National Institute of Standards and Technology (NIST), award
`number 70NANB15H176, $20,000,000 ($4,000,000 for UC Irvine), Oct 1 2015 to Sept 30th 2020, co-
`Investigator (UCI PI: Hal Stern).
`
`55. Data-Intensive Research and Education Center in Science, Technology, Engineering, and Mathematics
`(DIRECT-STEM), NASA MIRO program, award number NNX15AQ06A, $5,000,000 ($1,250,000 for
`UC Irvine), Sept 1 2015 to Aug 31st 2020, Principal Investigator.
`
`54. Analyzing Individual Event Data over Time, Google Faculty Research Award, $60,000, March 2014,
`Principal Investigator.
`
`53. Peer Assessment and Academic Achievement in a Gateway MOOC, Bill and Melinda Gates Foundation,
`Oct 1 2013, $25,000, Co-Investigator (PI: Mark Warschauer, UC Irvine).
`
`52. Statistical Learning Algorithms for Micro-Event Time Series Data, National Science Foundation, award
`number IIS-1320527, Oct 1 2013 to Sept 30th 2016, $499,880, Principal Investigator.
`
`51. Balancing the Portfolio: Efficiency and Productivity of Federal Biomedical R&D Funding, National Sci-
`ence Foundation, award number 1158699, Aug 15 2012 to July 31 2015, $297,331, Principal Investigator
`(original PI, David Newman).
`
`50. Location-based Social Media for Context-based Analysis of Transportation Data, Xerox UAC Research
`Award, Jan 1st 2013 to Dec 31st 2015, $90,000 gift, Principal Investigator.
`
`49. Collaborative Research, Type 1: Decadal Prediction and Stochastic Simulation of Hydroclimate over
`Monsoonal Asia, US Department of Energy, award number DOE SC0006619, Sept 1st 2011 to August
`31st 2014, $180,000, Co-Investigator (PI: Andrew Robertson, Columbia University).
`
`48. Copernicus: System for Foresight and Understanding from Scientific Exposition, IARPA, contract
`number D11PC20155, September 2011 to August 2016, $1,097,420, Principal Investigator.
`
`47. Probabilistic Alignment and Distributed Analytics, IARPA/AFRL FA8650-10-C-7060, Oct 1 2010 to
`Dec 31 2011, $334,537, Principal Investigator.
`
`46. Biomedical Informatics Training Program (supplement), award number NIH LM07443-10S1, 7/1/10-
`6/30/11, $153,485, Senior Personnel (PI: Pierre Baldi, UC Irvine).
`
`45. Automating Behavioral Coding via Text-Mining and Speech Signal Processing, National Institutes of
`Health, award number R01AA018673, $3.1 million, (UC Irvine portion is $953,952), Sept 1 2010 to
`August 31 2015, Co-Investigator (PI: David Atkins, University of Washington).
`
`44. UC Irvine Clinical Translational Science Center, National Institutes of Health, award number
`UL1RR031985, $7,075,320 awarded to date, July 1 2010 to March 31st 2015, Senior Personnel (PI:
`Dan Cooper, UC Irvine).
`
`43. Scaling Statistical Topic Modeling Algorithms to Massive Data Sets, Yahoo! Faculty Research (FREP)
`award, $10,000 gift, May 2010, Principal Investigator.
`
`42. Scalable Methods for the Analysis of Network-based Data, Office of Naval Research: Multidisciplinary
`University Research Initiative (MURI) Award, award number N00014-08-1-1015, $5,381,300, May 1
`2008 to April 30 2013, Principal Investigator.
`
`41. Scaling Statistical Topic Modeling Algorithms to Massive Data Sets, Google Research Award, $60,000,
`April 2008, Principal Investigator.
`
`40. Research in Cyber-Fraud Detection and Prevention, gift from Experian, Inc., $200,000, February 2008,
`Co-Principal Investigator with Michael Goodrich.
`
`39. Collaborative Research: Regional Climate-Change Projections Through Next-Generation Empirical and
`Dynamical Models, Department of Energy, Scientific Discovery through Advanced Computing: Climate
`Change Prediction, award number DE-FG02-07ER64429, $360,000, Oct 1 2007 to Sept 30 2010, Prin-
`cipal Investigator.
`
`38. CRI: Collaborative Research: Improving Experimental Computer Science with a Searchable Web Portal
`for Datasets, National Science Foundation, award number CNS-0551510, $400,000, March 15, 2006 to
`February 28, 2009, Co-Principal Investigator with Andrew McCallum (University of Massachusetts).
`
`37. Functional Biomedical Informatics Research Network (FBIRN), National Institutes of Health,
`U24RR021992, $23,992,092, from February 8th 2006 to November 30th 2010, Senior Personnel (PI:
`Steven Potkin, UC Irvine).
`
`36. Characterizing ITCZ Dynamics and Breakdown using Statistical Learning Methods and Satellite Data,
`National Science Foundation, award number ATM-0530926, $618,000, 10/1/2005 to 9/30/2008, Co-
`Investigator (PI: Gudrun Magnusdottir, UC Irvine).
`
`35. UC Irvine Knowledge Discovery Evaluation Challenge Project, Entity Analytics Division, International
`Business Machines (IBM), $73,430, 7/15/05 to 12/31/05, Principal Investigator.
`
`34. Bringing Probabilistic Text Mining Techniques to Historical Document Collections: An Early American
`Case Study, UCI CORCLR Award MI-05-06-14, $18,080, 7/1/2005 - 6/30/2006, Co-Investigator (PI:
`Sharon Block, UC Irvine).
`
`33. Transdisciplinary Imaging Genetics Center, NIH Grant No. 1-P20-RR020837-01, total award is
`$1,724,026, 9/28/04 to 7/31/07, Co-Investigator (PI: Steven Potkin, UC Irvine).
`
`32. National Alliance for Medical Image Computing (NAMIC), National Institutes of Health, award number
`NIH U54 EB005149, total UCI award is $609,253 from 9/17/04 to 8/31/06, Co-Investigator (PI: Ron
`Kikinis, Brigham and Women’s Hospital).
`
`31. Morphometry Biomedical Informatics Research Network (MBIRN), National Institutes of Health, U24-
`RR021382, total UCI award is $579,880 from 9/30/04 to 5/31/06, Co-Investigator (PI: Bruce Rosen,
`Massachusetts General Hospital).
`
`30. Studies of regional-scale climate variability and change: Hidden Markov models and coupled ocean-
`atmosphere modes, funded by the Climate Change Prediction Program, US Department of Energy,
`October 1st 2004 to September 30th 2007, Principal Investigator.
`
`29. Statistical Data Mining of Time-Dependent Data with Applications in Geoscience and Biology, NSF-IIS-
`0431085, National Science Foundation, $566,644, October 1st 2004 to September 30th 2007, Principal
`Investigator.
`
`28. NSF-ITR: Responding to the Unexpected, Information Technology Research (ITR) program, National
`Science Foundation, $9,480,928, award number NSF-ITR-0331707, October 1st 2003 to September 30th
`2008, Co-Investigator (PI: Sharad Mehrotra, UC Irvine).
`
`27. NSF-ITR: The OptIPuter, Information Technology Research (ITR) program, National Science Foun-
`dation, award number , $13,500,000, October 1st 2002 to September 30th 2007, Co-Investigator (PI:
`Larry Smarr, UCSD).
`
`26. Biomedical Informatics Training Program, National Institutes of Health and National Library of
`Medicine, award number T15-LM-07443, $8,840,297, July 1st 2002 to June 30th 2012, Senior Per-
`sonnel (PI: Pierre Baldi, UC Irvine).
`
`25. Predicting Coupled Ocean-Atmosphere Modes With A Climate Modeling Hierarchy, US Department
`of Energy: Climate Change Prediction Program, $396,000, February 1st 2002 to January 31st 2005,
`Co-Investigator (with Andrew Robertson and Michael Ghil, UCLA).
`
`24. Intelligent Time-Series Pattern Matching, Jet Propulsion Laboratory, June 15th to September 30th
`2002, $80,920, Principal Investigator.
`
`23. Preclinical Detection and Disease Measurement of Alzheimer’s Disease and Related Disorders Using
`EEG, Psychophysical and Data Mining Methods, Alzheimer’s Association of America, September 1st
`2001 to August 30th 2003, $250,000, Co-Investigator (PI: Rod Shankle, UC Irvine).
`
`22. Spatial Data Mining for Massive Scientific Data Sets from Lawrence Livermore National Laboratory,
`May 1st 2001 to August 31st 2002, $100,000, Principal Investigator.
`
`21. IBM Faculty Partnership Award, Gift from IBM Watson Research Center, May 18th 2001, $40,000,
`Principal Investigator.
`
`20. Data Mining of Digital Behavior, NSF-IIS-0083489, Principal Investigator:
`• Original award: September 15th 2001 to August 30th 2004, $425,000.
`• Supplemental award: September 1st 2003 to December 31st 2010, $1,816,750.
`
`19. Predictive Models for Cancer Detection and Therapy, November 1st 2000 to October 31st 2001, Uni-
`versity of California, Irvine, Cancer Research Grants, $14,301, Co-Investigator (PI: Christine McLaren,
`UC Irvine).
`
`18. Probabilistic Clustering of Dynamic Trajectories for Scientific Data Mining, Institute for Scientific
`Computer Research, Lawrence Livermore National Laboratory, October 1 2000 to September 30 2001,
`$39,178, Renewal: October 1 2001 to September 30 2002, $28,448, Principal Investigator.
`
`17. Sequential Data Analysis for Biomedical Applications, UCI CORCLR Program, July 1 2000 to June
`30th 2001, $12,000, Co-Investigator (PI: Christine McLaren, UC Irvine).
`
`16. Spatio-Temporal Data Mining of Scientific Trajectory Data, from Lawrence Livermore National Labo-
`ratory, March 1st to September 30th 2000, $42,937, Principal Investigator.
`
`15. Research in Data Mining, Gift from Microsoft Research, October 1999, $60,000, Principal Investigator.
`
`14. Data Mining of Multivariate Time-Series Sensor Data for Semiconductor Manufacturing, from
`NIST/National Semiconductor Corporation, April 1 1999 through Dec 31 2001, $162,000, Principal
`Investigator.
`
`13. Clustering of Sequences and Time Series, from HNC Software, Inc, $40,913, January 1 1999 through
`Dec 31 1999, Principal Investigator.
`
`12. SGER: An Online Repository of Large Data Sets for Data Mining Research and Experimentation,
`from NSF, NSF IIS-9813584, co-PIs Dennis Kibler, Michael Pazzani, Aug 15, 1998 to January 31,
`2000, $99,737, Principal Investigator.
`
`11. Data Mining of High-Dimensional Structure-Activity Data Sets, from SmithKline Beecham Research,
`September 1st 1998 to April 1st 1999, $22,730, Principal Investigator.
`
`10. Graduate Fellowships in Biomedical Computing, from US Department of Education, $750,000. Sept 1,
`1997 to August 31, 2001, Co-Investigator (PI: Lubomir Bic, UC Irvine).
`
`9. A Distributed Biomedical Computing Laboratory, from NSF (CISE Research Instrumentation), NSF-
`9617349, co-investigator with L. Bic et al. (University of California, Irvine), March 1 1997 to February
`1 1998, $69,986. Co-Investigator.
`
`8. Turbo-Decoding of High Performance Error-Correcting Codes via Belief Propagation, from AFOSR,
`grant F49620-97-1-0313, May 1 1997 to December 31 1998, $300,000. Co-Investigator (PI: Robert
`McEliece, Caltech).
`
`7. Automated Cloud Screening for Remote Exploration and Experimentation (REE) Applications to the
`Earth Orbiting-1 (EO-1) Satellite and Similar Platforms, from the Jet Propulsion Laboratory, June
`16th 1997 to November 15th 1997, $34,601, Principal Investigator.
`
`6. Exploring QSAR Data using Probabilistic Data Mining, from SmithKline Beecham Research, July 1st
`to December 31st 1997, $35,048, Principal Investigator.
`
`5. Probabilistic Knowledge Discovery and Data Mining: An Integrated Approach at the Interface of Com-
`puter Science and Statistics, from NSF (CAREER award), NSF-9703120, September 1st 1997 to August
`31st 2001, $304,379, Principal Investigator.
`
`4. Clustering and Mode Classification of Engineering Time Series Data, Jet Propulsion Laboratory, June
`15th 1996 to October 17th 1996, $34,401, Principal Investigator.
`
`3. Automated Detection of Natural Features in SAR Images, Jet Propulsion Laboratory Director’s Discre-
`tionary Fund, January 1st 1994 to December 31st 1994, $140,000, Co-Investigator with Usama Fayyad
`(JPL) and Pietro Perona (Caltech).
`
`2. Using Information Theory to Discover Patterns in Databases, Lew Allen Award research grant, Jet
`Propulsion Laboratory. January 1st 1994 to December 31st 1995, $25,000, Principal Investigator.
`
`1. An Information-Theoretic Approach to Distributed Inference and Learning, from DARPA, AFOSR,
`and ONR, Co-Investigator (PI: Rodney Goodman, Caltech).
`• Original award AFOSR-90-0199, February 1st 1990 to May 30th 1992, $338,161.
`• Continuation award N00014-92-J-1860: July 1st 1992 to March 30th 1995, $394,118.
`
`Publications List
`
`Books and Conference Proceedings
`
`B5 A. Nicholson and P. Smyth (eds.), Uncertainty in Artificial Intelligence: Proceedings of the 29th Con-
`ference, ISBN 978-0-9749039-9-6, AUAI Press, Corvallis, OR, 2013.
`
`B4 C. Apte, J. Ghosh, P. Smyth (eds.), Proceedings of the 17th ACM SIGKDD International Conference
`on Knowledge Discovery and Data Mining, ISBN 978-1-4503-0813-7, ACM Press, New York, NY, 2011.
`
`B3 Modeling the Internet and the Web: Probabilistic Methods and Algorithms, P. Baldi, P. Frasconi, and
`P. Smyth, John Wiley, June 2003.
`
`B2 Principles of Data Mining, D. Hand, H. Mannila, and P. Smyth, Cambridge, MA: MIT Press, 2001.
`
`B1 Advances in Knowledge Discovery and Data Mining, U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and
`R. Uthurasamy (eds.), Palo Alto, CA: AAAI/MIT Press, 1996.
`
`Journal Papers
`
`J67 T. Holsclaw, A. M. Greene, A. W. Robertson, P. Smyth, ‘Bayesian non-homogeneous Markov mod-
`els via Polya-Gamma data augmentation with applications to rainfall modeling’, Annals of Applied
`Statistics, in press.
`
`J66 G. Gaut, M. Steyvers, Z. E. Imel, D. C. Atkins, P. Smyth, ‘Content coding of psychotherapy transcripts
`using labeled topic models,’ IEEE Journal of Biomedical and Health Informatics, in press.
`
`J65 C. Haffke, G. Magnusdottir, D. Henke, P. Smyth, Y. Peings, ‘Daily states of the March-April east
`Pacific ITCZ in three decades of high-resolution satellite data,’ Journal of Climate, doi:10.1175/JCLI-
`D-15-0224.1, 29(8), 2981-2995, 2016.
`
`J64 P. Arnesen, T. Holsclaw, P. Smyth, ‘Bayesian detection of changepoints in finite-state Markov chains
`for multiple sequences,’ Technometrics, doi:10.1080/00401706.2015.1044118, 58(2), 205-213, 2016.
`
`J63 T. Holsclaw, A. Greene, A. W. Robertson, P. Smyth, ‘A Bayesian hidden Markov model of daily
`precipitation over South and East Asia,’ Journal of Hydrometeorology, doi:10.1175/JHM-D-14-0142.1,
`17(1):3–25, 2016.
`
`J62 T. Holsclaw, K. A. Hallgren, M. Steyvers, P. Smyth, D. C. Atkins, ‘Measurement error and outcome
`distributions: Methodological issues in regression analyses of behavioral coding data,’ Psychology of
`Addictive Behaviors, doi:10.1037/adb0000091, 29(4):1031-1040, 2015.
`
`J61 M. L. Salmans, Z. Yu, K. Watanabe, E. Cam, P. Sun, P. Smyth, X. Dai, B. Andersen, ‘The co-factor of
`LIM domains (CLIM/LDB/NLI) maintains basal mammary epithelial stem cells and promotes breast
`tumorigenesis,’ PLOS Genetics, July 2014, doi: 10.1371/journal.pgen.100452.
`
`J60 A. J. Frank, P. Smyth, A. T. Ihler, ‘Beyond MAP estimation with the track-oriented multiple hypothesis
`tracker,’ IEEE Transactions on Signal Processing, 62(9):2413–2423, 2014.
`
`J59 D. C. Atkins, M. Steyvers, Z. E. Imel, P. Smyth, ‘Scaling up the evaluation of psychotherapy: evaluating
`motivational interviewing fidelity via statistical text classification,’ Implementation Science, 9:49:1–11,
`2014.
`
`J58 C. DuBois, C. T. Butts, D. McFarland, P. Smyth, ‘Hierarchical models for relational event sequences,’
`Journal of Mathematical Psychology, 57(6):297–309, 2013.
`
`J57 N. Navaroli, C. DuBois, P. Smyth, ‘Modeling individual email patterns over time with latent variable
`models,’ Machine Learning, 92(2–3):431-455, May 2013.
`
`J56 M. Geyfman, V. Kumar, Q. Liu, R. Ruiz, W. Gordon, F. Espitia, E. Cam, S. E. Millar, P. Smyth,
`A. Ihler, J. Takahashi, B. Andersen, ‘Bmal1 controls circadian cell proliferation and susceptibility
`to UVB-induced DNA damage in the epidermis,’ Proceedings of the National Academy of Sciences,
`109(29):11758-63, doi:10.1073/pnas.1209592109, July 2012.
`
`J55 D. Henke, P. Smyth, C. Haffke, G. Magnusdottir, ‘Automated analysis of the temporal behavior of
`the double Intertropical Convergence Zone over the east Pacific,’ Remote Sensing of Environment,
`123:418–433, August 2012.
`
`J54 T. Rubin, A. Chambers, P. Smyth, and M. Steyvers, ‘Statistical topic models for multi-label document
`classification,’ Machine Learning, doi: 10.1007/s10994-011-5272-5, 88(1-2):157–208, July 2012.
`
`J53 B. Gretarsson, J. O’Donovan, S. Bostandjiev, T. Hollerer, A. Asuncion, D. Newman, and P. Smyth,
`‘TopicNets: Visual analysis of large text corpora with topic modeling,’ ACM Transactions on Intelligent
`Systems and Technology, 3(2):1–26, February 2012.
`
`J52 M. Steyvers, P. Smyth, and C. Chemudugunta, ‘Combining background knowledge and learned topics,’
`Topics in Cognitive Science, 3(1):18–47, January 2011.
`
`J51 A. M. Greene, A. W. Robertson, P. Smyth, and S. Triglia, ‘Downscaling projections of Indian monsoon
`rainfall using a nonhomogeneous hidden Markov model,’ Quarterly Journal of the Royal Meteorological
`Society, 137(655):347–359, January 2011.
`
`J50 T. T. Van Leeuwen, A. J. Frank, Y. Jin, P. Smyth, M. L. Goulden, G. R. van der Werf, J. T. Randerson,
`‘Optimal use of land surface temperature data to detect changes in tropical forest cover,’ Journal of
`Geophysical Research—Biogeosciences, 116, G02002, doi:10.1029/2010JG00148, 2011.
`
`J49 A. Asuncion, P. Smyth, and M. Welling, ‘Asynchronous distributed estimation of topic models for
`document analysis,’ Statistical Methodology, 8(1):3–17, January 2011.
`
`J48 C. Bain, G. Magnusdottir, P. Smyth, H. Stern, ‘The diurnal cycle of the intertropical convergence zone
`in the east Pacific,’ Journal of Geophysical Research, 115, D23116, doi:10.1029/2010JD014835, 2010.
`
`J47 C. Bain, J. DePaz, J. Kramer, G. Magnusdottir, P. Smyth, H. Stern, C-C. Wang, ‘Detecting the ITCZ
`in instantaneous satellite data using spatial-temporal statistical modeling: ITCZ climatology in the
`east Pacific,’ Journal of Climate, 138(6):2132-2148, 2010.
`
`J46 S. Kim, P. Smyth, and H. Stern, ‘A Bayesian mixture approach to modeling spatial activation patterns
`in multi-site fMRI data,’ IEEE Transactions on Medical Imaging, 29(6):1260–1274, June 2010.
`
`J45 L. Scharenbroich, G. Magnusdottir, P. Smyth, H. Stern and C. Wang, ‘A Bayesian framework for storm
`tracking using a hidden-state representation,’ Monthly Weather Review, 138(6):2132–2148, June 2010.
`
`J44 Q. Liu, K. K. Lin, B. Andersen, P. Smyth, and A. Ihler, ‘Estimating replicate time-shifts using Gaussian
`process regression,’ Bioinformatics, 26(6):770–776, 2010.
`
`J43 M. Rosen-Zvi, C. Chemudugunta, T. Griffiths, P. Smyth, and M. Steyvers, ‘Learning author-topic
`models from text corpora,’ ACM Transactions on Information Systems, 28(1):1–38, 2010.
`
`J42 D. Chudova, A. T. Ihler, K. K. Lin, B. Andersen, P. Smyth, ‘Bayesian detection of non-sinusoidal
`periodic patterns in circadian expression data,’ Bioinformatics, 25(23):3114–3120, 2009.
`
`J41 D. Newman, A. Asuncion, P. Smyth, and M. Welling, ‘Distributed algorithms for topic models,’ Journal
`of Machine Learning Research, 10:1801–1828, 2009.
`
`J40 K. K. Lin, V. Kumar, M. Geyfman, D. Chudova, A. T. Ihler, P. Smyth, R. Paus, J. S. Takahashi, B.
`Andersen, ‘Circadian clock genes contribute to the regulation of hair follicle cycling,’ PLOS Genetics,
`5(