`Department of Computer Science, Bren Hall 4216
`School of Information and Computer Sciences
`University of California, Irvine
`CA 92697-3435
`telephone: (949) 824 2558
`fax: (949) 824 4056
`email: smyth@ics.uci.edu
`
`Professional Positions
`
`April 1996–present: Professor, Department of Computer Science, University of California, Irvine
`• Chancellor’s Professor: 2018 to present
`• Full Professor: July 2003 to 2018
`• Associate Professor: July 1998 to June 2003
`• Assistant Professor: April 1996 to June 1998
`
`October 1988–March 1996: Member of Technical Staff and Technical Group Leader (from 1992), Jet
`Propulsion Laboratory, California Institute of Technology, Pasadena.
`
`Education
`
`PhD, 1988: California Institute of Technology, Department of Electrical Engineering.
`
`MSEE, 1985: California Institute of Technology, Department of Electrical Engineering.
`
`BE, 1984: National University of Ireland, University College Galway. Bachelor of Engineering (Electronic)
`with First-Class Honors.
`
`Additional Professional Roles and Affiliations
`
`Joint Faculty Appointments:
`• Department of Statistics, UC Irvine, July 2008–present.
`• Department of Education, UC Irvine, July 2017–present.
`• Department of Biomedical Engineering, UC Irvine, July 2001–2012.
`
`Founding Director, UCI Data Science Initiative, University of California, Irvine, July 2014–June 2018.
`
`Founding Director, Center for Machine Learning and Intelligent Systems, University of California, Irvine,
`January 2007–June 2014.
`
`Faculty Member, Institute for Genomics and Bioinformatics (IGB), UC Irvine, Member 2001–present.
`
`Faculty Member, Institute for Mathematical Behavioral Sciences (IMBS), UC Irvine, 1999-present.
`
`Faculty Member, Center for Digital Transformation, UC Irvine, 2012–present.
`
`Faculty Member, Program for Mathematical, Computational, and Systems Biology (MCB), UC Irvine,
`2007–present.
`
`Faculty Member, Center for Research on Information Technology and Organizations (CRITO), UC Irvine,
`2008–2012.
`
`1
`
`EX1009
`
`
`
`Founding Director and Executive Committee Member of the ACM Special Interest Group on Knowledge
`Discovery and Data Mining (SIGKDD), 1998.
`
`Visiting Principal Researcher, Jet Propulsion Laboratory, California Institute of Technology, Pasadena,
`1996–2001.
`
`Member of IEEE (1988–present), American Statistical Association (1997–present), and the Association for
`Computing Machinery (ACM) (1999–present).
`
`Honors and Awards
`
`Fellow, Association for Computing Machinery (ACM), 2013
`
`Fellow, Association for the Advancement of Artificial Intelligence (AAAI), 2010
`
`ACM SIGKDD Innovation Award, 2009
`
`Best paper awards: ACM SIGKDD Conference (best paper(1997, 2002), runner-up best paper (1998, 2000)),
`ACM/IEEE Joint Conference on Digital Libraries (JCDL) (shortlist for best paper, 2007), Educational
`Data Mining Conference (best paper, 2018)
`
`Qualcomm Faculty Award, 2019
`
`Google Faculty Research Awards, 2008 and 2014
`
`IBM Faculty Partnership Award, 2001.
`
`National Science Foundation CAREER award, 1997
`
`ACM Teaching Award, UC Irvine, 1997
`
`NASA Group Achievement award, Jet Propulsion Labaratory, 1997
`
`Lew Allen Award for Excellence in Research, Jet Propulsion Laboratory, 1993
`
`17 NASA Certificates for Technical Innovation (1991–1996)
`
`Consulting and Business Activities
`
`Consultant/Advisor to Toshiba (2018-present), First American (2018-present); ProLung, Inc (2017-present);
`Unified Patents (2016-present); University of Washington (2016-present); emnos Inc (2015-2016); Frost
`Data Capital (2014-2015); AS&T Inc (2013-2015); Samsung (2012-2015); SOCCCD (2012-present);
`DigitalRisk (2010-2012); CoreLogic (2011-2014); IdentityMetrics (2010-2012); Microsoft (2010-2011);
`ImageCat (2010); eBay (2009-2011); DataAnalytics LLC (2009-2011); Oracle (2008-2011); Netflix
`(2006-2009); Topicseek LLC (2005-2008); Yahoo!
`(2005-2008); Strativa (2005); IET (2004-2005);
`JWDirect (2001-2004); Credit Sciences (2000-2004); Nokia Research (2000); First Quadrant Finan-
`cial Services (1998-1999); Smith-Kline Beecham (1998); AT&T (1996-1998).
`
`Professional Activities
`
`Journals: Associate/Action Editor
`
`ACM Transactions on Knowledge Discovery and Data, guest editor of special issue on best papers from
`ACM SIGKDD 2011 Conference, TKDD 6(4), 2012.
`
`Journal of the American Statistical Association, 2002 to 2005.
`
`2
`
`
`
`IEEE Transactions on Knowledge and Data Engineering, 2002 to 2004.
`
`Machine Learning Journal, July 1998 to December 2001.
`
`Machine Learning Journal, guest editor of special issue on probabilistic learning, 1997.
`
`Journals, Book Series, Centers: Editorial Board/Advisory Board Member
`
`Journal of Machine Learning Research, 2000-present.
`
`Journal of Data Mining and Knowledge Discovery, 1997-present.
`
`Chapman and Hall: Series in Computer Science and Data Analysis, 2002-2008.
`
`Bayesian Analysis, 2004-2007.
`
`Insight Center for Data Analytics, University College Dublin, Scientific Advisory Member, 2015-present.
`
`Journals: Reviewer
`
`Reviewer for Science, Journal of Machine Learning Research, Communications of the ACM, Journal of
`the American Statistical Association, Bayesian Analysis, IEEE Transactions on Information Theory,
`IEEE Trans. on Neural Networks, IEEE Trans. on Signal Processing, IEEE Trans. on Circuits and
`Systems, IEEE Trans. on Pattern Analysis and Machine Intelligence, IEEE Trans. on Knowledge
`and Data Engineering, Statistics and Computing, Journal of Artificial Intelligence Research, Pattern
`Recognition Letters, Neural Networks, Machine Learning, ACM Transactions on Knowledge Discovery
`from Data.
`
`Conference Program and General Chair Positions
`
`Program Chair for the Uncertainty in Artificial Intelligence (UAI) Conference, 2013.
`
`Program Chair for 17th ACM SIGKDD Conference, San Diego, 2011.
`
`Program Chair for the Symposium on the Interface between Statistics and Computing, Costa Mesa, CA,
`June 2001.
`
`General Chair for the Sixth International Conference on Artificial Intelligence and Statistics, January 1997.
`
`Other Conference and Workshop Organization Roles
`
`Conference Organization Roles: Area Chair, ICML 2018, 2019; Senior Area Chair, NeurIPS 2017, 2018,
`2019; Panels Chair for ACM SIGKDD Fifth International Conference on Knowledge Discovery and
`Data Mining, 1999; Tutorials co-Chair for National Conference on Artificial Intelligence, 1998; Tutorials
`Chair for the ACM SIGKDD Conferences on Knowledge Discovery and Data Mining, 1997 and 1998;
`Publicity Chair for the ACM SIGKDD Conferences on Knowledge Discovery and Data Mining, 1995
`and 1996.
`
`Workshop Co-Chair/Organizer for: Dagstuhl Seminar, Automating Data Science, 2018; Workshop on Al-
`gorithmic and Statistical Approaches for Large Social Network Data Sets, NIPS Conference, Lake
`Tahoe, 2012; Workshop on User-Centered Modeling, Institute for Mathematics and its Applications
`(IMA), University of Minnesota, 2012.; Workshop on Scientific Data Mining, Institute for Pure and
`Applied Mathematics (IPAM), UCLA, 2002; Workshop on Temporal and Spatial Machine Learning,
`International Conference on Machine Learning (ICML), 2001; Massive Datasets workshop at the 1998
`Neural Information Processing Conference (NIPS).
`
`3
`
`
`
`Conference Reviewing and Program Committees
`
`Neural Information Processing Conference (NIPS), International Conference on Machine Learning (ICML),
`Uncertainty in Artificial Intelligence Conference (UAI), Artificial Intelligence and Statistics Conference
`(AI-Stats), European Conference on Machine Learning (ECML/PKDD), ACM Conference on Knowl-
`edge Discovery and Data Mining (SIGKDD), WWW Conference, International Conference on Pattern
`Recognition (ICPR), International Joint Conference on Artificial Intelligence (IJCAI), American As-
`sociation for Artificial Intelligence Conference (AAAI), Pattern Recognition in Practice Workshops.
`
`Postdoctoral Advisees
`
`Michal Rosen-Zvi, 2003-2004; IBM Research, Israel.
`Michael Duff, 2005-2006; Assistant Professor, Genetics/Developmental Biology, University of Connecticut.
`Alex Ihler, 2005-2006; Associate Professor, Department of Computer Science, UC Irvine.
`Romain Thibaux, 2008-2009; Google, Mountain View, CA
`Ralf Krestel, 2011-2013; Senior Researcher, Hasso-Plattner Institute, Potsdam, Germany.
`Tracy Holsclaw, 2011-2014; Consultant, San Jose, CA
`
`Graduate Students
`
`PhD Advisees and Current Positions
`
`Dimitris Kotzias, PhD 2018; Google, Zurich
`Eric Nalisnick, PhD 2018; Postdoc, Cambridge University
`Moshe Lichman, PhD 2017; Google, Irvine, CA
`Nick Navaroli, PhD 2014; Google, Irvine, CA
`Jimmy Foulds, PhD 2014: Assistant Professor, Department of Computer Science, UMBC
`Chris DuBois, PhD 2013: Apple, Seattle
`America Chambers, PhD 2013: Assistant Professor, Department of Mathematics and Computer Science,
`University of Puget Sound
`Drew Frank (co-advised with Alex Ihler), PhD 2013: Apple, Seattle
`Arthur Asuncion, PhD 2011: Google, Seattle, WA
`Jon Hutchins (co-advised with Alex Ihler), PhD 2010: Google, Pittsburgh, PA
`Chaitanya Chemudugunta, PhD 2009: Director, Data Science/Research, Pandora, CA
`Seyoung Kim, PhD 2007: Assistant Professor, Department of Bioinformatics, CMU, Pittsburgh
`Darya Chudova, PhD 2007: VP of Bioinformatics, Guardant Health, Redwood City, CA
`Sergey Kirshner, PhD 2005: Facebook, Menlo Park, CA
`Scott Gaffney, PhD 2004
`Xianping Ge, PhD 2002
`Igor V. Cadez, PhD 2002
`Dimitry Pavlov, PhD 2001: VP, Walmart Labs, Sunnyvale, CA
`
`Current PhD Students
`
`Advanced to Candidacy: Chris Galbraith (2017), Jihyun Park (2017)
`Pre-Candidacy: Disi Ji, Casey Graff, Robby Logan, Preston Putzel, Alex Boyd
`
`PhD Thesis Committee Member
`
`UC Irvine, Computer Science:
`Bailey Kong (2018), Phuc Nguyen (2018), Sam Hallman (2015), David Keator (2015), Qiang Liu (2014),
`Anoop Korattikara (2014), Levi Boyles (2014), Yutian Chen (2013), Lars Otten (2013), Chaitanya Desai
`
`4
`
`
`
`(2012), Hamed Pirsiavash (2012), Behzad Sajadi (2012), David Orendorff (2012), Pinaki Sinha (2011),
`Chloe Azencott (2010), Vibhav Gogate (2009), Radu Marinescu (2008), Robert Mateescu (2007),
`Bozhena Bidyuk (2005), Stephen Bay (2001), Irina Rish (1999), Chris Merz (1998), Pedro Domingos
`(1997).
`
`UC Irvine, Other Departments:
`Garren Gaut (Cognitive Science, 2018), Majid Janzamin (EECS, 2016), Sepide Sarachi (Civil and
`Environmental Engineering, 2015), Justin Chung (Informatics, 2015), Colene Haffke (Earth Systems
`Science, 2015), Kevin Heins (Statistics, 2014), Michael Salmans (Biological Chemistry, 2014), (Emma
`Spiro (Sociology, 2013), Zack Almquist (Sociology, 2013), Kim Aeling (Microbiology and Molecular
`Genetics, 2007), Bethany Knapp (Cognitive Science, 2002).
`
`Other Universities (External Committee Member or Examiner):
`Ramnath Balasubramanyan (CMU, 2013), Mindaugus Norkus (National University of Ireland, Galway,
`2013), Xuerei Wang (U Mass Amherst, 2009), Sangmin Oh (Georgia Tech, 2009), Carla Domencioni
`(UC Riverside, 2002), John Lindal (Caltech, 2000), Srinivas Aji (Caltech, 2000), David Babcock (Cal-
`tech, 2000), Gavin Horn (Caltech, 1999), Lonnie Chrisman (CMU, 1996), Michael Burl (Caltech, 1996),
`Barry Ambrose (Caltech, 1995), Zheng Zeng (Caltech, 1995).
`
`PhD Candidacy/Thesis Proposal Committees
`
`UC Irvine: Pouya Pezeshkpour, 2018 (EECS), Zhengli Zhao, 2018 (CS), Cory Scott, 2018 (CS), Ted Grover,
`2018 (Informatics), Daniel Ruiz, 2018 (Earth System Science), Garren Gaut, 2017 (Cognitive Sciences),
`Alex Parret , 2017 (Economics), Phuc Nguyen, 2016 (CS), Nolan Phillips, 2016 (Sociology), Brian Veg-
`atible, 2016 (Statistics), Igor Burago, 2016 (CS), Emmanouil Alimpertis, 2016 (EECS), Daniel Quang,
`2015 (CS), Bailey Kong, 2015 (CS), Sholeh Fourazan, 2015 (CS), Coral Wheeler, 2014 (Physics), Raul
`Diaz, 2014 (CS), Golnaz Ghiasi, 2014 (CS), Wei Ping, 2014 (CS), Sam Hallman, 2013 (CS), Peter
`Sadowski, 2013 (CS), William Lam, 2013 (CS), Justin Chung, 2013 (Informatics), Kevin Heins, 2012
`(Statistics), Zack Almquist, 2012 (Sociology), Ashley Payne, 2012 (Earth System Science), Ragupa-
`thyraj Valluvan, 2012 (EECS), Emma Spiro, 2012 (Sociology), Michael Salmans (Biological Chemistry),
`Colene Haffke, 2011 (Earth System Sciences), Tim Rubin, 2011 (Cognitive Sciences), Brendan Rogers,
`2011 (Earth System Sciences), Hamed Pirsiavash, 2011 (CS), Behzad Sajadi, 2011 (CS), Qiang Liu,
`2011 (CS), Anoop Korattikara, 2011 (CS), David Keator, 2010 (CS), Kenny Daily, 2010 (CS), Yutian
`Chen, 2009 (CS), Lars Otten, 2009 (CS), David Orendorff, 2009 (CS), Chloe Azencott, 2009 (CS),
`Chaitanya Desai, 2008 (CS), Pinaki Sinha, 2007 (CS), Guy Yosiphon, 2006 (ICS), Bo Gong, 2006
`(ICS), Lin Wu, 2005 (ICS), Yiming Ma, 2004 (ICS), Dawit Seid, 2004 (ICS), John Abatzoglu, 2004
`(Earth System Sciences), Suman Sundaresh, 2003 (ICS), Mingliang Li, 2002 (Economics), Ye Sun,
`2001 (ICS), Bethany Knapp, 2000 (Cognitive Science), Stephen Bay, 1999 (ICS), Daniel Billsus, 1998
`(ICS), Pei Suen, 1998 (ECE), Chris Merz, 1997 (ICS).
`
`Other Universities: Ramnath Balasubramanyan, 2012 (CMU), Srinivas Aji, 1999 (Caltech), Gavin Horn,
`1998 (Caltech), John Lindal, 1998 (Caltech).
`
`Masters Students Supervised
`
`UC Irvine, Information and Computer Science: Homer Strong (2016), Zach Butler (2016), Scott Crawford
`(2012), Corey Schaninger (2012), Scott Triglia (2011), Ajay Mishra (2008), Scott White (2006), Joshua
`O Madadhain (2006), Vasanth Kumar (2006), Sridevi Parise (2003), Naval Verma (2002), Wagner
`Truppel (2001), Scott Lundgren (1997).
`
`Royal Institute of Technology: Stefan Edlund (1997), Department of Numerical Analysis and Computing
`Science, Stockholm: Thesis entitled Methods for Cluster Analysis with Applications to Large NASA
`Data Sets.
`
`University of Freiburg: Daniel Henke (2007), Department of Computer Science, MS Diplom Thesis.
`
`5
`
`
`
`Research Grants, Contracts and Gifts
`
`68. Qualcomm Faculty Award, $75,000 (gift), May 2019.
`
`67. Innovation Center for Advancing Ecosystem Climate Solutions, California Strategic Growth Council,
`award number CCR20021, $4,604,140, 4/01/2019 to 3/31/2022, co-investigator (PI: Mike Goulden,
`Earth Systems Sciences, UCI).
`
`66. Hands-free Documentation in Clinical Practice, SAP, $172,000 (gift/sponsored project), October 2018,
`co-Principal Investigator (with Kai Zheng, Department of Informatics, UCI).
`
`65. TRIPODS-X: Data Science Frontiers in Climate Science, National Science Foundation, award number
`NSF-1839336, $300,000, Oct 1 2018 to Sept 30 2021, co-PI (PI: Efi Foufoula-Georgiou, Civil and
`Environmental Engineering, UCI).
`
`64. Large-Scale Classification Algorithms, eBay Labs, $30,000 (gift), Dec 1 2017, Principal Investigator.
`
`63. Support for Center for Machine Learning and Intelligent Systems, Cylance, $50,000 (gift), Dec 1 2017,
`Principal Investigator.
`
`62. Development of Computational Methods for Evaluating Patient-Doctor Communication, PCORI,
`$270,000 (UCI portion), award number ME-1602-34167, July 1 2017 to June 30th 2019, co-Investigator
`(PI: Zac Imel, U Utah).
`
`61. NRT-DESE: Team Science for Integrative Graduate Training in Data Science and Physical Science,
`NSF, award number NSF-1633631, Sep 15 2016 to Aug 31 2021, $2,967,150, Principal Investigator.
`
`60. Learning Individual Predictive Choice Models, Adobe Research Award, $50,000, October 2016, Princi-
`pal Investigator.
`
`59. Transformative Computational Infrastructures for Cell-Based Biomarker Diagnostics, NIH, award num-
`ber U01TR001801-01, 09/01/16
`08/31/21, $766,000 (UCI portion), co-Investigator (PI: Richard
`Scheuermann, Venter Institute/UCSD).
`
`58. The Big DIPA: Data Image Processing and Analysis, NIH BD2K Program, award number
`1R25EB022366-01, $486,000, Sept 30 2015 to June 30th 2018, co-Investigator (UCI PI: Charless
`Fowlkes).
`
`57. Investigating Virtual Learning Environments, National Science Foundation, award number NSF-
`1535300, $2,500,000, Oct 1 2015 to Sept 30th 2020, co-Investigator (UCI PI: Mark Warschauer).
`
`56. Forensic Science Center of Excellence, National Institute of Standards and Technology (NIST), award
`number 70NANB15H176, $20,000,000 ($4,000,000 for UC Irvine), Oct 1 2015 to Sept 30th 2020, co-
`Investigator (UCI PI: Hal Stern).
`
`55. Data-Intensive Research and Education Center in Science, Technology, Engineering, and Mathematics
`(DIRECT-STEM), NASA MIRO program, award number NNX15AQ06A, $5,000,000 ($1,250,000 for
`UC Irvine), Sept 1 2015 to Aug 31st 2020, Principal Investigator.
`
`54. Analyzing Individual Event Data over Time, Google Faculty Research Award, $60,000, March 2014,
`Principal Investigator.
`
`53. Peer Assessment and Academic Achievement in a Gateway MOOC, Bill and Melinda Gates Foundation,
`Oct 1 2013, $25,000, Co-Investigator (PI: Mark Warschauer, UC Irvine).
`
`52. Statistical Learning Algorithms for Micro-Event Time Series Data, National Science Foundation, award
`number IIS-1320527, Oct 1 2013 to Sept 30th 2018, $499,880, Principal Investigator.
`
`51. Balancing the Portfolio: Efficiency and Productivity of Federal Biomedical R&D Funding, National Sci-
`ence Foundation, award number 1158699, Aug 15 2012 to July 31 2015, $297,331, Principal Investigator
`(original PI, David Newman).
`
`6
`
`
`
`50. Location-based Social Media for Context-based Analysis of Transportation Data, Xerox UAC Research
`Award, Jan 1st 2013 to Dec 31st 2015, $90,000 gift, Principal Investigator.
`
`49. Collaborative Research, Type 1: Decadal Prediction and Stochastic Simulation of Hydroclimate over
`Monsoonal Asia, US Department of Energy, award number DOE SC0006619, Sept 1st 2011 to August
`31st 2014, $180,000, Co-Investigator (PI: Andrew Robertson, Columbia University).
`
`48. Copernicus: System for Foresight and Understanding from Scientific Exposition, IARPA, contract
`number D11PC20155, September 2011 to August 2016, $1,097,420, Principal Investigator.
`
`47. Probabilistic Alignment and Distributed Analytics, IARPA/AFRL FA8650-10-C-7060, Oct 1 2010 to
`Dec 31 2011, $334,537, Principal Investigator.
`
`46. Biomedical Informatics Training Program (supplement), award number NIH LM07443-10S1, 7/1/10-
`6/30/11, $153,485, Senior Personnel (PI: Pierre Baldi, UC Irvine).
`
`45. Automating Behavioral Coding via Text-Mining and Speech Signal Processing, National Institutes of
`Health, award number R01AA018673, $3.1 million, (UC Irvine portion is $953,952), Sept 1 2010 to
`August 31 2015, Co-Investigator (PI: David Atkins, University of Washington).
`
`44. UC Irvine Clinical Translational Science Center, National Institutes of Health, award number
`UL1RR031985, $7,075,320 awarded to date, July 1 2010 to March 31st 2015, Senior Personnel (PI:
`Dan Cooper, UC Irvine).
`
`43. Scaling Statistical Topic Modeling Algorithms to Massive Data Sets, Yahoo! Faculty Research (FREP)
`award, $10,000 gift, May 2010, Principal Investigator.
`
`42. Scalable Methods for the Analysis of Network-based Data, Office of Naval Research: Multidisciplinary
`University Research Initiative (MURI) Award), award number N00014-08-1-1015, $5,381,300, May 1
`2008 to April 30 2013, Principal Investigator.
`
`41. Scaling Statistical Topic Modeling Algorithms to Massive Data Sets, Google Research Award, $60,000,
`April 2008, Principal Investigator.
`
`40. Research in Cyber-Fraud Detection and Prevention, gift from Experian, Inc., $200,000, February 2008,
`Co-Principal Investigator with Michael Goodrich.
`
`39. Collaborative Research: Regional Climate-Change Projections Through Next-Generation Empirical and
`Dynamical Models, Department of Energy, Scientific Discovery through Advanced Computing: Climate
`Change Prediction, award number DE-FG02-07ER64429, $360,000, Oct 1 2007 to Sept 30 2010, Prin-
`cipal Investigator.
`
`38. CRI: Collaborative Research: Improving Experimental Computer Science with a Searchable Web Portal
`for Datasets, National Science Foundation, award number CNS-0551510, $400,000, March 15, 2006 to
`February 28, 2009, Co-Principal Investigator with Andrew McCallum (University of Massachusetts).
`
`Institutes of Health,
`37. Functional Biomedical Informatics Research Network (FBIRN), National
`U24RR021992, $23,992,092, from February 8th 2006 to November 30th 2010, Senior Personnel (PI:
`Steven Potkin, UC Irvine).
`
`36. Characterizing ITCZ Dynamics and Breakdown using Statistical Learning Methods and Satellite Data,
`National Science Foundation, award number ATM-0530926, $618,000, 10/1/2005 to 9/30/2008, Co-
`Investigator (PI: Gudrun Magnusdottir, UC Irvine).
`
`35. UC Irvine Knowledge Discovery Evaluation Challenge Project, Entity Analytics Division, International
`Business Machines (IBM), $73,430, 7/15/05 to 12/31/05, Principal Investigator.
`
`34. Bringing Probabilistic Text Mining Techniques to Historical Document Collections: An Early American
`Case Study, UCI CORCLR Award MI-05-06-14, $18,080, 7/1/2005 - 6/30/2006, Co-Investigator (PI:
`Sharon Block, UC Irvine).
`
`7
`
`
`
`1-P20-RR020837-01, total award is
`33. Transdisciplinary Imaging Genetics Center, NIH Grant No.
`$1,724,026, 9/28/04 to 7/31/07, Co-Investigator (PI: Steven Potkin, UC Irvine).
`
`32. National Alliance for Medical Image Computing (NAMIC), National Institutes of Health, award number
`NIH U54 EB005149, total UCI award is $609,253 from 9/17/04 to 8/31/06, Co-Investigator (PI: Ron
`Kikinis, Brigham and Women’s Hospital).
`
`31. Morphometry Biomedical Informatics Research Network (MBIRN), National Institutes of Health, U24-
`RR021382, total UCI award is $579,880 from 9/30/04 to 5/31/06, Co-Investigator (PI: Bruce Rosen,
`Massachusetts General Hospital).
`
`30. Studies of regional-scale climate variability and change: Hidden Markov models and coupled ocean-
`atmosphere modes, funded by the Climate Change Prediction Program, US Department of Energy,
`October 1st 2004 to September 30th 2007, Principal Investigator.
`
`29. Statistical Data Mining of Time-Dependent Data with Applications in Geoscience and Biology, NSF-IIS-
`0431085, National Science Foundation, $566,644, October 1st 2004 to September 30th 2007, Principal
`Investigator.
`
`28. NSF-ITR: Responding to the Unexpected, Information Technology Research (ITR) program, National
`Science Foundation, $9,480,928, award number NSF-ITR-0331707, October 1st 2003 to September 30th
`2008, Co-Investgator (PI: Sharad Mehrotra, UC Irvine).
`
`27. NSF-ITR: The OptIPuter, Information Technology Research (ITR) program, National Science Foun-
`dation, award number , $13,500,000, October 1st 2002 to September 30th 2007, Co-Investigator (PI:
`Larry Smarr, UCSD).
`
`26. Biomedical Informatics Training Program, National Institutes of Health and National Library of
`Medicine, award number T15-LM-07443, $8,840,297, July 1st 2002 to June 30th 2012, Senior Per-
`sonnel (PI: Pierre Baldi, UC Irvine).
`
`25. Predicting Coupled Ocean-Atmosphere Modes With A Climate Modeling Hierarchy, US Department
`of Energy: Climate Change Prediction Program, $396,000, February 1st 2002 to January 31st 2005,
`Co-Investigator (with Andrew Robertson and Michael Ghil, UCLA).
`
`24. Intelligent Time-Series Pattern Matching, Jet Propulsion Laboratory, June 15th to September 30th
`2002, $80,920, Principal Investigator.
`
`23. Preclinical Detection and Disease Measurement of Alzheimer’s Disease and Related Disorders Using
`EEG, Psychophysical and Data Mining Methods, Alzheimer’s Association of America, September 1st
`2001 to August 30th 2003, $250,000, Co-Investigator (PI: Rod Shankle, UC Irvine).
`
`22. Spatial Data Mining for Massive Scientific Data Sets from Lawrence Livermore National Laboratory,
`May 1st 2001 to August 31st 2002, $100,000, Principal Investigator.
`
`21. IBM Faculty Partnership Award, Gift from IBM Watson Research Center, May 18th 2001, $40,000,
`Principal Investigator.
`
`20. Data Mining of Digital Behavior, NSF-IIS-0083489, Principal Investigator:
`• Original award: September 15th 2001 to August 30th 2004, $425,000.
`• Supplemental award: September 1st 2003 to December 31st 2010, $1,816,750.
`
`19. Predictive Models for Cancer Detection and Therapy, November 1st 2000 to October 31st 2001, Uni-
`versity of California, Irvine, Cancer Research Grants, $14,301, Co-Investigator (PI: Christine McLaren,
`UC Irvine).
`
`18. Probabilistic Clustering of Dynamic Trajectories for Scientific Data Mining, Institute for Scientific
`Computer Research, Lawrence Livermore National Laboratory, October 1 2000 to September 30 2001,
`$39,178, Renewal: October 1 2001 to September 30 2002, $28,448, Principal Investigator.
`
`8
`
`
`
`17. Sequential Data Analysis for Biomedical Applications, UCI CORCLR Program, July 1 2000 to June
`30th 2001, $12,000, Co-Investigator (PI: Christine McLaren, UC Irvine).
`
`16. Spatio-Temporal Data Mining of Scientific Trajectory Data, from Lawrence Livermore National Labo-
`ratory, March 1st to September 30th 2000, $42,937, Principal Investigator.
`
`15. Research in Data Mining, Gift from Microsoft Research, October 1999, $60,000, Principal Investigator.
`
`from
`14. Data Mining of Multivariate Time-Series Sensor Data for Semiconductor Manufacturing,
`NIST/National Semiconductor corporation, April 1 1999 through Dec 31 2001, $162,000, Principal
`Investigator.
`
`13. Clustering of Sequences and Time Series, from HNC Software, Inc, $40,913, January 1 1999 through
`Dec 31 1999, Principal Investigator.
`
`12. SGER: An Online Repository of Large Data Sets for Data Mining Research and Experimentation,
`from NSF, NSF IIS-9813584, co-PIs Dennis Kibler, Michael Pazzani, Aug 15, 1998 to January 31,
`2000, $99,737, Principal Investigator.
`
`11. Data Mining of High-Dimensional Structure-Activity Data Sets, from SmithKline Beecham Research,
`September 1st 1998 to April 1st 1999, $22,730, Principal Investigator.
`
`10. Graduate Fellowships in Biomedical Computing, from US Department of Education, $750,000. Sept 1,
`1997 to August 31, 2001, Co-Investigator (PI: Lubomir Bic, UC Irvine).
`
`9. A Distributed Biomedical Computing Laboratory, from NSF (CISE Research Instrumentation), NSF-
`9617349, co-investigator with L. Bic et al. (University of California, Irvine), March 1 1997 to February
`1 1998, $69,986. Co-Investigator.
`
`8. Turbo-Decoding of High Performance Error-Correcting Codes via Belief Propagation, from AFOSR,
`grant F49620-97-1-0313, May 1 1997 to December 31 1998, $300,000. Co-Investigator (PI: Robert
`McEliece, Caltech).
`
`7. Automated Cloud Screening for Remote Exploration and Experimentation (REE) Applications to the
`Earth Orbiting-1 (EO-1) Satellite and Similar Platforms, from the Jet Propulsion Laboratory, June
`16th 1997 to November 15th 1997, $34,601, Principal Investigator.
`
`6. Exploring QSAR Data using Probabilistic Data Mining, from SmithKline Beecham Research, July 1st
`to December 31st 1997, $35,048, Principal Investigator.
`
`5. Probabilistic Knowledge Discovery and Data Mining: An Integrated Approach at the Interface of Com-
`puter Science and Statistics, from NSF (CAREER award), NSF-9703120, September 1st 1997 to August
`31st 2001, $304,379, Principal Investigator.
`
`4. Clustering and Mode Classification of Engineering Time Series Data, Jet Propulsion Laboratory, June
`15th 1996 to October 17th 1996, $34,401, Principal Investigator.
`
`3. Automated Detection of Natural Features in SAR Images, Jet Propulsion Laboratory Director’s Discre-
`tionary Fund, January 1st 1994 to December 31st 1994, $140,000, Co-Investigator with Usama Fayyad
`(JPL) and Pietro Perona (Caltech).
`
`2. Using Information Theory to Discover Patterns in Databases, Lew Allen Award research grant, Jet
`Propulsion Laboratory. January 1st 1994 to December 31st 1995, $25,000, Principal Investigator.
`
`1. An Information-Theoretic Approach to Distributed Inference and Learning, from DARPA, AFOSR,
`and ONR, Co-Investigator (PI: Rodney Goodman, Caltech).
`• Original award AFOSR-90-0199, February 1st 1990 to May 30th 1992, $338,161.
`• Continuation award NOOO14-92-J-1860: July 1st 1992 to March 30th 1995, $394,118.
`
`9
`
`
`
`Publications List
`
`Books and Conference Proceedings
`
`B5 A. Nicholson and P. Smyth (eds.), Uncertainty in Artificial Intelligence: Proceedings of the 29th Con-
`ference, ISBN 978-0-9749039-9-6, AUAI Press, Corvallis, OR, 2013.
`
`B4 C. Apte, J. Ghosh, P. Smyth (eds.), Proceedings of the 17th ACM SIGKDD International Conference
`on Knowledge Discovery and Data Mining, ISBN 978-1-4503-0813-7, ACM Press, New York, NY, 2011.
`
`B3 Modeling the Internet and the Web: Probabilistic Methods and Algorithms, P. Baldi, P. Frasconi, and
`P. Smyth, John Wiley, June 2003.
`
`B2 Principles of Data Mining, D. Hand, H. Mannila, and P. Smyth, Cambridge, MA: MIT Press, 2001.
`
`B1 Advances in Knowledge Discovery and Data Mining, U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and
`R. Uthurasamy (eds.), Palo Alto, CA: AAAI/MIT Press, 1996.
`
`Journal Papers
`
`J70 D. Kotzias, M. Lichman, and P. Smyth, ‘Predicting consumption patterns with repeated and novel
`events,’ IEEE Transactions on Knowledge and Data Engineering, 30(8), 1-14, 2018.
`
`J69 J. R. Hipp, C. Bates, M. Lichman, and P. Smyth, ‘Using social media to measure temporal ambient
`population: does it help explain local crime rates?’ Justice Quarterly, March 2018.
`
`J68 C. Galbraith and P. Smyth, ‘Analyzing user-event data using score-based likelihood ratios with marked
`point processes,’ Journal of Digital Investigation, 22, 106-114, 2017.
`
`J67 T. Holsclaw, A. M. Greene, A. W. Robertson, P. Smyth, ‘Bayesian non-homogeneous Markov mod-
`els via Polya-Gamma data augmentation with applications to rainfall modeling’, Annals of Applied
`Statistics, 11(1):393–426, 2017.
`
`J66 G. Gaut, M. Steyvers, Z. E. Imel, D. C. Atkins, P. Smyth, ‘Content coding of psychotherapy transcripts
`using labeled topic models,’ IEEE Journal of Biomedical and Health Informatics, 21(2):476–487, 2017.
`
`J65 C. Haffke, G. Magnusdottir, D. Henke, P. Smyth, Y. Peings, ‘Daily states of the March-April east
`Pacific ITCZ in three decades of high-resolution satellite data,’ Journal of Climate, doi:10.1175/JCLI-
`D-15-0224.1, 29(8):2981-2995, 2016.
`
`J64 P. Arnesen, T. Holsclaw, P. Smyth, ‘Bayesian detection of changepoints in finite-state Markov chains
`for multiple sequences,’ Technometrics, doi:10.1080/00401706.2015.1044118, 58(2), 205-213, 2016.
`
`J63 T. Hoslclaw, A. Greene, A. R. Robertson, P. Smyth, ‘A Bayesian hidden Markov model of daily
`precipitation over South and East Asia,’ Journal of Hydrometeorology, doi:10.1175/JHM-D-14-0142.1,
`17(1):3–25, 2016.
`
`J62 T. Hoslclaw, K. A. Hallgren, M. Steyvers, P. Smyth, D. C. Atkins, ‘Measurement error and outcome
`distributions: Methodological issues in regression analyses of behavioral coding data,’ Psychology of
`Addictive Behaviors, doi:10.1037/adb0000091, 29(4):1031-1040, 2015
`
`J61 M. L. Salmans, Z. Yu, K. Watanabe, E. Cam, P. Sun, P. Smyth, X. Dai, B. Andersen, ‘The co-factor of
`LIM domains (CLIM/LDB/NLI) maintains basal mammary epithelial stem cells and promotes breast
`tumorigenesis,’ PLOS Genetics, July 2014, doi: 10.1371/journal.pgen.100452.
`
`J60 A. J. Frank, P. Smyth, A. T. Ihler, ‘Beyond MAP estimation with the track-oriented multiple hypothesis
`tracker,’ IEEE Transactions on Signal Processing, 62(9):2413–2423, 2014.
`
`J59 D. C. Atkins, M. Steyvers, Z. E. Imel, P. Smyth, ‘Scaling up the evaluation of psychotherapy: evaluating
`motivational interviewing fidelity via statistical text classification,’ Implementation Science, 9:49:1–11,
`2014.
`
`10
`
`
`
`J58 C. DuBois, C. T. Butts, D. McFarland, P. Smyth, ‘Hierarchical models for relational event sequences,’
`Journal of Mathematical Psychology, 57(6):297–309, 2013.
`
`J57 N. Navaroli, C. DuBois, P. Smyth, ‘Modeling individual email patterns over time with latent variable
`models,’ Machine Learning, 92(2–3):431-455, May 2013.
`
`J56 M. Geyfman, V. Kumar, Q. Liu, R. Ruiz, W. Gordon, F. Espitia, E. Cam, S. E. Millar, P. Smyth,
`A. Ihler, J. Takahashi, B. Andersen, ‘Bmal1 controls circadian cell proliferation and susceptibility
`to UVB-induced DNA damage in the epidermis,’ Proceedings of the National Academies of Science,
`109(29):11758-63, doi:10.1073/pnas.1209592109, July 2012.
`
`J55 D. Henke, P. Smyth, C. Haffke, G. Magnusdottir, ‘Automated analysis of the temporal behavior of
`the double Intertropical Convergence Zone over the east Pacific,’ Remote Sensing of Environment,
`123:418–433, August 2012.
`
`J54 T. Rubin, A. Chambers, P. Smyth, and M. Steyvers, ‘Statistical topic models for multi-label document
`classification,’ Machine Learning, doi: 10.1007/s10994-011-5272-5, 88(1-2):157–208, July 2012.
`
`J53 B. Gretarsson, J. O’ Donovan, S. Bostandjiev, T. Hollerer, A. Asuncion, D. Newman, and P. Smyth,
`‘TopicNets: Visual analysis of large text corpora with topic modeling,’ ACM Transactions on Intelligent
`Systems and Technology, 3(2):1–26, February 2012.
`
`J52 M. Steyvers, P. Smyth, and C. Chemudugunta, ‘Combining background knowledge and learned topics,’
`Topics in Cognitive Science, 3(1):18–47, January 2011.
`
`J51 A. M. Greene, A. W. Robertson, P. Smyth, and S. Triglia, ’Downscaling projections of Indian monsoon
`rainfall using a nonhomogeneous hidden Markov model,’ Quarterly Journal of the Royal Meteorological
`Society, 137(655):347–359, January 2011.
`
`J50 T. T. Van Leeuwen, A. J. Frank, Y. Jin, P. Smyth, M. L. Goulden, G. R. va