*Chronologically ordered by month starting with January – December


1. Hicks, J.L., Althoff, T.A. Sosic, R., King, A.C., Leskovec, J., Delp, S.L. Best practices for analyzing large-scale health data from commercial wearables and smartphone apps. Accepted for publication in npj Digital Medicine. PMID: 31304391. PMCID: PMC6550237 DOI: 10.1038/s41746-019-0121-1

2. De Sa C, Gu A, Puttagunta R, Ré C, Rudra A. A two-pronged progress in structured dense matrixvector multiplicationProc Annu ACM SIAM Symp Discret Algorithms. 2018;2018:1060–1079. PMCID: PMC6534155

3. Training Complex Models with Multi-Task Weak Supervision. Alex Ratner, Braden Hancock, Jared Dunnmon, Frederic Sala, Shreyash Pandey, Christopher Ré. AAAI 2019. PMCID: PMC6765366

4. Low-Precision Random Fourier Features for Memory-Constrained Kernel Approximation. Jian Zhang, Avner May, Tri Dao, Christopher Ré. International Conference on Artificial Intelligence and Statistics (AISTATS) 2019. March 2019. PMCID: Pending 

5. Scene Graph Prediction with Limited Labels. Vincent S Chen, Paroma Varma, Ranjay Krishna, Michael Bernstein, C. Ré, Li Fei-Fei. ICCV 2019. PMCID: Pending

6. Weakly supervised classification of rare aortic valve malformations using unlabeled cardiac MRI sequences. Jason Fries et al. Nature Comms 2019. PMID: 31308376 PMCID: PMC6629670 DOI: 10.1038/s41467-019-11012-3

7. Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations. Tri Dao, Albert Gu, Matthew Eichhorn, Atri Rudra, C. Ré. ICML 2019. PMCID: Pending

8.  A Kernel Theory of Modern Data Augmentation. Tri Dao, Albert Gu, Alexander J. Ratner, Virginia Smith, Christopher De Sa, C. Ré. ICML 2019. PMCID: Pending

9. Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale. Stephen Bach et al. SIGMOD 2019. PMCID: Pending

10. Connecting the legs with a spring improves human running economyCole S. SimpsonCara G. WelkerScott D. Uhlrich, et al. 

11. Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks. S. Kumar, X. Zhang, J. Leskovec. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2019. PMCID: PMC6752886

12. Goal-setting And Achievement In Activity Tracking Apps: A Case Study Of MyFitnessPal. M. Gordon, T. Althoff, J. Leskovec. ACM International Conference on World Wide Web (WWW), 2019. PMCID: Pending

13. Predicting pregnancy using large-scale data from a women’s health tracking mobile application. B. Liu, S. Shi, Y. Wu, D. Thomas, L. Sumul, E. Pierson, J. Leskovec. ACM International Conference on World Wide Web (WWW), 2019. PMCID: PMC6752881

14. Inferring Multidimensional Rates of Aging from Cross-Sectional Data. E. Pierson, P. W. Koh, T. Hashimoto, D. Koller, J. Leskovec, N. Eriksson, P. Liang. Artificial Intelligence and Statistics Conference (AISTATS), 2019. PMCID: PMC6752884

15. Learning Mixed-Curvature Representations in Product Spaces. Beliz Gunel, Albert Gu, C. Ré. Fred Sala. ICLR 2019. PMCID: Pending


1.Prieto LP, Sharma K, Kidzinski Ł, Rodríguez‐Triana MJ, Dillenbourg P. Multimodal teaching analytics: Automated extraction of orchestration graphs from wearable sensor data. J Comput Assist Learn. 2018;34:193–203. PMCID: PMC5909982 NIHMSID: NIHMS932156 PMID: 29686446

2. Agrawal M, Zitnik M, Leskovec J. Large-scale analysis of disease pathways in the human interactome. Pacific Symposium on Biocomputing, 2018. PMCID: PMC5731453

3. Łukasz Kidziński, Sharada P. Mohanty, Carmichael Ong, Jennifer L. Hicks, Sean F. Carroll, Sergey Levine, Marcel Salathé, Scott L. Delp, “Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning” NIPS 2017 Competition Book, Springer, 2018.

4. Łukasz Kidziński, Sharada Mohanty, Carmichael Ong, Zhewei Huang, Shuchang Zhou, Anton Pechenko, Adam Stelmaszczyk, Piotr Jarosik, Mikhail Pavlov, Sergey Kolesnikov, Sergey Plis, Zhibo Chen, Zhizheng Zhang, Jiale Chen, Jun Shi, Zhuobin Zheng, Chun Yuan, Zhihui Lin, Henryk Michalewski, Piotr Miłoś, Błażej Osiński, Andrew Melnik, Malte Schilling, Helge Ritter, Sean Carroll, Jennifer Hicks, Sergey Levine, Marcel Salathé, Scott Delp “Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments” NIPS 2017 Competition Book, Springer, 2018.

5.  I’ll Be Back: On the Multiple Lives of Users of a Mobile Activity Tracking Application. Z. Lin, T. Althoff, J. Leskovec. ACM International Conference on World Wide Web (WWW), 2018. PMCID: PMC5959281

6. Modeling Individual Cyclic Variation in Human Behavior. E. Pierson, T. Althoff, J.Leskovec. ACM International Conference on World Wide Web (WWW), 2018. PMCID: PMC5959299

7. Modeling Interdependent and Periodic Real-World Action Sequences. T. Kurashima, T. Althoff, J. Leskovec. ACM International Conference on World Wide Web (WWW), 2018. PMCID: PMC5959287

8. Physical activity is associated with changes in knee cartilage microstructure. E.Halilaj, T.J.Hastie, G.E.Gold, S.L.Delp. doi: 10.1016/j.joca.2018.03.009. 2018. PMID: 29605382 PMCID: PMC6086595

9. Scott Powers, Junyang Qian, Kenneth Jung, Alejandro Schuler, Nigam Shah, Trevor Hastie and Robert Tibshirani. Some methods for heterogeneous treatment effect estimation in high-dimensions  Statistics in Medicine. (January 2018) PMCID: PMC5938172

10. Weakly supervised classification of rare aortic valve malformations using unlabeled cardiac MRI sequences. Jason Alan Fries, Paroma Varma, Vincent Chen, Ke Xiao, Heliodoro Tejeda, SahaPriyanka, Jared Dunnmon, Henry Chubb, Shiraz Maskatia, Madalina Fiterau, Scot Delp, Euan Ashley, Christopher Ré, James Priest. bioRxiv 339630; doi: PMCID: PMC6629670. PMID: 31308376

11. Training Classifiers with Natural Language Explanations. Braden Hancock, Paroma Varma, Stephanie Wang, Percy Liang, C. Ré. ACL18. PMCID:6534135

12. Representation Tradeoffs for Hyperbolic Embeddings Christopher De Sa, Albert Gu, C. Ré, Frederic Sala. ICML18. PMCID: PMC6534139 NIHMSID: NIHMS993800 PMID: 31131375

13. Learning Compressed Transforms with Low Displacement Rank. Anna T. Thomas*, Albert Gu*, Tri Dao, Atri Rudra, Christopher Ré. NIPS18. PMCID: PMC6534145 NIHMSID: NIHMS993802 PMID: 31130799

14. De Sa C, He B, Mitliagkas I, Ré C, Xu P. Accelerated Stochastic Power IterationProc Mach Learn Res. 2018;84:58–67. PMCID: PMC6557638 NIHMSID: NIHMS993807 PMID: 31187095

15. Acute changes in foot strike pattern and cadence affect running parameters associated with tibial stress fracturesJournal of biomechanics. Yong, J. R., Silder, A., Montgomery, K. L., Fredericson, M., Delp, S. L.2018. PMID: 29866518 DOI: 10.1016/j.jbiomech.2018.05.017. PMCID: PMC6203338

16. Exploring the Utility of Developer Exhaust. Jian Zhang, Max Lam, Stephanie Wang, Paroma Varma, Luigi Nardi, Kunle Olukotun and C. Ré. DEEM 2018PMCID: PMC6534136 NIHMSID: NIHMS993811 PMID: 31131381

17. Snorkel MeTaL: Weak Supervision for Multi-Task Learning. Alex Ratner, Braden Hancock, Jared Dunnmon, Roger Goldman, C.Ré. DEEM 2018. PMCID: PMC6436830

18. Machine learning in human movement biomechanics: Best practices, common pitfalls, and new opportunitiesJournal of Biomechanics. Eni Halilaj, Apoorva Rajagopal, Madalina Fiterau, Jennifer L. Hicks, Trevor J. Hastie, Scott L. Delp. 8 September 2018. PMID: 30279002 DOI: 10.1016/j.jbiomech.2018.09.009 PMCID: Pending

19. Physical activity is associated with changes in knee cartilage microstructure. Osteoarthritis and cartilage. Halilaj, E., Hastie, T. J., Gold, G. E., Delp, S. L.2018; 26 (6): 770–74. PMID: 29605382 PMCID: PMC6086595 DOI: 10.1016/j.joca.2018.03.009

20. Modeling and Predicting Osteoarthritis Progression: Data from the Osteoarthritis Initiative. Osteoarthritis and cartilage. Halilaj, E., Le, Y., Hicks, J. L., Hastie, T. J., Delp, S. L. 2018.PMID: 30130590 PMCID: PMC6469859 [Available on 2019-12-01] DOI: 10.1016/j.joca.2018.08.003

21. Some methods for heterogeneous treatment effect estimation in high dimensions. Scott Powers, Junyang Qian, Kenneth Jung, Alejandro Schuler, Nigam H. Shah, Trevor Hastie, Robert Tibshirani. Statistics in medicineMay 20, 2018. PubMed: 29508417. NIHMSID 956481. PMCID: PMC5938172

22. Evolution of resilience in protein interactomes across the tree of life. M. Zitnik, R. Sosic, M. Feldman, J. Leskovec. Proceedings of the National Academy of Sciences (PNAS), 2019. Pubmed: 30765515 PMCID: 6410798

23. Network enhancement as a general method to denoise weighted biological networks. B. Wang, A. Pourshafeie, M. Zitnik, J. Zhu, C. D. Bustamante, S. Batzoglou, J Leskovec. Nature CommunicationsPubMed: 30082777 PMCID: 6078978

24. Communications, 2018. Prioritizing Network Communities. M. Zitnik, R. Sosic, J. Leskovec. Nature Communications. PubMed: 29959323 PMCID: 6026212

25. Modeling Polypharmacy Side Effects with Graph Convolutional Networks. M. Zitnik, M. Agrawal, J. Leskovec. Bioinformatics, 2018. PMCID: PMC6022705

26. Physical Activity Is Associated with Changes in Knee Cartilage Microstructure: Data from the Osteoarthritis Initiative. Eni Halilaj, Trevor J. Hastie, Garry E. Gold, Scott L. Delp Osteoarthritis Cartilage. 2018 Jun; 26(6): 770–774. PMCID: PMC6086595

27. Paroma Varma and Christopher Ré. 2018. Snuba: automating weak supervision to label training dataProc. VLDB Endow. 12, 3 (November 2018), 223-236. DOI: PMCID: Pending

28. Longitudinal data analysis using matrix completion. Kidziński, Łukasz,  Hastie, Trevor. eprint arXiv. September 25, 2018.



1. Li X, Dunn J, Salinas D, Zhou G, Zhou W, Schussler-Forenza Rose SM, et al. Digital Health: Tracking Physiomes and Activity Using Wearable Biosensors Reveals Useful Health-Related Information. PLoS Biol. 2017 Jan 12;15(1):e2001402. doi: 10.1371/journal.pbio.2001402. eCollection 2017 Jan. PubMed PMID: 28081144; PubMed Central PMCID: PMC5230763.

2. Norden J, Smuck M, Sinha A, Hu R, Tomkins-Lane C. Objective measurement of free-living physical activity (performance) in lumbar spinal stenosis: are physical activity guidelines being met? Spine J. 2017 Jan;17(1):26-33. doi: 10.1016/j.spinee.2016.10.016. PubMed PMID: 27793759. PubMed Central PMCID: PMC5732871

3. DeMers, M.S., Hicks, J.L., Delp, S.L. Preparatory co-activation of the ankle muscles may prevent ankle inversion injuries. Journal of Biomechanics, Vol. 53, 17-23, 2017. PMCID: PMC5798431.

4. Althoff T, Jindal P, Leskovec J.  Online Actions with Offline Impact: How Online Social Networks Influence Online and Offline User Behavior. Proc Int Conf Web Search Data Min. 2017 Feb;2017:537-546. doi: 10.1145/3018661.3018672. Epub 2017 Feb 2. PubMed PMID: 28345078; PubMed Central PMCID: PMC5361221.

5. Hallac D, Wong C, Diamond S, Sosic R, Boyd S, et al. SnapVX: A Network-Based Convex Optimization Solver. Journal of machine learning research : JMLR. PMCID: PMC5870756

6. Jackson, R.W., Dembia, C.L., Delp, S.L., Collins, S.H. Muscle-tendon mechanics explain unexpected effects of exoskeleton assistance on metabolic rate during walking. Journal of Experimental Biology, 2017. PMCID: PMC6514464

7. Smuck M, Tomkins-Lane C, Ith MA, Jarosz R, Kao MJ. Physical performance analysis: A new approach to assessing free-living physical activity in musculoskeletal pain and mobility-limited populations. PLoS One. 2017 Feb 24;12(2):e0172804. doi: 10.1371/journal.pone.0172804. PubMed PMID: 28235039; PubMed Central PMCID: PMC5325560.

8. Shameli A, Althoff T, Saberi A, Leskovec J. How Gamification Affects Physical Activity: Large-scale Analysis of Walking Challenges in a Mobile Application. ACM International Conference on World Wide Web (WWW), 2017. PMCID: PMC5627651

9. De Sa C, Olukotun K, Re C. Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling. JMLR Workshop Conf Proc. 2016;48:1567-1576. PubMed PMID: 28344730; PubMed Central PMCID: PMC5360990.

10. Althoff T, Sosič R, Hicks JL, King AC, Delp SL, Leskovec J. Large-scale physical activity data reveal worldwide activity inequality. Nature. 2017 Jul 10. doi: 10.1038/nature23018. [Epub ahead of print] PubMed PMID: 28693034 PMCID: PMC5774986

11. Snorkel: Rapid Training Data Creation with Weak Supervision Alex Ratner, Stephen Bach, Henry Ehrenberg, Jason Fries, Sen Wu, C. Ré. VLDB 18. PMCID: PMC5951191

12. Fonduer: Knowledge Base Construction from Richly Formatted Data Sen Wu et al. SIGMOD 18. PMCID: PMC6013301

13. Anyone Can Become a Troll: Causes of Trolling Behavior in Online Discussions. J. Cheng, M. Bernstein, C. Danescu-Niculescu-Mizil, J. Leskovec. Computer-Supported Cooperative Work and Social Computing (CSCW), 2017. Best paper award. PMCID: PMC5791909. 

14. Population-Scale Pervasive Health Tim Althoff IEEE Pervasive Computing Year: 2017, Volume: 16, Issue: 4 Pages: 75 – 79 IEEE Journals & Magazines. PMCID: PMC5951162

15. Understanding and Optimizing Asynchronous Low-Precision Stochastic Gradient Descent. C. De Sa, Matt Feldman, C. Ré, Kunle Olukotun. ISCA 2017. PMCID: PMC5789782. 

16. Human Decisions and Machine Predictions. J. Kleinberg, H. Lakkaraju, J. Leskovec, J. Ludwig, S. Mullainathan. Quarterly Journal of Economics, 2017. PMCID: PMC5947971

17. Predicting multicellular function through multi-layer tissue networks. M. Zitnik, J. Leskovec.Bioinformatics, 33 (14): i190-i198, 2017. PMID: 28881986 PMCID: PMC5870717 

18. Local Higher-Order Graph Clustering. H. Yin, A. Benson, D. Gleich, J. Leskovec. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017.PMCID: PMC5951164

19. The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables. H. Lakkaraju, J. Kleinberg, J. Leskovec, J. Ludwig, S. Mullainathan. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017. PMCID: PMC5958915

20. Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data. D. Hallac, S. Vare, S. Boyd, J. Leskovec. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017. Best paper runner-up. PMCID: PMC5951184

21. Network Inference via the Time-Varying Graphical Lasso. D. Hallac, Y. Park, S. Boyd, J. Leskovec.ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017. PMCID: PMC5951186

22. Loyalty in Online Communities. W. Hamilton, J. Zhang, C. Danescu-Niculescu-Mizil, D. Jurafsky, J. Leskovec. AAAI International Conference on Weblogs and Social Media (ICWSM), 2017. PMCID: PMC5774975

23. Hinckson E, Schneider M, Winter S, Stone E, Puhan M, Stathi A, Porter MM, Gardiner PA, Lopes dos Santos D, Wolff, King AC. (In press). Citizen science applied to building healthier community environments: Advancing the field through shared construct and measurement development. IJBNPA [the International Journal of Behavioral Nutrition and Physical Activity]. PubMed Central PMCID: PMC5622546

24. Learning the Structure of Generative Models without Labeled DataStephen H. Bach, Bryan He, Alex Ratner, C. Ré. ICML 2017. PMCID: PMC6417840

25. Inferring Generative Model Structure with Static Analysis. Paroma Varma, Bryan He, Payal Bajaj, C. Ré, NIPS2017. PMCID: PMC5789796. 

26. Learning to Compose Domain-Specific Transformations for Data Augmentation. A. Ratner, H. Ehrenberg, Z. Hussain, J. Dunnmon, C. Ré, NIPS2017. PMCID: PMC5786274

27. Smuck M, Muaremi A, Zheng P, Norden J, Sinha A, Hu R, Tomkins-Lane C. Objective measurement of function following lumbar spinal stenosis decompression reveals improved functional capacity with stagnant real-life physical activity. Spine J. 2017 Sep 26. pii: S1529-9430(17)30979-8. doi: 10.1016/j.spinee.2017.08.262. [Epub ahead of print] PMID: 28962914 PMCID: PMC5732871

28. Gaussian Quadrature for Kernel Features Tri Dao, Chris De Sa, C. Ré, NIPS2017. Spotlight. PMCID: PMC5791159

29. Weighted SGD for lp regression with Randomized Preconditioning. Jiyan Yang, Yin-Lam Chow, C. Ré, and Michael Mahoney. JMLR 17. PMCID: PMC5959301

30. Community Identity and User Engagement in a Multi-Community Landscape. J. Zhang, W. Hamilton, C. Danescu-Niculescu-Mizil, D. Jurafsky, J. Leskovec. AAAI International Conference on Weblogs and Social Media (ICWSM), 2017. PMCID: PMC5774974

31. Learning the Network Structure of Heterogeneous Data via Pairwise Exponential Markov Random Fields. Y. Park, D. Hallac, S. Boyd, J. Leskovec. Artificial Intelligence and Statistics Conference (AISTATS), 2017. PMCID: PMC6436845

32. Network Analysis: A novel Method for Mapping Neonatal Acute Transport Patterns in California. S.N. Kunz, J.A.F. Zupancic, J. Rigdon, C.S. Phibbs, H.C. Lee, J.B. Gould, J. Leskovec, J. Profit. Journal of Perinatology, 2017. PubMed Central PMCID: PMC5446293

33. Madalina Fiterau, Suvrat Bhooshan, Jason Fries, Charles Bournhonesque, Jennifer Hicks, Eni Halilaj, Christopher Ré and Scott Delp. ShortFuse: Biomedical Time Series Representations in the Presence of Structured Information. 3rd Conference on Machine Learning for Healthcare, MLHC 2017. PMCID: PMC6417829


1. He B, De Sa C, Mitliagkas I, Re C. Scan Order in Gibbs Sampling: Models in Which it Matters and Bounds on How Much. Adv Neural Inf Process Syst. 2016;29. pii: 6589. PubMed PMID: 28344429; PubMed Central PMCID: PMC5361064.

2. De Sa C, Ratner A, Re C, Shin J, Wang F, et al. DeepDive: Declarative Knowledge Base Construction. SIGMOD Rec. 2016 Mar;45(1):60-67. Epub 2016 Feb 6. PubMed PMID: 28344371; PubMed Central PMCID: PMC5361060.

3. Winter SJ, Sheats JL, King AC. The Use of Behavior Change Techniques and Theory in Technologies for Cardiovascular Disease Prevention and Treatment in Adults: A Comprehensive Review. Prog Cardiovasc Dis. 2016 May-Jun;58(6):605-12. doi: 10.1016/j.pcad.2016.02.005. Epub 2016 Feb 20. Review. PubMed PMID: 26902519; PubMed Central PMCID: PMC4868665.

4. Uchida TK, Hicks JL, Dembia CL, Delp SL. Stretching Your Energetic Budget: How Tendon Compliance Affects the Metabolic Cost of Running. PLoS One. 2016 Mar 1;11(3):e0150378. doi: 10.1371/journal.pone.0150378. eCollection 2016. PubMed PMID: 26930416; PubMed Central PMCID: PMC4773147.

5. Diamond S, Boyd S. CVXPY: A Python-Embedded Modeling Language for Convex Optimization. J Mach Learn Res. 2016 Apr;17. pii: 83. PubMed PMID: 27375369; PubMed Central PMCID: PMC4927437.

6. Wulczyn E, West R, Zia L, Leskovec J. Growing Wikipedia Across Languages via Recommendation. Proc Int World Wide Web Conf. 2016 Apr;2016:975-985. PubMed PMID: 27819073; PubMed Central PMCID: PMC5092237.

7. Althoff T, Clark K, Leskovec J. Large-scale Analysis of Counseling Conversations: An Application of Natural Language Processing to Mental Health. Trans Assoc Comput Linguist. 2016;4:463-476. PubMed PMID: 28344978; PubMed Central PMCID: PMC5361062.

8. King AC, Winter SJ, Sheats JL, Rosas LG, Buman MP, Salvo D, Rodriguez NM, Seguin RA, Moran M, Garber R, Broderick B, Zieff SG, Sarmiento OL, Gonzalez SA, Banchoff A, Dommarco JR. Leveraging Citizen Science and Information Technology for Population Physical Activity Promotion. Transl J Am Coll Sports Med. 2016 May 15;1(4):30-44. PubMed PMID: 27525309; PubMed Central PMCID: PMC4978140.

9. Aberger CR, Tu S, Olukotun K, Ré C. EmptyHeaded: A Relational Engine for Graph Processing. Proc ACM SIGMOD Int Conf Manag Data. 2016 Jun-Jul;2016:431-446. doi: 10.1145/2882903.2915213. PubMed PMID: 28077912; PubMed Central PMCID: PMC5221635.

10. King AC, Hekler EB, Grieco LA, Winter SJ, Sheats JL, Buman MP, Banerjee B, Robinson TN, Cirimele J. Effects of Three Motivationally Targeted Mobile Device Applications on Initial Physical Activity and Sedentary Behavior Change in Midlife and Older Adults: A Randomized Trial. PLoS One. 2016 Jun 28;11(6):e0156370. doi: 10.1371/journal.pone.0156370. eCollection 2016. Erratum in: PLoS One. 2016;11(7):e0160113. PubMed PMID: 27352250; PubMed Central PMCID: PMC4924838.

11. Zhang C, Cafarella M, Niu F, Re C, Shin J. Extracting Databases from Dark Data with DeepDive. Proc ACM SIGMOD Int Conf Manag Data. 2016 Jun-Jul;2016:847-859. doi: 10.1145/2882903.2904442. PubMed PMID: 28316365; PubMed Central PMCID: PMC5350112.

12. Benson AR, Gleich DF, Leskovec J. Higher-order organization of complex networks.Science. 2016 Jul 8;353(6295):163-6. doi: 10.1126/science.aad9029. PubMed PMID: 27387949; PubMed Central PMCID: PMC5133458.

13. Grover A, Leskovec J. node2vec: Scalable Feature Learning for Networks. KDD. 2016 Aug;2016:855-864. PubMed PMID: 27853626; PubMed Central PMCID: PMC5108654.

14. Lakkaraju H, Bach SH, Leskovec J. Interpretable Decision Sets: A Joint Framework for Description and Prediction. KDD. 2016 Aug;2016:1675-1684. PubMed PMID: 27853627; PubMed Central PMCID: PMC5108651.

15. Mallory EK, Zhang C, Ré C, Altman RB. Large-scale extraction of gene interactions from full-text literature using DeepDive. Bioinformatics. 2016 Jan 1;32(1):106-13. doi: 10.1093/bioinformatics/btv476. Epub 2015 Sep 3. PubMed PMID:26338771; PubMed Central PMCID: PMC4681986.

16. Bias correction in species distribution models: pooling survey and collection data for multiple species. Methods in ecology and evolution / British Ecological Society. NIHMSID: 723170. PMID: 27840673. PMCID: PMC5102514

17. Leskovec J, Sosic R. SNAP: A General Purpose Network Analysis and Graph Mining Library. ACM Trans Intell Syst Technol. 2016 Oct;8(1). pii: 1. doi: 10.1145/2898361. Epub 2016 Oct 3. PubMed PMID: 28344853; PubMed Central PMCID: PMC5361061.

18. Hamilton W, Leskovec J, Jurafasky D. I Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora. Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:595-605. doi: 10.18653/v1/D16-1057. PubMed PMID: 28660257; PubMed Central PMCID: PMC5483533.

19. Hamilton W, Leskovec J, Jurafsky D. Cultural Shift or Linguistic Drift? Comparing Two Computational Measures of Semantic Change. Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:2116-2121. PubMed PMID: 28580459; PubMed Central PMCID: PMC5452980.

20. Ratner A, De Sa C, Wu S, Selsam D, Re C. Data Programming: Creating Large Training Sets, Quickly. NIPS 2016. PMCID: PMC5985238 NIHMSID: NIHMS961532 PMID: 29872252

21. Ferber, R., Osis, S., Hicks, J., Delp, S. Gait Biomechanics in the Era of Data Science. Journal of biomechanics. PMCID: PMC5407492 NIHMSID: NIHMS825844 PMID: 27814971


1. Althoff T, Leskovec J. Donor Retention in Online Crowdfunding Communities: A Case Study of Proc Int World Wide Web Conf. 2015 May;2015:34-44. PubMed PMID: 27077139; PubMed Central PMCID: PMC4827627.

2. Hadjis S, Abuzaid F, Zhang C, Ré C. Caffe con Troll: Shallow Ideas to Speed Up Deep Learning. Proc Fourth Workshop Data Anal Scale Danac 2015 (2015). 2015 May-Jun;2015. pii: 2. PubMed PMID: 27314106; PubMed Central PMCID: PMC4906251.

3. Perez Y, Sosič R, Banerjee A, Puttagunta R, Raison M, Shah P, Leskovec J. Ringo: Interactive Graph Analytics on Big-Memory Machines. Proc ACM SIGMOD Int Conf Manag Data. 2015 May-Jun;2015:1105-1110. PubMed PMID: 27081215; PubMed Central PMCID: PMC4829061.

4. Uchida TK, Sherman MA, Delp SL. Making a meaningful impact: modelling simultaneous frictional collisions in spatial multibody systems. Proc Math Phys Eng Sci. 2015 May 8;471(2177):20140859. PubMed PMID: 27547093; PubMed Central PMCID: PMC4984984.

5. Kimelfeld B, Ré C. A Database Framework for Classifier Engineering. CEUR Workshop Proc. 2015 May;1378. pii: Epub 2015 Jun 11. PubMed PMID: 27274719; PubMed Central PMCID: PMC4891810.

6. Shin J, Wu S, Wang F, De Sa C, Zhang C, Ré C. Incremental Knowledge Base Construction Using DeepDive. Proceedings VLDB Endowment. 2015 Jul;8(11):1310-1321. PubMed PMID: 27144081; PubMed Central PMCID: PMC4852149.

7. Hallac D, Leskovec J, Boyd S. Network Lasso: Clustering and Optimization in Large Graphs. KDD. 2015 Aug;2015:387-396. PubMed PMID: 27398260; PubMed Central PMCID: PMC4937836.

8. Ku JP, Hicks JL, Hastie T, Leskovec J, Ré C, Delp SL. The mobilize center: an NIH big data to knowledge center to advance human movement research and improve mobility. J Am Med Inform Assoc. 2015 Nov;22(6):1120-5. doi:10.1093/jamia/ocv071. Epub 2015 Aug 13. PubMed PMID: 26272077; PubMed Central PMCID: PMC4639715.

9. Shin J, Ré C, Cafarella M. Mindtagger: A Demonstration of Data Labeling in Knowledge Base Construction. Proceedings VLDB Endowment. 2015 Aug;8(12):1920-1923. PubMed PMID: 27144082; PubMed Central PMCID: PMC4852148.

10. Benson A, Gleich D, Leskovec J. Tensor Spectral Clustering for Partitioning Higher-order Network Structures. Proc SIAM Int Conf Data Min. 2015;2015:118-126. PubMed PMID: 27812399; PubMed Central PMCID: PMC5089081.

11. De Sa C, Zhang C, Olukotun K, Ré C. Rapidly Mixing Gibbs Sampling for a Class of Factor Graphs Using Hierarchy Width. Adv Neural Inf Process Syst. 2015 Dec;28:3079-3087. PubMed PMID: 27279724; PubMed Central PMCID: PMC4894721.

12. De Sa C, Zhang C, Olukotun K, Ré C. Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms. Adv Neural Inf Process Syst. 2015 Dec;28:2656-2664. PubMed PMID: 27330264; PubMed Central PMCID: PMC4907892.