Publications

*Chronologically ordered by month starting with January – December


2017

1. Li X, Dunn J, Salinas D, Zhou G, Zhou W, Schussler-Forenza Rose SM, et al. Digital Health: Tracking Physiomes and Activity Using Wearable Biosensors Reveals Useful Health-Related Information. PLoS Biol. 2017 Jan 12;15(1):e2001402. doi: 10.1371/journal.pbio.2001402. eCollection 2017 Jan. PubMed PMID: 28081144; PubMed Central PMCID: PMC5230763.

2. Norden J, Smuck M, Sinha A, Hu R, Tomkins-Lane C. Objective measurement of free-living physical activity (performance) in lumbar spinal stenosis: are physical activity guidelines being met? Spine J. 2017 Jan;17(1):26-33. doi: 10.1016/j.spinee.2016.10.016. PubMed PMID: 27793759.

3. Althoff T, Jindal P, Leskovec J.  Online Actions with Offline Impact: How Online Social Networks Influence Online and Offline User Behavior. Proc Int Conf Web Search Data Min. 2017 Feb;2017:537-546. doi: 10.1145/3018661.3018672. Epub 2017 Feb 2. PubMed PMID: 28345078; PubMed Central PMCID: PMC5361221.

4. Hallac D, Wong C, Diamond S, Sosic R, Boyd S, et al. SnapVX: A Network-Based Convex Optimization Solver. Journal of machine learning research : JMLR.

5. Smuck M, Tomkins-Lane C, Ith MA, Jarosz R, Kao MJ. Physical performance analysis: A new approach to assessing free-living physical activity in musculoskeletal pain and mobility-limited populations. PLoS One. 2017 Feb 24;12(2):e0172804. doi: 10.1371/journal.pone.0172804. PubMed PMID: 28235039; PubMed Central PMCID: PMC5325560.

6. Shameli A, Althoff T, Saberi A, Leskovec J. How Gamification Affects Physical Activity: Large-scale Analysis of Walking Challenges in a Mobile Application. ACM International Conference on World Wide Web (WWW), 2017.

7. De Sa C, Olukotun K, Re C. Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling. JMLR Workshop Conf Proc. 2016;48:1567-1576. PubMed PMID: 28344730; PubMed Central PMCID: PMC5360990.

8. Althoff T, Sosič R, Hicks JL, King AC, Delp SL, Leskovec J. Large-scale physical activity data reveal worldwide activity inequality. Nature. 2017 Jul 10. doi: 10.1038/nature23018. [Epub ahead of print] PubMed PMID: 28693034

9. Rapid Training Data Creation with Weak Supervision Alex Ratner, Stephen Bach, Henry Ehrenberg, Jason Fries, Sen Wu, C. Ré. VLDB 18.

10. Fonduer: Knowledge Base Construction from Richly Formatted Data Sen Wu et al. SIGMOD 18.

11. Anyone Can Become a Troll: Causes of Trolling Behavior in Online Discussions. J. Cheng, M. Bernstein, C. Danescu-Niculescu-Mizil, J. Leskovec. Computer-Supported Cooperative Work and Social Computing (CSCW), 2017. Best paper award.

12. Shcherbina, A., Mattsson, CM., Waggott, D., Salisbury, H., Christle, JW., Hastie, T., Wheeler, M.T., Ashley E. A. Accuracy in Wrist-Worn, Sensor-Based Measurements of Heart Rate and Energy Expenditure in a Diverse Cohort. Journal of Personalized Medicine, 7(2), May 24, 2017.

13. Understanding and Optimizing Asynchronous Low-Precision Stochastic Gradient Descent. C. De Sa, Matt Feldman, C. Ré, Kunle Olukotun. ISCA 2017.

14. Human Decisions and Machine Predictions. J. Kleinberg, H. Lakkaraju, J. Leskovec, J. Ludwig, S. Mullainathan. Quarterly Journal of Economics, 2017.

15. Predicting multicellular function through multi-layer tissue networks. M. Zitnik, J. Leskovec.Bioinformatics, 33 (14): i190-i198, 2017.

16. Local Higher-Order Graph Clustering. H. Yin, A. Benson, D. Gleich, J. Leskovec. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017.

17. The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables. H. Lakkaraju, J. Kleinberg, J. Leskovec, J. Ludwig, S. Mullainathan. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017.

18. Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data. D. Hallac, S. Vare, S. Boyd, J. Leskovec. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017. Best paper runner-up.

19. Network Inference via the Time-Varying Graphical Lasso. D. Hallac, Y. Park, S. Boyd, J. Leskovec.ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017.

20. Loyalty in Online Communities. W. Hamilton, J. Zhang, C. Danescu-Niculescu-Mizil, D. Jurafsky, J. Leskovec. AAAI International Conference on Weblogs and Social Media (ICWSM), 2017.

21. Hinckson E, Schneider M, Winter S, Stone E, Puhan M, Stathi A, Porter MM, Gardiner PA, Lopes dos Santos D, Wolff, King AC. (In press). Citizen science applied to building healthier community environments: Advancing the field through shared construct and measurement development. IJBNPA [the International Journal of Behavioral Nutrition and Physical Activity].

22. Learning the Structure of Generative Models without Labeled Data Stephen H. Bach, Bryan He, Alex Ratner, C. Ré. ICML 2017.

23. Inferring Generative Model Structure with Static Analysis Paroma Varma, Bryan He, Payal Bajaj, C. Ré, NIPS2017.

24. Learning to Compose Domain-Specific Transformations for Data Augmentation A. Ratner, H. Ehrenberg, Z. Hussain, J. Dunnmon, C. Ré, NIPS2017.

25. Smuck M, Muaremi A, Zheng P, Norden J, Sinha A, Hu R, Tomkins-Lane C. Objective measurement of function following lumbar spinal stenosis decompression reveals improved functional capacity with stagnant real-life physical activity. Spine J. 2017 Sep 26. pii: S1529-9430(17)30979-8. doi: 10.1016/j.spinee.2017.08.262. [Epub ahead of print] PMID: 28962914

26. Gaussian Quadrature for Kernel Features Tri Dao, Chris De Sa, C. Ré, NIPS2017. Spotlight

27. Weighted SGD for lp regression with Randomized Preconditioning. Jiyan Yang, Yin-Lam Chow, C. Ré, and Michael Mahoney. JMLR 17.

29. Community Identity and User Engagement in a Multi-Community Landscape. J. Zhang, W. Hamilton, C. Danescu-Niculescu-Mizil, D. Jurafsky, J. Leskovec. AAAI International Conference on Weblogs and Social Media (ICWSM), 2017.

30. Learning the Network Structure of Heterogeneous Data via Pairwise Exponential Markov Random Fields. Y. Park, D. Hallac, S. Boyd, J. Leskovec. Artificial Intelligence and Statistics Conference (AISTATS), 2017.

31. Network Analysis: A novel Method for Mapping Neonatal Acute Transport Patterns in California. S.N. Kunz, J.A.F. Zupancic, J. Rigdon, C.S. Phibbs, H.C. Lee, J.B. Gould, J. Leskovec, J. Profit. Journal of Perinatology, 2017.

32. CVXR: An R Package for Disciplined Convex Optimization. A. Fu, B. Narasimhan, and S. Boyd.


2016

1. He B, De Sa C, Mitliagkas I, Re C. Scan Order in Gibbs Sampling: Models in Which it Matters and Bounds on How Much. Adv Neural Inf Process Syst. 2016;29. pii: 6589. PubMed PMID: 28344429; PubMed Central PMCID: PMC5361064.

2. De Sa C, Ratner A, Re C, Shin J, Wang F, et al. DeepDive: Declarative Knowledge Base Construction. SIGMOD Rec. 2016 Mar;45(1):60-67. Epub 2016 Feb 6. PubMed PMID: 28344371; PubMed Central PMCID: PMC5361060.

3. Winter SJ, Sheats JL, King AC. The Use of Behavior Change Techniques and Theory in Technologies for Cardiovascular Disease Prevention and Treatment in Adults: A Comprehensive Review. Prog Cardiovasc Dis. 2016 May-Jun;58(6):605-12. doi: 10.1016/j.pcad.2016.02.005. Epub 2016 Feb 20. Review. PubMed PMID: 26902519; PubMed Central PMCID: PMC4868665.

4. Uchida TK, Hicks JL, Dembia CL, Delp SL. Stretching Your Energetic Budget: How Tendon Compliance Affects the Metabolic Cost of Running. PLoS One. 2016 Mar 1;11(3):e0150378. doi: 10.1371/journal.pone.0150378. eCollection 2016. PubMed PMID: 26930416; PubMed Central PMCID: PMC4773147.

5. Diamond S, Boyd S. CVXPY: A Python-Embedded Modeling Language for Convex Optimization. J Mach Learn Res. 2016 Apr;17. pii: 83. PubMed PMID: 27375369; PubMed Central PMCID: PMC4927437.

6. Wulczyn E, West R, Zia L, Leskovec J. Growing Wikipedia Across Languages via Recommendation. Proc Int World Wide Web Conf. 2016 Apr;2016:975-985. PubMed PMID: 27819073; PubMed Central PMCID: PMC5092237.

7. Althoff T, Clark K, Leskovec J. Large-scale Analysis of Counseling Conversations: An Application of Natural Language Processing to Mental Health. Trans Assoc Comput Linguist. 2016;4:463-476. PubMed PMID: 28344978; PubMed Central PMCID: PMC5361062.

8. King AC, Winter SJ, Sheats JL, Rosas LG, Buman MP, Salvo D, Rodriguez NM, Seguin RA, Moran M, Garber R, Broderick B, Zieff SG, Sarmiento OL, Gonzalez SA, Banchoff A, Dommarco JR. Leveraging Citizen Science and Information Technology for Population Physical Activity Promotion. Transl J Am Coll Sports Med. 2016 May 15;1(4):30-44. PubMed PMID: 27525309; PubMed Central PMCID: PMC4978140.

9. Aberger CR, Tu S, Olukotun K, Ré C. EmptyHeaded: A Relational Engine for Graph Processing. Proc ACM SIGMOD Int Conf Manag Data. 2016 Jun-Jul;2016:431-446. doi: 10.1145/2882903.2915213. PubMed PMID: 28077912; PubMed Central PMCID: PMC5221635.

10. King AC, Hekler EB, Grieco LA, Winter SJ, Sheats JL, Buman MP, Banerjee B, Robinson TN, Cirimele J. Effects of Three Motivationally Targeted Mobile Device Applications on Initial Physical Activity and Sedentary Behavior Change in Midlife and Older Adults: A Randomized Trial. PLoS One. 2016 Jun 28;11(6):e0156370. doi: 10.1371/journal.pone.0156370. eCollection 2016. Erratum in: PLoS One. 2016;11(7):e0160113. PubMed PMID: 27352250; PubMed Central PMCID: PMC4924838.

11. Zhang C, Cafarella M, Niu F, Re C, Shin J. Extracting Databases from Dark Data with DeepDive. Proc ACM SIGMOD Int Conf Manag Data. 2016 Jun-Jul;2016:847-859. doi: 10.1145/2882903.2904442. PubMed PMID: 28316365; PubMed Central PMCID: PMC5350112.

12. Benson AR, Gleich DF, Leskovec J. Higher-order organization of complex networks.Science. 2016 Jul 8;353(6295):163-6. doi: 10.1126/science.aad9029. PubMed PMID: 27387949; PubMed Central PMCID: PMC5133458.

13. Grover A, Leskovec J. node2vec: Scalable Feature Learning for Networks. KDD. 2016 Aug;2016:855-864. PubMed PMID: 27853626; PubMed Central PMCID: PMC5108654.

14. Lakkaraju H, Bach SH, Leskovec J. Interpretable Decision Sets: A Joint Framework for Description and Prediction. KDD. 2016 Aug;2016:1675-1684. PubMed PMID: 27853627; PubMed Central PMCID: PMC5108651.

15. Mallory EK, Zhang C, Ré C, Altman RB. Large-scale extraction of gene interactions from full-text literature using DeepDive. Bioinformatics. 2016 Jan 1;32(1):106-13. doi: 10.1093/bioinformatics/btv476. Epub 2015 Sep 3. PubMed PMID:26338771; PubMed Central PMCID: PMC4681986.

16. Uchida TK, Seth A, Pouya S, Dembia CL, Hicks JL, Delp SL. Simulating Ideal Assistive Devices to Reduce the Metabolic Cost of Running. PLoS One. 2016 Sep 22;11(9):e0163417. doi: 10.1371/journal.pone.0163417. eCollection 2016. PubMed PMID: 27656901.

17. Leskovec J, Sosic R. SNAP: A General Purpose Network Analysis and Graph Mining Library. ACM Trans Intell Syst Technol. 2016 Oct;8(1). pii: 1. doi: 10.1145/2898361. Epub 2016 Oct 3. PubMed PMID: 28344853; PubMed Central PMCID: PMC5361061.

18. Hamilton W, Leskovec J, Jurafasky D. I Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora. Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:595-605. doi: 10.18653/v1/D16-1057. PubMed PMID: 28660257; PubMed Central PMCID: PMC5483533.

19. Hamilton W, Leskovec J, Jurafsky D. Cultural Shift or Linguistic Drift? Comparing Two Computational Measures of Semantic Change. Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:2116-2121. PubMed PMID: 28580459; PubMed Central PMCID: PMC5452980.

20. Ratner A, De Sa C, Wu S, Selsam D, Re C. Data Programming: Creating Large Training Sets, Quickly. NIPS 2016.


2015

1. Althoff T, Leskovec J. Donor Retention in Online Crowdfunding Communities: A Case Study of DonorsChoose.org. Proc Int World Wide Web Conf. 2015 May;2015:34-44. PubMed PMID: 27077139; PubMed Central PMCID: PMC4827627.

2. Hadjis S, Abuzaid F, Zhang C, Ré C. Caffe con Troll: Shallow Ideas to Speed Up Deep Learning. Proc Fourth Workshop Data Anal Scale Danac 2015 (2015). 2015 May-Jun;2015. pii: 2. PubMed PMID: 27314106; PubMed Central PMCID: PMC4906251.

3. Perez Y, Sosič R, Banerjee A, Puttagunta R, Raison M, Shah P, Leskovec J. Ringo: Interactive Graph Analytics on Big-Memory Machines. Proc ACM SIGMOD Int Conf Manag Data. 2015 May-Jun;2015:1105-1110. PubMed PMID: 27081215; PubMed Central PMCID: PMC4829061.

4. Uchida TK, Sherman MA, Delp SL. Making a meaningful impact: modelling simultaneous frictional collisions in spatial multibody systems. Proc Math Phys Eng Sci. 2015 May 8;471(2177):20140859. PubMed PMID: 27547093; PubMed Central PMCID: PMC4984984.

5. Kimelfeld B, Ré C. A Database Framework for Classifier Engineering. CEUR Workshop Proc. 2015 May;1378. pii: http://ceur-ws.org/Vol-1378/AMW_2015_paper_1.pdf. Epub 2015 Jun 11. PubMed PMID: 27274719; PubMed Central PMCID: PMC4891810.

6. Shin J, Wu S, Wang F, De Sa C, Zhang C, Ré C. Incremental Knowledge Base Construction Using DeepDive. Proceedings VLDB Endowment. 2015 Jul;8(11):1310-1321. PubMed PMID: 27144081; PubMed Central PMCID: PMC4852149.

7. Hallac D, Leskovec J, Boyd S. Network Lasso: Clustering and Optimization in Large Graphs. KDD. 2015 Aug;2015:387-396. PubMed PMID: 27398260; PubMed Central PMCID: PMC4937836.

8. Ku JP, Hicks JL, Hastie T, Leskovec J, Ré C, Delp SL. The mobilize center: an NIH big data to knowledge center to advance human movement research and improve mobility. J Am Med Inform Assoc. 2015 Nov;22(6):1120-5. doi:10.1093/jamia/ocv071. Epub 2015 Aug 13. PubMed PMID: 26272077; PubMed Central PMCID: PMC4639715.

9. Shin J, Ré C, Cafarella M. Mindtagger: A Demonstration of Data Labeling in Knowledge Base Construction. Proceedings VLDB Endowment. 2015 Aug;8(12):1920-1923. PubMed PMID: 27144082; PubMed Central PMCID: PMC4852148.

10. Benson A, Gleich D, Leskovec J. Tensor Spectral Clustering for Partitioning Higher-order Network Structures. Proc SIAM Int Conf Data Min. 2015;2015:118-126. PubMed PMID: 27812399; PubMed Central PMCID: PMC5089081.

11. De Sa C, Zhang C, Olukotun K, Ré C. Rapidly Mixing Gibbs Sampling for a Class of Factor Graphs Using Hierarchy Width. Adv Neural Inf Process Syst. 2015 Dec;28:3079-3087. PubMed PMID: 27279724; PubMed Central PMCID: PMC4894721.

12. De Sa C, Zhang C, Olukotun K, Ré C. Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms. Adv Neural Inf Process Syst. 2015 Dec;28:2656-2664. PubMed PMID: 27330264; PubMed Central PMCID: PMC4907892.