Data Preprocessing
- M. Hernandez and S. Stolfo, Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem, Journal of Data Mining and Knowledge Discovery, 1998
Presenter: Mary Bolton
Date: 09/05/18
Presenter: Braith Jackson
Date: 09/11/18
Association Rules
- Chun-Nan Hsu and Graig A. Knoblock, Discovering Robust Knowledge from Databases that Change, Data Mining and Knowledge Discovery, Volume 2, Issue 1, 1998, 69-95.
Presenter: William Fu
Date: 09/10/2018
Presenter: Jose
Date: 09/11/2018
- Xindong Wu, Chengqi Zhang, and Shichao Zhang, Efficient Mining of Both Positive and Negative Association Rules, ACM Transactions on Information Systems, 2004.
Presenter:
Date: 09/12/2018
Presenter: Sophie Smith
Date: 09/13/2018
- R. Agrawal and R. Srikant, Fast Algorithms for Mining Association Rules, Proceedings of the 20th VLDB Conference, Santiago, Chile, 1994
Presenter: Noah Daniel
Date: 09/17/2018
Presenter: Carter King
Date: 09/18/2018
- R. Srikant and R. Agrawal, Mining Quantitative Association Rules in Large Relational Tables, SIGMOD 1996.
Presenter:
Date: 09/17/2018
Presenter: Julie Charbonnet
Date: 09/18/2018
Pattern Mining
- Mohammed J. Zaki, Efficiently Mining Frequent Trees in a Forest, KDD 2002.
Presenter: Clare Edgar
Date: 09/19/2018
Presenter: Liam Nolan
Date: 09/20/2018
- Jiawei Han, Jian Pei, and Yiwen Yin, Mining Frequent Patterns without Candidate Generation, SIGMOD, 2000.
Presenter: Shane Kelley
Date: 09/24/2018
Presenter: Jun Kim
Date: 09/25/2018
- R. Agrawal and R. Srikant, Mining Sequential Patterns, Proc. of the Int'l Conference on Data Engineering (ICDE), Taipei, Taiwan, March 1995.
Presenter: Hieu Nguyen
Date: 09/26/2018
Presenter: Natalia Dobrowlski
Date: 09/27/2018
Classification
- Pedro Domingos, Meta-Cost: A General Method for Making Classifiers Cost-Sensitive, KDD, 1999.
Presenter: Jillian Gamble
Date: 10/01/2018
Presenter: Ben Phosarath
Date: 10/02/2018
- B. Abelson, K. Varshney, and J. Sun, Targeting Direct Cash Transfers to the Extremely Poor, KDD, 2014.
Presenter: Tim Domescek
Date: 10/08/2018
Presenter: Emma Goff
Date: 10/09/2018
Clustering
- George Karypis, Eui-Hong (Sam) Han, and Vipin Kumar, CHAMELEON: A Hierarchical Clustering Algorithm Using Dynamic Modeling, IEEE Computer, 1999.
Presenter: caroline coles
Date: 10/10/2018
Presenter: William von Hassell
Date: 10/11/2018
- Hastie, T. and Tibshirani, R., Discriminant Adaptive Nearest Neighbor Classification, IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI), 1996.
Presenter: Jimmy Schermer
Date: 10/17/2018
Presenter: Will Robichaux
Date: 10/18/2018
- S. Arya, D. Mount, N. Netanyahu, R. Silverman, and A. Wu, An Optimal Algorithm for Approximate Nearest Neighbor Searching in Fixed Dimensions, J. ACM 45, 6 (November 1998), 891-923.
Presenter: Michael Bardos
Date: 10/22/2018
Presenter: Zach Cornelison
Date: 10/23/2018
Boosting/Bagging
- Y. Freund and R. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences, 55(1): 119-139, 1997.
Presenter: Adam Hearn
Date: 10/24/2018
Presenter: Chandler Braxton
Date: 10/25/2018
- R. Schapire and Y. Singer, Improved Boosting Algorithms Using Confidence-rated Predictions, Machine Learning, 37(3):297-336, 1999.
Presenter: Allante Carr
Date: 10/29/2018
Presenter: Andrew Frantz
Date: 10/30/2018
Big Data
- Brin, S. and Page, L. The anatomy of a large-scale hypertextual Web search engine. In Proceedings of the Seventh international Conference on World Wide Web (WWW-7), 1998.
Presenter: Will Raines
Date: 11/05/2018
Presenter: Marcus Tate
Date: 11/06/2018
- Roberto J. Bayardo Jr., Efficiently Mining Long Patterns from Databases, SIGMOD, 1998.
Presenter: Conrad Staiger
Date: 11/07/2018
Presenter: Hannah Chipman
Date: 11/08/2018
Applications
- B. Hooi, K. Shin, H. A. Song, A. Beutel, N. Shah, and C. Faloutsos, Graph-Based Fraud Detection in the Face of Camouflage, KDD, 2017.
Presenter: Sahil Reddy
Date: 11/12/2018
Presenter: Will McIntyre
Date: 11/13/2018
- S. Vosoughi, M. Mohsenvand, and D. Roy, Rumor Gauge: Predicting the Veracity of Rumors on Twitter, KDD, 2017.
Presenter: Connor Ross
Date: 11/14/2018
Presenter: Rachel Fox
Date: 11/15/2018
- J.Yang, J.McAuley, and J. Leskovec, Community Detection in Networks with Node Attributes, IEEE International Conference On Data Mining (ICDM), 2013.
Presenter: Geoffrey Adams
Date: 11/14/2018
Presenter: João Pedro Veloso
Date: 11/15/2018
- E. Algizawy, T. Ogawa, and A. El-Mahdy Real-Time Large-Scale Map Matching Using Mobile Phone Data ,KDD, 2017.
Presenter: Negusu
Date: 11/19/2018
Presenter: Thomas Kirby
Date: 11/20/2018
- J. Liu, C. Aggarwal, and J. Han. On Integrating Network and Community Discovery, WSDM, 2015.
Presenter: Tanner McDaniel
Date: 11/26/2018
Presenter: Jack Kirkpatrick
Date: 11/27/2018