Data Preprocessing

  1. M. Hernandez and S. Stolfo, Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem, Journal of Data Mining and Knowledge Discovery, 1998
  2. Presenter: Mary Bolton
    Date: 09/05/2018

    Presenter: Braith Jackson
    Date: 09/11/2018

    Association Rules

  3. Chun-Nan Hsu and Craig A. Knoblock, Discovering Robust Knowledge from Databases that Change, Data Mining and Knowledge Discovery, Volume 2, Issue 1, 1998, 69-95.
  4. Presenter: William Fu
    Date: 09/10/2018

    Presenter: Jose
    Date: 09/11/2018

  5. Xindong Wu, Chengqi Zhang, and Shichao Zhang, Efficient Mining of Both Positive and Negative Association Rules, ACM Transactions on Information Systems, 2004.
  6. Presenter:
    Date: 09/12/2018

    Presenter: Sophie Smith
    Date: 09/13/2018

  7. R. Agrawal and R. Srikant, Fast Algorithms for Mining Association Rules, Proceedings of the 20th VLDB Conference, Santiago, Chile, 1994
  8. Presenter: Noah Daniel
    Date: 09/17/2018

    Presenter: Carter King
    Date: 09/18/2018

  9. R. Srikant and R. Agrawal, Mining Quantitative Association Rules in Large Relational Tables, SIGMOD 1996.
  10. Presenter:
    Date: 09/17/2018

    Presenter: Julie Charbonnet
    Date: 09/18/2018

    Pattern Mining

  11. Mohammed J. Zaki, Efficiently Mining Frequent Trees in a Forest, KDD 2002.
  12. Presenter: Clare Edgar
    Date: 09/19/2018

    Presenter: Liam Nolan
    Date: 09/20/2018

  13. Jiawei Han, Jian Pei, and Yiwen Yin, Mining Frequent Patterns without Candidate Generation, SIGMOD, 2000.
  14. Presenter: Shane Kelley
    Date: 09/24/2018

    Presenter: Jun Kim
    Date: 09/25/2018

  15. R. Agrawal and R. Srikant, Mining Sequential Patterns, Proc. of the Int'l Conference on Data Engineering (ICDE), Taipei, Taiwan, March 1995.
  16. Presenter: Hieu Nguyen
    Date: 09/26/2018

    Presenter: Natalia Dobrowlski
    Date: 09/27/2018

    Classification

  17. Pedro Domingos, MetaCost: A General Method for Making Classifiers Cost-Sensitive, KDD, 1999.
  18. Presenter: Jillian Gamble
    Date: 10/01/2018

    Presenter: Ben Phosarath
    Date: 10/02/2018

  19. B. Abelson, K. Varshney, and J. Sun, Targeting Direct Cash Transfers to the Extremely Poor, KDD, 2014.
  20. Presenter: Tim Domescek
    Date: 10/08/2018

    Presenter: Emma Goff
    Date: 10/09/2018

    Clustering

  21. George Karypis, Eui-Hong (Sam) Han, and Vipin Kumar, CHAMELEON: A Hierarchical Clustering Algorithm Using Dynamic Modeling, IEEE Computer, 1999.
  22. Presenter: Caroline Coles
    Date: 10/10/2018

    Presenter: William von Hassell
    Date: 10/11/2018

  23. Hastie, T. and Tibshirani, R., Discriminant Adaptive Nearest Neighbor Classification, IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI), 1996.
  24. Presenter: Jimmy Schermer
    Date: 10/17/2018

    Presenter: Will Robichaux
    Date: 10/18/2018

  25. S. Arya, D. Mount, N. Netanyahu, R. Silverman, and A. Wu, An Optimal Algorithm for Approximate Nearest Neighbor Searching in Fixed Dimensions, J. ACM 45, 6 (November 1998), 891-923.
  26. Presenter: Michael Bardos
    Date: 10/22/2018

    Presenter: Zach Cornelison
    Date: 10/23/2018

    Boosting/Bagging

  27. Y. Freund and R. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences, 55(1): 119-139, 1997.
  28. Presenter: Adam Hearn
    Date: 10/24/2018

    Presenter: Chandler Braxton
    Date: 10/25/2018

  29. R. Schapire and Y. Singer, Improved Boosting Algorithms Using Confidence-rated Predictions, Machine Learning, 37(3):297-336, 1999.
  30. Presenter: Allante Carr
    Date: 10/29/2018

    Presenter: Andrew Frantz
    Date: 10/30/2018

    Big Data

  31. Brin, S. and Page, L., The Anatomy of a Large-Scale Hypertextual Web Search Engine, Proceedings of the Seventh International Conference on World Wide Web (WWW-7), 1998.
  32. Presenter: Will Raines
    Date: 11/05/2018

    Presenter: Marcus Tate
    Date: 11/06/2018

  33. Roberto J. Bayardo Jr., Efficiently Mining Long Patterns from Databases, SIGMOD, 1998.
  34. Presenter: Conrad Staiger
    Date: 11/07/2018

    Presenter: Hannah Chipman
    Date: 11/08/2018

    Applications

  35. B. Hooi, K. Shin, H. A. Song, A. Beutel, N. Shah, and C. Faloutsos, Graph-Based Fraud Detection in the Face of Camouflage, KDD, 2017.
  36. Presenter: Sahil Reddy
    Date: 11/12/2018

    Presenter: Will McIntyre
    Date: 11/13/2018

  37. S. Vosoughi, M. Mohsenvand, and D. Roy, Rumor Gauge: Predicting the Veracity of Rumors on Twitter, KDD, 2017.
  38. Presenter: Connor Ross
    Date: 11/14/2018

    Presenter: Rachel Fox
    Date: 11/15/2018

  39. J. Yang, J. McAuley, and J. Leskovec, Community Detection in Networks with Node Attributes, IEEE International Conference on Data Mining (ICDM), 2013.
  40. Presenter: Geoffrey Adams
    Date: 11/14/2018

    Presenter: João Pedro Veloso
    Date: 11/15/2018

  41. E. Algizawy, T. Ogawa, and A. El-Mahdy, Real-Time Large-Scale Map Matching Using Mobile Phone Data, KDD, 2017.
  42. Presenter: Negusu
    Date: 11/19/2018

    Presenter: Thomas Kirby
    Date: 11/20/2018

  43. J. Liu, C. Aggarwal, and J. Han, On Integrating Network and Community Discovery, WSDM, 2015.
  44. Presenter: Tanner McDaniel
    Date: 11/26/2018

    Presenter: Jack Kirkpatrick
    Date: 11/27/2018