Main Page

August 29, 2015 (Saturday)
Welcome to the Penn Data Mining Group!

The Penn Data Mining Group develops principled means of modeling and analyzing large data sets. We combine modern statistical methods, machine learning, and knowledge of specific application areas to develop new approaches to data mining.
Current projects include developing new methods of
  • clustering and analyzing hundreds of thousands of articles using citations as well as the text
  • selecting the best of hundreds of thousands of variables to use in a model
  • statistical learning from relational data
  • learning gene regulatory networks in the brain, circadian rhythm, and the liver.
See our Research Descriptions and slightly out-of-date Publications for more info. We also have a Data Mining Reading Group with a mailing list, maintained by Dean Foster.
Computer Science Contact
Lyle Ungar
Statistics Department Contact
Dean Foster
Next Meeting
Date: Mondays
Time: 4:30 pm
Location: Statistics Conference Room (4th floor Huntsman)
Reading: TBA: current topics in multi-view learning and CCA

More info...

What is Up
People in the group may want info on
getting started, git, Python, Matlab, productivity, ssh, MediaWiki or Paper Writing and Finding Papers
Machine Learning
info for New members
Data Mining references
LDC bioIE tagged medline abstracts for IE
Coming soon: a set of test wikis for our email2wiki project

Other projects on this wiki