THE LUCS-KDD SOFTWARE LIBRARY
(LIVERPOOL UNIVERSITY COMPUTER SCIENCE - KNOWLEDGE DISCOVERY IN
DATAS)
Frans Coenen
Department of Computer Science
The University of Liverpool
|
|
SO YOU WANT TO MINE YOUR DATA!
Some guidance notes for people who wish to mine their data
but know nothing about programming (and don't want to know
anything about programming).
|
|
It is has always been the practice of the LUCS-KDD
research team to make a substantial amount of the software developed, as part
of the teams on going KDD research work, publicly available free of charge.
From this WWW page it is possible to download various peices of software that
the team feel are robust enough for general usage. All the nsoftware is written
in Java and therefore should be highly portable. The team welcomes comments
and observations on any aspect of the software available from these WWW
pages.
To assist in understanding some of the algorithms some additional generic
notes have also been made available in Section 2.
1. AVAILABLE SOFTWARE SYSTEMS |
Data generators:
- LUCS-KDD ARM data generator
(Version 3.2 --- 7/2/2007).
- LUCS-KDD Random Image
generator.
Data preprocessing software:
- LUCS-KDD CARM Discretisation/
Normalisation (DN) software (Version 2 --- 18/1/2005).
- LUCS-KDD ARM Discretisation/
Normalisation (DN) software.
- Notes on discretising selected
data sets.
Data Sets
- Selection of discretised
(using the LUCS-KDD-DN software) data sets.
- 20 News
Group Sample Dataset (Local access only).
- Other sites from where data is avialable include: (i) the
UCI Machine Learning repository, and (ii) the Helsinki Frequent
Itemset Mining Dataset Repository.
Association Rule Mining (ARM) software:
- Apriori-T
demonstrator (originally developed for Laureate eLearning).
- Apriori-T Association
Rule Mining (ARM) algorithm.
- Apriori-T GUI Association
Rule Mining (ARM) algorithm GUI (includes DIC and Negtaive boarder example
algorithms for comparison purposes).
- TFP (Total From Partial)
Association Rule Mining (ARM) algorithm.
- Apriori-TFP GUI
Association Rule Mining (ARM) algorithm GUI (includes FP-growth example
algorithm for comparison purposes).
- FP-growth Association
Rule Mining (ARM) algorithm.
- Fuzzy Apriori-T Fuzzy
Association Rule Mining (FARM) algorithm.
- Weighted Fuzzy
Apriori-T Fuzzy Weighted
Association Rule Mining (FWARM) algorithm.
Classification Association Rule Mining (CARM) software:
- Apriori-TFPC Classification
Association Rule Mining (CARM) algorithm.
- LUCS-KDD implementations of
FOIL, PRM and CPAR.
- LUCS-KDD implementation of
CMAR.
- LUCS-KDD implementation of
CBA.
Classification Rule Mining (CRM) software:
- Decision Tree Classification
Rule Miner (CARM).
Text Mining Software
- TFPC Text Miner (and text
preprocessor).
Included here are some additional notes relevant to the software available
from this WWW page.
- The support and cofidenace
framework.
Created and maintained by
Frans Coenen.
Last updated 21 December 2012