|
The PNC2 Rule Induction System
Windows software tool that induces rules using the PNC2 cluster algorithm. An integrated parameter tuning component allows an easy adjustment of the algorithm's behaviour to the given problem without any further knowledge. [Gnu GPL]
http://www.newty.de/pnc2/index.html
Data-Miner Software Kit
Collection of standalone data mining programs, available as scripts or Java programs. Contains information for downloading, installation, data preparation, and operating instructions. Supplementary to the book titled Predictive Data Mining - A Practical Guide.
http://www.data-miner.com/dmsk.html
AC2 - Software Toolkit
A multi-lingual toolkit for various decision tree algorithms with C++ libraries. Available for free download for a variety of platforms.
http://www.alice-soft.com/html/prod_ac2.ht...
Machine Learning Library in C++
MLC++ is a standard C++ library for supervised machine learning, with back-end and front-end tools for data mining tasks like Decision Trees, and Clustering. Information on legal issues, mailing lists, history, standards, platform support, and download instructions.
http://www.sgi.com/tech/mlc/
WinMine Toolkit Home Page
By David Chickering at Microsoft Research. The WinMine Toolkit is a set of tools for Windows 2000/NT/XP that allow you to build statistical models from data. The majority of the tools are command-line executables that can be run in scripts.
http://research.microsoft.com/~dmax/winmin...
Graf-FX
A Microsoft Access application designed to provide tools to explore your databases with graphs and queries. It is also a quick way to generate/prototype Access Graphs without running the Wizards.
http://www.gr-fx.com/graf-fx.htm
StatLib - XlispStat Archive
Environment for statistical computing and dynamic graphics based on Lisp. Contains contributed code and submission instructions.
http://lib.stat.cmu.edu/xlispstat/
Inducing Functional Dependencies from Relations
Includes source code, related research papers and associated work.
http://www.cs.bris.ac.uk/~flach/fdep/
MPCA Software
GPL C/C++ software for data analysis of discrete data using principal/independent component methods. Examples are DPCA, LDA, GaP (like PLSI and NMF). Targetted at text, with MPI and multithreading.
http://cosco.hiit.fi/search/MPCA/
Snob
Uses the Minimum Message Length (MML) principle to do mixture modeling. Mixture modeling concerns modeling a statistical distribution by a mixture of other distributions, and is also known as unsupervised concept learning in Artificial Intelligence. Links to related research papers and software.
http://www.cs.monash.edu.au/~dld/Snob.html
DeFindIt Analysis and Reporting
Open source software for extraction and reporting using a powerful template tool. Deft combines declarative concepts of SQL with all of Perl's features. Requires Linux and Perl
http://defindit.com/
AutoClass C - General Information
An unsupervised Bayesian classification system that seeks a maximum posterior probability classification.
http://ic-www.arc.nasa.gov/ic/projects/bay...
QuickMiner
Open Source creation of a data mining C++ procedure library. Initially focused on mining generalised association rules and generalised sequential patterns
http://quickminer.sourceforge.net/
Visual Basic Data Mining .Net
Data Mining applications developed with Visual Basic or the .NET Framework by Kingsley Tagbo, including Naive Bayes Classifiers. Site provides public domain data mining applications with source code and online documentation. The latest release as of October 2002 is 'Visual Basic Data Mining With Naive Bayes' and '.NET Data Mining With Naive Bayes'.
http://www.visual-basic-data-mining.net
Association Rule Miner
Client-server Java based data mining software for mining association rules. Developed at University of Massachusetts.
http://www.cs.umb.edu/~laur/ARMiner/
PAFI - Pattern Finding Toolkit
A freely available software toolkit for finding frequent patterns in diverse datasets. It contains highly efficient algorithms for finding patterns in transactional, sequential, and graph datasets.
http://www.cs.umn.edu/~karypis/pafi/
ROSETTA
A Software system for data mining based on rough set theory. GUI based operation on MS Windows platforms, with a wide variety of algorithms. Information on features, documentation, utilities, and upcoming releases.
http://www.idi.ntnu.no/~aleks/rosetta/
CART - Salford Systems
A decision tree tool that automatically sifts large, complex databases, searching for and isolating significant patterns and relationships. Offers free limited capability demo for download, product features, applications, user feedback, and associated books.
http://www.salford-systems.com/products-ca...
Shih Tree Builder
Modelling tool that analyzes data generating classification, regression or class probability prediction models.
http://www.shih.be/
Classification Tree in Excel
A small Excel based freeware to build Classification Tree models in Excel. Uses C4.5 algorithm. Very easy to learn and use - but capability is limited.
http://www.geocities.com/adotsaha/CTree/Ct...
DMTools
Written in Python, the toolbox handles caching of database queries and parallelism within a collection of independent queries. Our toolbox provides a number of routines for basic data mining tasks on top of which the user can add more functions - mainly domain and data collection dependent - for complex and time consuming data mining tasks. GNU/GPL. From the Computer Sciences Laboratory of The Australian National University
http://cslab.anu.edu.au/ml/dm/dm_software....
Frequent Pattern Mining Implementations
Frequent itemset and association rule mining implementations (C++) such as Apriori, Eclat and FP-growth.
http://www.adrem.ua.ac.be/~goethals/softwa...
VisDB: A Visual Data Mining and Database Exploration System
The VisDB has been developed to support the exploration of large database. The VisDB system implements several visual data mining techniques, allowing an exploration of large databases (up-to about one million data values).
http://www.dbs.informatik.uni-muenchen.de/...
XELOPES Data Mining Library
Platform- and data-source-independent library for embedded data mining based on the CWM/OMG and other data mining standards. XELOPES-Java algorithms: SVMs, market basket analysis, sequence analysis, decision trees, cluster analysis, multidimensional grouping. XELOPES-C++ algorithms: SVMs, decision trees. [GPL]
http://www.prudsys.com/Produkte/Algorithme...
Discovery of Multivalued Dependencies from Relations
Includes source code, related papers and associated projects.
http://www.cs.bris.ac.uk/~flach/mdep/
Quinlan, Ross
University of New South Wales - Machine learning and data mining.
http://www.rulequest.com/Personal/
CLUTO - Clustering Toolkit
A freely available software toolkit for clustering low- and high-dimensional data sets. It is well-suited for clustering data sets arising in many areas including information retrieval, customer purchasing transactions, science, and biology.
http://www.cs.umn.edu/~karypis/cluto
ECOBWEB - Concept Formation Program
Source code for program for creation of hierarchical classification trees. Information about implementations, documentation, and related research papers.
http://www.eng.tau.ac.il/~yoram/topics/eco...
XmdvTool Development Project
Public-domain software package for the interactive visual exploration of multivariate data sets. It supports four methods for displaying flat form data and hierarchically clustered data: Scatterplots, Star Glyphs, Parallel Coordinates and Dimensional Stacking.
http://davis.wpi.edu/~xmdv/
Model-Based Classification Software
Model based clustering and discriminant analysis, including hierarchical clustering and EM. Developed at University of Washington.
http://www.stat.washington.edu/fraley/mclu...
Tminer Personal Edition
Software suite which has algorithms for association rules, building classifiers, and clustering data from relational database products using JDBC. References to related articles, and research papers.
http://frontdb.ugr.es/research.htm
|