Tuesday, February 13, 2007

KNIME

KNIME, pronounced [naim], is a modular data exploration platform that enables the user to visually create data flows (often referred to as pipelines), selectively execute some or all analysis steps, and later investigate the results through interactive views on data and models.


KNIME was developed (and will continue to be expanded) by the Chair for Bioinformatics and Information Mining at the University of Konstanz, Germany. The group headed by Michael Berthold is also using KNIME for teaching and research at the University. Quite a number of new data analysis methods developed at the chair are integrated in KNIME. Let us know if you are looking for something in particular, not all of those modules are part of the standard KNIME release just yet...


KNIME base version already incorporates over 100 processing nodes for data I/O, preprocessing and cleansing, modelling, analysis and data mining as well as various interactive views, such as scatter plots, parallel coordinates and others. It includes all analysis modules of the well known Weka data mining environment ( http://www.cs.waikato.ac.nz/ml/weka/) and additional plugins allow R-scripts (www.r-project.org) to be run, offering access to a vast library of statistical routines.


KNIME is based on the Eclipse platform and, through it's modular API, easily extensible. Customs nodes and types can be integrated within hours enabling KNIME to be used not only in production environments but also for teaching and research prototyping. If you would like to read a more detailed description of the software, please download our White Paper as a PDF file here.


KNIME is available through a dual licensing scheme. A non-profit open source license allows KNIME to be downloaded, distributed, and used freely as long as the software or its use is not distributed per profit. See license arrangements for details.