5 of the Best Free and Open Source Data Mining Software

download 5 of the Best Free and Open Source Data Mining Software

of 3

Transcript of 5 of the Best Free and Open Source Data Mining Software

  • 7/29/2019 5 of the Best Free and Open Source Data Mining Software

    1/3

    5 of the Best Free and Open Source Data Mining SoftwarePOSTED BYJUN AUZA ON 11/25/2010

    The process of extracting patterns from data is called data mining. It is recognized as

    an essential tool by modern business since it is able to convert data into businessintelligence thus giving an informational edge. At present, it is widely used in profiling

    practices, like surveillance, marketing, scientific discovery, and fraud detection.

    There are four kinds of tasks that are normally

    involve in Data mining:

    * Classification - the task of generalizing familiar structure to employ to new data

    * Clustering - the task of finding groups and structures in the data that are in some

    way or another the same, without using noted structures in the data.

    * Association rule learning - Looks for relationships between variables.

    * Regression - Aims to find a function that models the data with the slightest error.

    For those of you who are looking for some data mining tools, here are five of the best

    open-source data mining software that you could get for free:

    Orange

    Orange is a component-based data mining and machine

    learning software suite that features friendly yet powerful, fast and versatile visual

    programming front-end for explorative data analysis and visualization, and Python

    bindings and libraries for scripting. It contains complete set of components for datapreprocessing, feature scoring and filtering, modeling, model evaluation, and

    http://www.ailab.si/orangehttp://www.ailab.si/orangehttp://4.bp.blogspot.com/_UqUwVPikChs/TO4rPMJvt6I/AAAAAAAAPrQ/8A7JtGDCRsc/s1600/orange-data-mining-software.jpghttp://2.bp.blogspot.com/_UqUwVPikChs/TO4r2Tmkd2I/AAAAAAAAPrg/0XdSY7uwt-w/s1600/data-mining-software.jpghttp://4.bp.blogspot.com/_UqUwVPikChs/TO4rPMJvt6I/AAAAAAAAPrQ/8A7JtGDCRsc/s1600/orange-data-mining-software.jpghttp://2.bp.blogspot.com/_UqUwVPikChs/TO4r2Tmkd2I/AAAAAAAAPrg/0XdSY7uwt-w/s1600/data-mining-software.jpghttp://www.ailab.si/orange
  • 7/29/2019 5 of the Best Free and Open Source Data Mining Software

    2/3

    exploration techniques. It is written in C++ and Python, and its graphical user

    interface is based on cross-platform Qt framework.

    RapidMiner

    RapidMiner, formerly called YALE (Yet Another Learning

    Environment), is an environment for machine learning and data mining experimentsthat is utilized for both research and real-world data mining tasks. It enablesexperiments to be made up of a huge number of arbitrarily nestable operators, which

    are detailed in XML files and are made with the graphical user interface of RapidMiner.

    RapidMiner provides more than 500 operators for all main machine learningprocedures, and it also combines learning schemes and attribute evaluators of the

    Weka learning environment. It is available as a stand-alone tool for data analysis and

    as a data-mining engine that can be integrated into your own products.

    Weka

    Written in Java, Weka (Waikato Environment forKnowledge Analysis) is a well-known suite of machine learning software that supports

    several typical data mining tasks, particularly data preprocessing, clustering,classification, regression, visualization, and feature selection. Its techniques are basedon the hypothesis that the data is available as a single flat file or relation, where each

    data point is labeled by a fixed number of attributes. Weka provides access to SQL

    databases utilizing Java Database Connectivity and can process the result returned bya database query. Its main user interface is the Explorer, but the same functionalitycan be accessed from the command line or through the component-based Knowledge

    Flow interface.

    JHepWork

    http://rapidminer.com/http://rapidminer.com/http://www.cs.waikato.ac.nz/~ml/weka/http://www.cs.waikato.ac.nz/~ml/weka/http://2.bp.blogspot.com/_UqUwVPikChs/TO4rOpDvkVI/AAAAAAAAPrI/B7HFz7N6CE4/s1600/data-mining-software-weka.jpghttp://4.bp.blogspot.com/_UqUwVPikChs/TO4rN_g8AYI/AAAAAAAAPrA/5ZtyCxk8F04/s1600/data-mining-software-rapidminer.jpghttp://2.bp.blogspot.com/_UqUwVPikChs/TO4rOpDvkVI/AAAAAAAAPrI/B7HFz7N6CE4/s1600/data-mining-software-weka.jpghttp://4.bp.blogspot.com/_UqUwVPikChs/TO4rN_g8AYI/AAAAAAAAPrA/5ZtyCxk8F04/s1600/data-mining-software-rapidminer.jpghttp://www.cs.waikato.ac.nz/~ml/weka/http://rapidminer.com/
  • 7/29/2019 5 of the Best Free and Open Source Data Mining Software

    3/3

    Designed for scientists, engineers and students,jHepWorkis a freeand open-source data-analysis framework that is created as an attempt to make a

    data-analysis environment using open-source packages with a comprehensible user

    interface and to create a tool competitive to commercial programs. It is specially madefor interactive scientific plots in 2D and 3D and contains numerical scientific libraries

    implemented in Java for mathematical functions, random numbers, and other datamining algorithms. jHepWork is based on a high-level programming language Jython,

    but Java coding can also be used to call jHepWork numerical and graphical libraries.

    KNIME

    KNIME (Konstanz Information Miner) is a user friendly,

    intelligible, and comprehensive open-source data integration, processing, analysis, andexploration platform. It gives users the ability to visually create data flows or pipelines,

    selectively execute some or all analysis steps, and later study the results, models, and

    interactive views. KNIME is written in Java, and it is based on Eclipse and makes use ofits extension method to support plugins thus providing additional functionality.Through plugins, users can add modules for text, image, and time series processing

    and the integration of various other open source projects, such as R programminglanguage, Weka, the Chemistry Development Kit, and LibSVM.

    If you know of other free and open-source data mining software, please share themwith us via comment.

    http://jwork.org/jhepwork/http://jwork.org/jhepwork/http://jwork.org/jhepwork/http://www.knime.org/http://www.knime.org/http://4.bp.blogspot.com/_UqUwVPikChs/TO4rNoP5tOI/AAAAAAAAPq4/cgdxMm-9K-o/s1600/data-mining-software-KNIME.jpghttp://3.bp.blogspot.com/_UqUwVPikChs/TO4rPRkdykI/AAAAAAAAPrY/hSnA_wWEw48/s1600/data_mining_software_jhepwork.jpghttp://4.bp.blogspot.com/_UqUwVPikChs/TO4rNoP5tOI/AAAAAAAAPq4/cgdxMm-9K-o/s1600/data-mining-software-KNIME.jpghttp://3.bp.blogspot.com/_UqUwVPikChs/TO4rPRkdykI/AAAAAAAAPrY/hSnA_wWEw48/s1600/data_mining_software_jhepwork.jpghttp://www.knime.org/http://jwork.org/jhepwork/