Software | Kaggle


Tools Used By Competitors

From the Kagglers' Favorite Tools blog post, for which we surveyed our competitors to find their favourite tools.


How I Did It Archive
The teams that win our competitions regularly post on our blog, giving an overview of how they won and the tools they used.

Free Software

R

R is a popular language and environment for statistical computing and graphics.

Official Site
Download

Apeks.io

Apeks.io is a machine learning tool for data classification and prediction. It offers a large collection of machine learning algorithms.

Official Site

Weka

Weka is a collection of machine learning algorithms for data mining tasks, in Java.

Official Site
Download

Cascading

Cascading is an Apache-licensed software abstraction layer for Apache Hadoop, used to create complex workloads and queries.

  • Pattern - run models directly on Hadoop from PMML exports, or build complex custom ensembles via the Java API.
  • Lingual - run ANSI SQL queries on Hadoop, either through popular SQL clients via the JDBC driver, via an API for complex workloads and queries, or by mixing SQL with PMML in a single application. Works well with R as a client.

Official Site
Download

Apache Mahout

An Apache-licensed, Java- and Hadoop-based scalable machine learning library.

Official Site
Download

PredictionIO

An open source, scalable machine learning server for programmers and data engineers to build smart software. It is algorithm-agnostic and has built-in support for Apache Mahout.

Official Site
Download

Octave

GNU Octave is a high-level language, primarily intended for numerical computations, also known as "Free MATLAB".

Official Site
Download

LibFM

A Factorization Machine Library by Steffen Rendle, winner of the Grockit competition.

Official Site

XGBoost

An optimized, general-purpose gradient boosting library, parallelized using OpenMP. It implements machine learning algorithms under the gradient boosting framework, including generalized linear models and gradient boosted regression trees. It supports various objective functions, including regression, classification and ranking. The package is also designed to be extensible, so users can easily define their own objectives. Besides the standalone console version, you can use XGBoost from Python, R and Julia.

Official Site
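
The user-defined objectives mentioned for XGBoost can be illustrated without the library itself. In gradient boosting, an objective is commonly described by the first and second derivatives of the loss with respect to the current predictions. The following is a minimal sketch in plain Python, assuming a logistic loss on raw margin scores. The function name `logistic_objective` is hypothetical, and this is not XGBoost's own code.

```python
import math


def logistic_objective(preds, labels):
    """Return per-example gradient and Hessian of the logistic loss.

    preds  : raw margin scores (before the sigmoid)
    labels : 0/1 targets
    Hypothetical helper for illustration; not part of any library.
    """
    grad, hess = [], []
    for margin, y in zip(preds, labels):
        p = 1.0 / (1.0 + math.exp(-margin))  # predicted probability
        grad.append(p - y)                   # first derivative of the loss
        hess.append(p * (1.0 - p))           # second derivative of the loss
    return grad, hess


grad, hess = logistic_objective([0.0, 2.0], [1, 0])
```

A function of roughly this shape is what a boosting library needs in order to fit each new tree. Check the XGBoost documentation for the exact signature its custom-objective hook expects.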

GraphLab

A parallel framework for machine learning that includes many collaborative filtering algorithms.

Official Site
Download

MyMediaLite

MyMediaLite is a lightweight, multi-purpose library of recommender system algorithms.

Official Site
Download

Myrrix

A scalable real-time recommender engine platform, evolved from Apache Mahout. The single-machine Serving Layer is free and open source.

Official Site
Download

Mersenne Twister

The pseudo-random generator with the coolest-sounding name, and that's why you should use it (in addition to its other redeeming qualities).

Official Site
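
Python's standard `random` module uses the Mersenne Twister as its core generator, so it offers a quick way to see one of those redeeming qualities: a seeded generator yields a reproducible sequence. A minimal sketch, where the seed values and the helper name `draw` are arbitrary choices made for illustration:

```python
import random


def draw(seed, n=5):
    """Return n pseudo-random floats from a generator seeded with `seed`."""
    rng = random.Random(seed)  # independent Mersenne Twister instance
    return [rng.random() for _ in range(n)]


first = draw(42)
second = draw(42)
other = draw(43)
print(first == second)  # same seed, same sequence
```

Using a separate `random.Random` instance per experiment, rather than the module-level functions, keeps one component's draws from shifting another's sequence.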

SciLua

SciLua is a framework for general-purpose scientific computing based on LuaJIT. It includes vector/matrix algebra, random number generators and distributions, root finding and optimisation algorithms, automatic differentiation, and more. It also includes a module to interface with R.

Official Site

Torch7

Torch7 is a scientific computing framework with wide support for machine learning algorithms, similar to MATLAB/Octave but focused on artificial neural networks (ANNs).

Official Site

APRIL-ANN

APRIL-ANN (A Pattern Recognizer In Lua with Artificial Neural Networks) is a recent open source tool, still in development, for training ANNs and other machine learning models for a wide range of pattern recognition tasks.

Official Site

HyperOpt

A hyperparameter optimization framework implemented in Python. Useful for tuning hyperparameters such as learning rate, momentum, and hidden layer sizes.

Official Site
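
To make the idea of hyperparameter optimization concrete, here is a minimal sketch of the simplest strategy, random search, in plain Python. It does not use HyperOpt's API. The function `toy_loss` is a made-up stand-in for a real validation loss, and the search ranges are assumptions chosen for illustration.

```python
import random


def toy_loss(learning_rate, momentum):
    """Stand-in for a validation loss; smallest at lr=0.1, momentum=0.9."""
    return (learning_rate - 0.1) ** 2 + (momentum - 0.9) ** 2


def random_search(n_trials=200, seed=0):
    """Sample hyperparameters at random and keep the best trial."""
    rng = random.Random(seed)
    best = None
    for _ in range(n_trials):
        params = {
            # learning rate sampled log-uniformly between 1e-4 and 1
            "learning_rate": 10 ** rng.uniform(-4, 0),
            "momentum": rng.uniform(0.0, 1.0),
        }
        loss = toy_loss(**params)
        if best is None or loss < best[0]:
            best = (loss, params)
    return best


best_loss, best_params = random_search()
```

Dedicated frameworks replace the random sampling step with smarter proposals based on earlier trials, but the loop of "propose, evaluate, keep the best" stays the same.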

Commercial Software

Alpine Data Labs

Alpine is a visual, collaborative environment for building end-to-end workflows (data mining, exploratory analysis, modeling, and scoring), with support for many databases and Hadoop.

Official Site

MATLAB

MATLAB (matrix laboratory) is a numerical computing environment and programming language.

Official Site

Mathematica

Official Site

Neural Designer

Neural Designer is a professional application for predictive analytics that transforms raw data into useful knowledge through trained neural networks.

Official Site

SAS

Official Site

SPSS

Official Site

Portrait Software

Official Site

Microsoft Excel

Official Site

Skytree Server Free Edition

A machine learning and advanced analytics engine, designed to process massive datasets accurately and at high speed.

Official Site

General Purpose Programming Languages

R

Python

C++

Java

C

Julia

Lua

Last Updated: 2014-10-22 06:49 by sergiointelnics
