Apache Mahout

Developer(s)	Apache Software Foundation
Initial release	7 April 2009[1]
Stable release	0.14.0 / 6 March 2019[2]
Repository	Mahout Repository
Written in	Java, Scala
Operating system	Cross-platform
Type	Machine learning
License	Apache License 2.0
Website	mahout.apache.org

Apache Mahout is a project of the Apache Software Foundation to produce free implementations of distributed or otherwise scalable machine learning algorithms focused primarily on linear algebra. In the past, many of the implementations used the Apache Hadoop platform; today the project focuses primarily on Apache Spark.[3][4] Mahout also provides Java/Scala libraries for common maths operations (focused on linear algebra and statistics) and primitive Java collections. Mahout is a work in progress; a number of algorithms have been implemented.[5]

While Mahout's core algorithms for clustering, classification and batch-based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm, the project does not restrict contributions to Hadoop-based implementations. Contributions that run on a single node or on a non-Hadoop cluster are also welcome. For example, the 'Taste' collaborative-filtering recommender component of Mahout was originally a separate project and can run stand-alone without Hadoop.
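As an illustration of the map/reduce paradigm applied to batch collaborative filtering, the following is a minimal sketch in plain Java that counts how often two items appear together in users' preference lists. It does not use Hadoop or any Mahout API; the class name `CooccurrenceSketch`, the method names, and the sample data are all hypothetical, and Java streams stand in for the distributed map and reduce phases.

```java
import java.util.Collection;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical illustration only: not Mahout's or Hadoop's API.
class CooccurrenceSketch {

    // "Map" phase: for one user's item list, emit a key "a,b"
    // for every ordered pair of distinct items the user has.
    static Stream<String> mapPairs(List<String> items) {
        return items.stream().flatMap(a ->
                items.stream()
                     .filter(b -> !a.equals(b))
                     .map(b -> a + "," + b));
    }

    // "Reduce" phase: group identical pair keys and count them.
    // A TreeMap keeps the keys sorted so the result is deterministic.
    static Map<String, Long> cooccurrences(Collection<List<String>> userItems) {
        return userItems.stream()
                .flatMap(CooccurrenceSketch::mapPairs)
                .collect(Collectors.groupingBy(
                        pair -> pair, TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        // Made-up preference data: user -> items the user interacted with.
        Map<String, List<String>> prefs = new LinkedHashMap<>();
        prefs.put("alice", List.of("book", "film", "song"));
        prefs.put("bob", List.of("book", "film"));
        prefs.put("carol", List.of("film", "song"));

        System.out.println(cooccurrences(prefs.values()));
    }
}
```

With the sample data, the pair `book,film` is counted twice (once for alice, once for bob). A recommender could rank candidate items for a user by summing such counts over the items the user already has; in a real cluster the map and reduce steps would run on separate workers over partitions of the preference data.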

Starting with release 0.10.0, the project shifted its focus to building a backend-independent programming environment, code-named "Samsara".[6][7][8] The environment consists of an algebraic backend-independent optimizer and an algebraic Scala DSL unifying in-memory and distributed algebraic operators. Supported algebraic platforms are Apache Spark, H2O, and Apache Flink. Support for MapReduce algorithms started being gradually phased out in 2014.[9]
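The idea that one algebraic expression can be evaluated either in memory or over partitioned data can be sketched in plain Java. The example below computes the same product, the transpose of A multiplied by A, in two ways: a direct triple loop, and a row-by-row plan in which each row contributes an outer product and the partial results are summed. This is not the Samsara DSL or Mahout's optimizer; `GramianSketch` and its methods are hypothetical names, and a parallel stream merely stands in for a distributed backend.

```java
import java.util.Arrays;

// Hypothetical illustration only: not the Samsara DSL or any Mahout API.
class GramianSketch {

    // In-memory plan: (A^T A)[i][j] = sum over rows k of A[k][i] * A[k][j].
    static double[][] gramianDirect(double[][] a) {
        int n = a[0].length;
        double[][] g = new double[n][n];
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < n; j++) {
                for (double[] row : a) {
                    g[i][j] += row[i] * row[j];
                }
            }
        }
        return g;
    }

    // Row-partitioned plan: A^T A equals the sum of the outer products
    // of the rows of A, so rows can be processed independently and the
    // partial matrices combined in any grouping.
    static double[][] gramianByRows(double[][] a) {
        int n = a[0].length;
        return Arrays.stream(a)
                .parallel()
                .map(GramianSketch::outer)
                .reduce(new double[n][n], GramianSketch::add);
    }

    // Outer product of a row with itself: an n-by-n matrix.
    static double[][] outer(double[] row) {
        int n = row.length;
        double[][] o = new double[n][n];
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < n; j++) {
                o[i][j] = row[i] * row[j];
            }
        }
        return o;
    }

    // Element-wise sum that returns a new matrix and leaves its inputs
    // unchanged, which keeps the parallel reduction safe.
    static double[][] add(double[][] x, double[][] y) {
        int n = x.length;
        double[][] s = new double[n][n];
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < n; j++) {
                s[i][j] = x[i][j] + y[i][j];
            }
        }
        return s;
    }

    public static void main(String[] args) {
        double[][] a = {{1, 2}, {3, 4}, {5, 6}};
        System.out.println(Arrays.deepToString(gramianDirect(a)));
        System.out.println(Arrays.deepToString(gramianByRows(a)));
    }
}
```

Both methods return the same matrix for the small integer-valued input above. With general floating-point data the two plans may differ in the last digits, because the row-partitioned sum can be grouped in a different order.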

References

[1] "Apache Mahout: First release 0.1 released".
[2] "Apache Mahout: Scalable machine learning and data mining". Retrieved 6 March 2019.
[3] "Introducing Apache Mahout". ibm.com. 2011. Retrieved 13 September 2011.
[4] "InfoQ: Apache Mahout: Highly Scalable Machine Learning Algorithms". infoq.com. 2011. Retrieved 13 September 2011.
[5] "Algorithms - Apache Mahout - Apache Software Foundation". cwiki.apache.org. 2011. Retrieved 13 September 2011.
[6] "Mahout-Samsara's In-Core Linear Algebra DSL Reference".
[7] "Mahout-Samsara's Distributed Linear Algebra DSL Reference".
[8] "Mahout 0.10.x: first Mahout release as a programming environment". www.weatheringthroughtechdays.com. Archived from the original on 9 October 2016. Retrieved 29 February 2016.
[9] "MAHOUT-1510 ("Good-bye MapReduce")".

External links

Official website: mahout.apache.org

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.
