Download PDF: Apache Mahout Cookbook. A fast, fresh, developer-oriented dive into the world of Apache Mahout, by Piero Giacomelli



Synopsis

Mahout is essentially a set of Java machine learning libraries for tasks such as classification, evaluation, clustering, and pattern mining. Many good frameworks are more user-friendly and offer more algorithms for these tasks: the R community is much bigger, and in the Java world RapidMiner and Weka have been on the scene for many years. So why use Mahout instead? The truth is that none of these frameworks was designed for very large datasets. By very large datasets we mean datasets, whatever their format, whose size is on the order of a hundred million records. The power of Mahout lies in the fact that its algorithms are meant to run in a Hadoop environment. Hadoop is a general framework that lets an algorithm run in parallel on multiple machines (called nodes) using the distributed computing paradigm.


Contents

  1. Mahout is Not So Difficult!
  2. Using Sequence Files – When and Why?
  3. Integrating Mahout with an External Datasource
  4. Implementing the Naïve Bayes classifier in Mahout
  5. Stock Market Forecasting with Mahout
  6. Canopy Clustering in Mahout
  7. Spectral Clustering in Mahout
  8. K-means Clustering
  9. Soft Computing with Mahout
  10. Implementing the Genetic Algorithm in Mahout




