Skip to content

Portable, Scalable and Reliable Distributed Machine Learning, support various platforms including Hadoop YARN, MPI, etc.

License

Notifications You must be signed in to change notification settings

huangpingchun/wormhole

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Build Status Documentation Status GitHub license

Portable, scalable and reliable distributed machine learning.

Wormhole is a place where DMLC projects works together to provide scalable and reliable machine learning toolkits that can run on various platforms

Features

  • Portable:
    • Supported platforms: local machine, Apache YARN, MPI and Sungrid Engine
  • Rich support of Data Source
    • All projects can read data from HDFS, S3 or local filesystem
  • Scalable and Reliable

List of Tools

Build & Run

  • Requires a C++11 compiler (e.g.~g++ >=4.8) and git. Install them on Ubuntu

    = 13.10

sudo apt-get update && sudo apt-get install -y build-essential git
  • Type make to build all deps and tools

  • All tools can run both in a laptop and in a cluster. For example, train logisitic regression using 2 workers and one servers in local machine

tracker/dmlc_local.py -n 2 -s 1 bin/linear.dmlc learn/linear/guide/demo.conf

Support

If you are having issues, please let us know.

Contribute

  • We are actively building new tools. The source codes of all tools are available under learn/.
  • Wormhole depends on other DMLC projects, which are also under active developing
    • dmlc-core provides I/O modules and job trackers
    • rabit provides reliable BSP Allreduce communication.
    • ps-lite provides the asynchronous key-value push and pull for the parameter server framework.

About

Portable, Scalable and Reliable Distributed Machine Learning, support various platforms including Hadoop YARN, MPI, etc.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 86.7%
  • Makefile 5.6%
  • Protocol Buffer 5.4%
  • Shell 2.3%