Skip to content

Latest commit

 

History

History
41 lines (32 loc) · 668 Bytes

README.md

File metadata and controls

41 lines (32 loc) · 668 Bytes

EasyMapReduce

Write Hadoop MapReduce in high-level API. Inspired by Apache Spark

Word Count

  Void main() {
    init
    ds := c.load(input).map{ it.split(' ').join("\n") }.reduce(0)|Int r,v|{ r + 1 }
    ds.save(output)
  }

Run

cd demo
fan wordCount.fan

Config

set 'HADOOP_HOME' env vars

Debug

Running in a local simulation env.

fan wordCount.fan -debug

How it works

Show more detail by '-debug_show' args.

fan wordCount.fan -debug_show

Passing Hadoop Args

fan wordCount.fan -hadoop '-files ./dict,./data'

More Hadoop Streaming Args