Parlour

Parlour. a place that sells scoops of ice-cream; a cascading-sqoop integration.

Parlour provides a basic Cascading/Scalding Sqoop integration allowing export from HDFS.

It also provides for support for the Cloudera/Teradata Connector.

Third-Party Libraries

In order to get Parlour to work - you will need to include the following third-party JARs with the application that you use it in.

If you want to use them within the parlour repository - you will need to put them in lib/.

Oracle Support:

ojdbc6.jar: the Oracle JDBC Adapter

Teradata Support:

sqoop-connector-teradata-1.2c4.jar: Cloudera Connector Powered by Teradata
tdgssconfig.jar: Teradata Driver (Security configuration)
terajdbc4.jar: Teradata JDBC Adapter

Cascade Job

import au.com.cba.omnia.parlour.SqoopSyntax._

new ExportSqoopJob(
  sqoopOptions()
   .teradata(BatchInsert)
   .connectionString("jdbc:teradata://some.server/database=DB1")
   .username("some username")
   .password(System.getenv("DATABASE_PASSWORD"))
   .tableName("some table"),
  TypedPsv[String]("hdfs/path/to/data/to/export")
)(args)

Console Job

Parlour includes a sample job that can be invoked from the command-line:

hadoop jar <parlour-jar> \
    com.twitter.scalding.Tool \
    au.com.cba.omnia.parlour.ExportSqoopConsoleJob \
    --hdfs \
    --input /data/on/hdfs/to/sqoop \
    --teradata \
    --teradata-method internal.fastload \
    --teradata-internal-fastload-host-adapter myhostname1 \
    --connection-string "jdbc:teradata://database/database=test" \
    --table-name test \
    --username user1 \
    --password $PASSWORD \
    --mappers 1 \
    --input-field-delimiter \| \
    --input-line-delimiter \n

Teradata Fastload Support

Teradata Internal Fastload requires the use of a coordinating service that runs on the machine that launches the jobs.

As a result - you may need to manually specify which adapter the service should be bound to. This is done using sqoopOptions.teradata(InternalFastload, Some("myhostname")).

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
bin		bin
project		project
src		src
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
build.sbt		build.sbt
sbt		sbt
version.sbt		version.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parlour

Third-Party Libraries

Cascade Job

Console Job

Teradata Fastload Support

About

Releases

Packages

Contributors 4

License

sujeshchirackkal/parlour

Folders and files

Latest commit

History

Repository files navigation

Parlour

Third-Party Libraries

Cascade Job

Console Job

Teradata Fastload Support

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Packages