Evaluating Apache Hadoop Software for Big Data ETL Functions

September 9th, 2014 |
Image for FaceBook
Evaluating Apache Hadoop Software for Big Data ETL Functions
Download PDFRead/Download White Paper (PDF)
For more information on Intel IT Best Practices, please visit www.intel.com/IT

IT Best Practices: Intel IT recently evaluated Apache Hadoop software for ETL (extract, transform, and load) functions. We first studied industry sources to learn the advantages and disadvantages of using Hadoop for big data ETL functions. We then tested what we learned with a real business use case that involved analyzing system logs as well as a cost comparison of Hadoop and our third-party ETL tool.

We determined that using Hadoop for ETL functions works well for datasets that are coming from, passing through, or resting in Hadoop. Specifically, Hadoop makes sense for simple extract and load operations performed on those datasets.


Posted in: Big Data, Information Technology, Intel, Intel IT, IT White Papers, IT@Intel
Tags: , , , , , , , ,

Learn more about IT Business Value from Paula Tolliver, Intel CIO on:
Intel.com, LinkedIn, or follow her on Twitter.