Pentaho Data Integration and Big Data
With growing volumes and varieties of data flowing at increasing speed, organizations need a fast and easy way to harness and gain insight from their big data sources. Pentaho accelerates the realization of value from big data with the most complete solution for big data analytics.
Pentaho provides the right set of tools to each user, all within a tightly coupled data integration and analytics platform that supports the entire big data lifecycle. For IT and developers, Pentaho provides a complete, visual design environment to simplify and accelerate data preparation and modeling. For business users, Pentaho provides visualization and exploration of data. And for data analysts and scientists, Pentaho provides full data discovery, exploration and predictive analytics.
Using a combination of instructor-led presentations and hands-on exercises, this course provides an overview of the big data capabilities within Pentaho Data Integration, including visualization tools. This course helps prepare you for the Pentaho Data Integration Certification Exam.Back to Courses
This course is the third course in the Database Developer path. PDI4000 is an intermediate course and is intended for students experienced in both PDI and big data. Students who need a comprehensive overview of big data tools and technologies should take course PDI3000: Pentaho Big Data Fundamentals
|Online||English||Pentaho||December 18, 2013 - 10:00 AM EST||Register Now|
At the completion of this course, you should be able to:
- Use Pentaho Data Integration (and Pentaho MapReduce) to manipulate big data
- Orchestrate big data jobs in Pentaho Data Integration
- Visualize big data using Pentaho InstaView
Before taking this class, students should complete course PDI2000: Pentaho Data Integration I or have equivalent field experience with Pentaho Data Integration. Big data knowledge is also required. This course does not present an overview of the various big data tools and technologies. Some basic knowledge of the Linux operating system (CentOS) is required.
Online courses require a broadband Internet connection, the use of a modern Web browser (such as Microsoft Internet Explorer or Mozilla Firefox), and the ability to connect to the WebEx Training Center. For more information on WebEx Training Center requirements, see www.webex.com. Online courses use Pentaho’s cloud-based exercise environment. Students are provided access to a virtual machine used to complete the exercises.
For online courses, students are provided with a secured, electronic course manual. Printed manuals are not provided for online courses. When an electronic manual is provided, students are encouraged to print the exercise book before class begins, though this is not required.
Students attending this course on-site should contact their Customer Success Manager for hardware and software requirements. You can also email us at email@example.com for more information regarding on-site training requirements.
This course is still under development. A comprehensive agenda (by day) will be available after the course pilot is complete. The The following agenda is meant to provide a general overview of the course structure. It is subject to change until the course pilot is complete.
- Overview of big data
- Big data capabilities in Pentaho Data Integration
- Working with Pentaho MapReduce
- Big data job orchestration
- Visualizing big data