Fast SQL on Hadoop

0 / 5
Add to favorites
Price: $600 per person

This two-day practical workshop teaches how to efficiently analyze large volumes of data available in Hadoop cluster using SQL-like technologies.

During the course we simulate real-world scenarios. Every participant plays a role of data analyst who works for a fictional company called StreamRock (inspired by Spotify – our favourite music streaming app).

The workshop consists of practical exercises that are executed on a remote multi-node Hadoop cluster.

Day 1

  • Introduction to use-case: StreamRock
  • Introduction to Hadoop (HDFS & YARN)
  • File Formats
    • Text formats - JSON, XML, CSV
    • Row-oriented format – Apache Avro
    • Column-oriented formats – Parquet and ORC
  • Apache Hive
    • Key concepts
    • Comparison with RDBMS
    • Hive Query Language
    • Hands-on exercises
    • Hive architecture
    • Execution engines: MapReduce, Tez, Spark
    • Hands-on exercises
    • Useful features
    • Query optimisations techniques
Day 2
  • Cloudera Impala
    • Typical use-cases
    • Comparison with Hive
    • Impala architecture
    • Hands-on exercises
  • Bonus – Facebook Presto
    • Comparison with Hive and Impala
    • Presto architecture
    • Demo
  • Spark SQL
    • Introduction to Spark
    • Key features
    • Integration with Hive
    • DataFrames
    • Hands-on exercises
  • Comparing Hive, Impala, Spark SQL and Presto
    • Benchmarks
    • Practical recommendations when to use which

Other Information

Please read my blog posts and presentations:

  • Price $600 per person
  • Listing categories Big Data
  • Min/Max Participants 4/min 10/max
  • Duration 6 hours
  • Education level None, Beginner, Intermediate
  • Location Europe
  • Languages English, Polish
  • Features Classroom provided by the client, Customizations possible, Hands-on exercises included, Price negotiable, References required (client name and logo), Slides sent after the workshop, Travel and accommodation booked by the client, Travel possible

Post New Review

Your email address will not be published. Required fields are marked *