blog.goofy.net

14 Years In The Making

CHUG Presentation

Last night I gave a talk at the Chicago Hadoop Users Group on using Hadoop to help Orbitz collect and search large volumes of application logs.

Slides are posted if you are interested.

I think there is some useful information in there, including:

  • High Availability NameNode/JobTracker Configuration
  • Mostly “real time” log collection into HDFS (current and future direction)
  • Creating an interactive “grep” webapp from batch-oriented MapReduce
  • Lots of code & configuration details

Enjoy!