CHUG Presentation
Last night I gave a talk at the Chicago Hadoop Users Group on using Hadoop to help Orbitz collect and search large volumes of application logs.
Slides are posted if you are interested.
I think there is some useful information in there, including:
- High Availability NameNode/JobTracker Configuration
- Mostly “real time” log collection into HDFS (current and future direction)
- Creating an interactive “grep” webapp from batch-oriented MapReduce
- Lots of code & configuration details
Enjoy!