September 2011
1 post
Elevating your SNR →
Got a nice mention on this post for my CHUG presentation a while back. Since then we’ve actually ditched the linux-ha setup. Turns out if you buy hardware that is halfway decent, the chances of failure pretty low. If you keep your NN metadata on NFS storage you can get up and running quickly again someplace else. Best thing to do is to use a service DNS CNAME for your NN and JT services...
January 2011
2 posts
An Experiment in ipv6
So World ipv6 Day is coming June 8, 2011. Should I be stocking water and canned goods in the basement like the doomsayers in 2000 did?
What would it take to actually get ready? Well, turns out for most folks, nothing. This is because the companies we pay each month (Comcast/RCN/AT&T) for our internet hookup aren’t providing a ipv6 pipe to my house. No big deal.
But in trying to...
iPad fun
One month this last summer I started a skunkworks project (among other things) to enhance the Leon Levy Expeditation to Ashkelon’s use of technology by using an iPad as a data terminal. The dig had already done away with the data entry of old with all data going back to a database at the University of Chicago for several years now. You can’t imagine the amount of detail they collect...
September 2010
1 post
CHUG Presentation
Last night I gave a talk at the Chicago Hadoop Users Group on using Hadoop to help Orbitz collect and search large volumes of application logs.
Slides are posted if you are interested.
I think there is some useful information in there including:
High Availability Name Node/Job Tracker Configuration
Mostly “real time” log collection into HDFS (current and future direction)
Creating...
August 2010
2 posts
UDT - Now in Java flavor
As a developer, when you want to have one machine talk with another (and you want your packets delivered reliably and in-order) you open a TCP connection. Ah, TCP, that tried and true communication mechanism that probably drives most of the Internet today.
While speeds have come a long way, packet loss and latency are two very real factors that can really eat into your TCP throughput.
Packet...
A Blog By Any Other Name
Well, I finally broke down and created a personal blog. While I’ll sometimes post useless information or observations to the Group Inanity Blog or to my Twitter feed, it seems that I needed a place to put things of a less inane nature.
So 14 years after signing up for the goofy.net domain (so I could have a vanity/constant email address) there is finally some content here that...