I’ve going to be giving a talk on big data for the newly formed Nevada County Tech Talk event – a monthly gathering at Sierra Commons.
Unfortunately most of the relevant content I’ve got is for Java programmers interested in using Hadoop. Things I could talk about, based on personal experience:
- A 600M page web crawl using Bixo.
- Using LibSVM to predict medications from problems.
- Using Mahout’s kmeans clustering algorithm on pages referenced from tweets (the unfinished Fokii service).
I’m looking for relevant talks that I can borrow from, but I haven’t found much that’s targeted at the technically minded-but-not-a-programmer crowd.
Comments with pointers to useful talks/presentations would be great!