
{ Author Archives }

Tech Talk: Tom Hughes-Croucher (Joyent) — “Node.js at Scale”

Node.js at Scale Tom Hughes-Croucher (Joyent) Thursday, August 4, 2011 ABSTRACT When we talk about performance what do we mean? There are many metrics that matter in different scenarios but it’s difficult to measure them all. Tom Hughes-Croucher looks at what performance is achievable with Node today, which metrics matter and how to pick the [...]


Tech Talk: Neha Narkhede (LinkedIn) — Kafka, LinkedIn’s open-source distributed pub-sub messaging system

Kafka Neha Narkhede (LinkedIn) Wednesday, July 27, 2011 ABSTRACT Kafka is a distributed publish-subscribe messaging system aimed at providing a scalable, high-throughput, low-latency solution for log aggregation and activity stream processing for LinkedIn. Built on Apache ZooKeeper in Scala, Kafka aims at providing a unified stream for both real-time and offline consumption. We provide [...]


Tech Talk: Rajat Paharia (Bunchball) — “Game Dynamics”

Game Dynamics Rajat Paharia (Bunchball) Monday, July 25th, 2011 ABSTRACT Status, achievement, reward, competition, self-expression: by addressing these fundamental human needs and desires, designers can make experiences both compelling and satisfying. Game designers, in particular, have known for years how to incent and motivate players by addressing these needs through the use of mechanics like [...]


Tech Talk: Michael Stack (StumbleUpon) — “State of HBase”

State of HBase Michael Stack (StumbleUpon) Monday, July 18th, 2011 ABSTRACT Attendees will learn about the current state of the HBase project. We’ll review what the community is contributing, some of the more interesting production installs, killer apps on HBase, the on-again, off-again HBase+HDFS love affair, and what the near-future promises. A familiarity with BigTable [...]


Tech Talk: Andy Twigg (Acunu) — Stratified B-Tree and Versioned Dictionaries

Stratified B-Tree and Versioned Dictionaries Andy Twigg (Acunu) Monday, June 20, 2011 ABSTRACT A classic versioned data structure in storage and computer science is the copy-on-write (CoW) B-tree — it underlies many of today’s file systems and databases, including WAFL, ZFS, Btrfs and more. Unfortunately, it doesn’t inherit the B-tree’s optimality properties; it has poor [...]


Tech Talk: Michael Deerkoski (Flickr) — “Continuous Deployment at Flickr”

Continuous Deployment at Flickr Michael Deerkoski (Flickr) Wednesday, May 25, 2011 ABSTRACT Flickr is almost certainly the best online photo management and sharing application in the world. The small, efficient development team uses a process called Continuous Deployment. There are several technical tools in place to make this happen, but the most important aspect to [...]


Tech Talk: Anil Madan (eBay) — “Hadoop at eBay”

Hadoop at eBay Anil Madan (eBay) Monday, May 23, 2011 ABSTRACT The talk will illustrate how Hadoop has become a critical centerpiece of infrastructure for eBay, running on thousands of servers. I will also discuss how it fuels our derived data pipeline which in turn affects just about all our services. Attendees will understand [...]


Tech Talk: Xavier Amatriain (Telefonica) — “The Science and Magic of User and Expert Feedback for Improving Recommendations”

The Science and Magic of User and Expert Feedback for Improving Recommendations Dr. Xavier Amatriain (Telefonica) Thursday, March 3, 2011 ABSTRACT Recommender systems are playing a key role in the next web revolution as a practical alternative to traditional search for information access and filtering. Most of these systems use Collaborative Filtering techniques in which [...]


Tech Talk: Chris Douglas (Yahoo!) — “Next Generation Hadoop MapReduce”

Next Generation Hadoop MapReduce Chris Douglas (Yahoo!) Monday, March 7, 2011 ABSTRACT The Apache Hadoop MapReduce framework has hit a scalability limit around 4,000 machines. We are developing the next generation of Apache Hadoop MapReduce that factors the framework into a generic resource scheduler and a per-job, user-defined component that manages the application execution. Since [...]


Tech Talk: Matei Zaharia (UC Berkeley) — “Spark: A Framework for Iterative and Interactive Cluster Computing”

Spark: A Framework for Iterative and Interactive Cluster Computing Matei Zaharia (UC Berkeley) Tuesday, February 8, 2011 ABSTRACT Although the MapReduce programming model has been highly successful, it is not suitable for all applications. We present Spark, a framework optimized for one such type of application – iterative jobs where a dataset is reused across [...]
