SysML08 Technical Sessions

TECHNICAL SESSIONS

Session papers are available to workshop registrants immediately and to everyone beginning December 11, 2008.

Thursday, December 11, 2008

8:30 a.m.–9:00 a.m. Continental Breakfast

9:00 a.m.–9:10 a.m.

Opening Remarks

Program Co-Chairs: Armando Fox, University of California, Berkeley; Sumit Basu, Microsoft Research

9:10 a.m.–10:00 a.m.

Invited Talk

Patterns (and Anti-Patterns) for Developing Machine Learning Systems
Gordon Rios, Chief Technology Officer, Zvents, Inc.

View the presentation slides

Design patterns are a staple in the software engineering field and have helped developers capture and reuse successful approaches to common problems. As the field of machine learning continues to mature, it may be helpful to discuss some of the implementation patterns that can lead to successful deployments. We'll also touch on some anti-patterns, a much more controversial topic, where a seemingly obvious approach can lead to less successful outcomes. Rather than a manifesto or final word, my intent is to provide a starting point for the conversation.

Bio: Gordon Rios is a proven leader in search and search-related technologies. Over the past 10 years he has focused his efforts on devising machine learning systems for Web search, document classification, and text mining. At Inktomi he developed the scoring engine for a user click-based ranking application that was deployed at HotBot and Snap; was a major contributor to the Directory Engine product, which classified millions of Web documents into over 10,000 categories; and was a founding member of Inktomi's Web Search Relevance Group. Moving to Proofpoint, an enterprise messaging company, he led the development of the award-winning MLX™ technology used in their industry-leading anti-spam product. Most recently, Gordon worked at Yahoo! on the International Relevance team, where he worked on all aspects of search engine relevance and developed production code running in all major international markets. Gordon holds an MBA in Finance and a Master's Degree in Engineering and Computer Science from the University of California, Berkeley.

10:00 a.m.–10:30 a.m. Break

10:30 a.m.–noon

Oral Session 1

Finding Similar Failures Using Callstack Similarity
Kevin Bartz, Harvard University; Jack W. Stokes and John C. Platt, Microsoft Research, Redmond; Ryan Kivett, David Grant, Silviu Calinoiu and Gretchen Loihle, Microsoft Corporation, Redmond

Paper in HTML | PDF

Empirical Comparison of Techniques for Automated Failure Diagnosis
Songyun Duan and Shivnath Babu, Duke University

Paper in HTML | PDF

HiLighter: Automatically Building Robust Signatures of Performance Behavior for Small- and Large-Scale Systems
Peter Bodík, University of California, Berkeley; Moises Goldszmidt, Microsoft Research, Silicon Valley; Armando Fox, University of California, Berkeley

Paper in HTML | PDF

Noon–1:00 p.m. Workshop Luncheon

1:00 p.m.–2:00 p.m.

Oral Session 2

Mining Console Logs for Large-Scale System Problem Detection
Wei Xu, University of California, Berkeley; Ling Huang, Intel Research Berkeley; Armando Fox, David Patterson, and Michael Jordan, University of California, Berkeley

Paper in HTML | PDF

An Internet Protocol Address Clustering Algorithm
Robert Beverly and Karen Sollins, MIT CSAIL

Paper in HTML | PDF

2:00 p.m.–3:30 p.m.

Poster Session (with refreshments)

Reinforcement Learning for Optimizing Flash Memory Cache Management
Byung Kon Kang, Kee-Eung Kim, Wook Jung, and Jin-Soo Kim, KAIST

Abstract

Predictive System Health Monitoring for Large-Scale Hosting Infrastructures
Xiaohui Gu, North Carolina State University; Haixun Wang, IBM T.J. Watson Research

Abstract

Ganesha: Black-Box Fault Diagnosis for MapReduce Systems
Xinghao Pan, Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, and Priya Narasimhan, Carnegie Mellon University

Abstract

Using Machine Learning to Auto-tune a Stencil Code on a Multicore Architecture
Archana Ganapathi, Kaushik Datta, Armando Fox, and David Patterson, University of California, Berkeley

Abstract

Generalization of Machine Learning Based Encrypted Traffic Identification: A Comparison of Two Classifiers
Riyad Alshammari and Nur Zincir-Heywood, Dalhousie University

Abstract

Kudzu: A Self-Balancing P2P File Transfer System
Sean Barker, Marius Catalin Iordan, and Jeannie Albrecht, Williams College; Barath Raghavan, University of California, San Diego

Abstract

3:30 p.m.–4:30 p.m.

Oral Session 3

Probabilistic Inference in Queueing Networks
Charles Sutton and Michael I. Jordan, University of California, Berkeley

Paper in HTML | PDF

From Optimization to Regret Minimization and Back Again
Ioannis Avramopoulos, Jennifer Rexford, and Robert Schapire, Deutsche Telekom Laboratories and Princeton University

Paper in HTML | PDF

4:30 p.m.–5:00 p.m.

Wrap-up and Discussion

Need help? Use our Contacts page.

Last changed: 17 Dec. 2008 ch