Important Dates

by April 15, 2012

Meeting May 8-9, 2012



Contact Info:

For questions please contact:

Chaitan Baru


arrow CLDS
arrow SDSC


NSF - National Science FoundationMellanox TechnologiesSe
agate Brocade



Mon, May 7, 2012
6:30pm 9:00pm Reception, Hyatt House (hosted by Brocade)
Tues, May 8, 2012
8:00am 8:30am Breakfast, Brocade Executive Briefing Center (EBC)
8:30am 8:50am Introduction to WDBD2012, Chaitan Baru, CLDS/SDSC
8:50am 8:55am Welcome to Brocade Exec Briefing Center, Scott Pearson, Brocade
8:55am 9:10am Designing Benchmarks, Meikel Poess, Oracle
9:10am 9:25am The Benchmark Auditing Process: Lessons Learned, Francois Raab, InfoSizing
9:25am 9:40am Hadoop Benchmarking, Owen O'Malley, Hortonworks
9:40am 10:55am 5-minute Lightning Talks Data Genres. Chair: Milind Bhandarkar
    1) About Showers and Streams: Benchmarking Big Event Data, Hans-Arno Jacobsen, MSRG, U Toronto
    2) Big Data Benchmarking - Data Model Proposal, Ahmad Ghazal, Teradata Corporation
    3) Big Data Benchmarking with High Level Tasks, Ted Dunning, MapR Technologies
    4) Benchmarking the "Now", Aleksander Kolcz, Twitter
    5) Facilitating Large-scale Analysis of Scholarly Archives, Beth Plale, Indiana U
    6) Data Hot Spots, Chaitan Baru, UC San Diego
    7) Data Intensive Research and Explorative Data Analysis, Stefan Manegold, CWI
10:55am 11:10 Break
11:10am 12:30pm 5-minute Lightning Talks Benchmark Properties. Chair: Tilmann Rabl
    1) Hadepot: A Repository of Big-Data Applications, Magdalena Balazinska, U Washington
    2) Five Characteristics of a Successful Benchmark, Andrew Bond, Red Hat, Inc.
    3) Salient Features for a BigData Benchmark, Dhruba Borthakur, Facebook
    4) We Don't Know Enough to Make a Big Data Benchmark Suite: An Academia-Industry View, Yanpei Chen, UC Berkeley/Cloudera
    5) Creating an Effective Benchmark Suite for Big Data, John Galloway, Actian
    6) Benchmarking Robust Performance, Goetz Graefe, Hewlett-Packard Laboratories
    7) The Need for Standard Benchmarks for Big Data, Reza Taheri, VMWare
    8) Benchmarking Abstractions, Len Wyatt, Microsoft
    9) Data Science Workloads for Big Data Benchmarking, Milind Bhandarkar, Greenplum
1:00pm 2:00pm Lunch
    Talk: Networking Technologies for Big Data. PG Menon, Director, Solutions Architecture, Office of the CTO Brocade. Chair: Scott Pearson
2:00pm 3:30pm Parallel Discussion Session 1 on Data Genres
    Session 1a: Auditorium. Discussion led by M. Bhandarkar, T. Rabl
    Session 1b: Breakout room. Discussion led by C. Baru, M. Poess
3:30pm 4:00pm Break
3:30pm 5:30pm Parallel Discussion Session 2 on Benchmark Properties
    Session 2a: Auditorium. Discussion led by M. Bhandarkar, T. Rabl
    Session 2b: Breakout room. Discussion led by C. Baru, M. Poess
6:00pm 8:00pm Reception/Dinner (at Brocade EBC)
Wed, May 9, 2012
8:00am 8:30am Breakfast, Brocade Executive Briefing Center (EBC)
8:30am 8:45am Opening Notes, Chaitan Baru, CLDS/SDSC
8:45am 9:05am Big Genomic Data, Nicholas Schork, The Scripps Research Institute, TSRI
9:05am 9:30am Big Geospatial Data, Shashi Shekhar, U Minnesota
9:30am 9:45am Generating Big Data, Tilmann Rabl, MSRG / U Toronto
9:45am 11:00am 5-minute Lightning Talks Benchmarking Process. Chair: Meikel Poess
    1) Benchmarking Heterogeneous Graph Data, Amarnath Gupta, SDSC
    2) Big Data Benchmark Repository, Andries Engelbrecht, Hewlett-Packard
    3) Requirements for Meaningful Big Data Benchmarking, Michael Carey, UC Irvine
    4) Benchmarking Big Data in the Cloud, Dan Koren, Actian Corporation
    5) Big Data Benchmarking for Lustre File Systems, Dan Ferber, Whamcloud Inc.
    6) Benchmarking Work Group (BWG) under OpenSFS, Richard Vanderbilt, NetApp/OpenSFS
    7) Benchmarking Infrastructure for Big Data, Stephen Daniel, NetApp
    8) SNIA Activities in Big Data and Big Data Benchmarking, Alan Yoder, SNIA Technical Council
    9) Hadoop Benchmarking from a SAS Perspective, Paul Kent, SAS
    10) Big Data Benchmarking, Serge Mankovski, CA Technologies
    11) Towards an Industry Standard for Benchmarking Big Data Workloads, Raghu Nambiar, Cisco
11:00am 11:15am Break
11:15am 12:30pm 5-minute Lightning Talks Software/Hardware. Chair: Raghu Nambiar
    1) Crowbar: Deploying Big Data Benchmark Configurations, Nicholas Wakou, Dell
    2) Customizing Servers for Emerging Scale-Out Workloads Using CloudSuite, Onur Kocberber, EPFL
    3) High Performance Computing Networks for Apache Hadoop, Tong Liu, Mellanox
    4) HiBench: A Representative and Comprehensive Hadoop Benchmark Suite, Bhaskar Gowda, Intel
    5) In-Production Benchmarks for Distributed Large-scale Data Processing, Jerry Zhao, Google Inc.
    6) Developing System Performance Metrics for Cloud Computing Based on Hadoop, Gabriele Jost, AMD
    7) What Can Ethernet Provide for Big Data Systems? Ideas for Getting More Out of an Ethernet Fabric, Casey Miles, Brocade
    8) Voldemort on Solid State Drives, Vinoth Chandar, LinkedIn
    9) Big Data Benchmarks, Mark Kelly, Convey Computers
    10) Big Data Benchmarking for Yahoo!'s use of Hadoop Map-Reduce, Srigurunath Chakravarthi, Yahoo! Inc.
12:30pm 1:30pm Lunch
1:30pm 3:00pm Parallel Discussion Session 3 on the Benchmarking Process
    Session 3a: Auditorium. Discussion led by M. Poess, T. Rabl
    Session 3b: Breakout Room. Discussion led by C. Baru, M. Bhandarkar
3:00pm 3:30pm Break
3:30pm 5:00pm Wrap-up and next steps (Plenary)