May 20, 2016
San Francisco, CA
The future is already here; it is just not evenly distributed. But it clearly shows in our 150 talks, spread across 7 conferences within a 5-day conference matrix. 50+ founders, CEOs, and CTOs are speaking.
In-depth talks from Google (BigQuery and Translate), Baidu Research, MetaMind, StitchFix (Deep Learning), Microsoft, Bloomberg, Quora, Kaggle, Dato (Machine Learning), Netflix (Recommender Systems), IBM (Watson), Facebook, ClearStory (DataViz), LinkedIn, Yahoo, H2O, Confluent, Mesosphere (Data Pipelines), Samsung, Automatic (IoT), AMPLab, Databricks, Salesforce, Workday, Cloudera (Spark), Pivotal (OSS), Zillow, Pandora, Nitro, Lucidworks, Mattermark, Credit Karma, Alpine Labs, University of California-Berkeley, Stanford University, City of San Francisco, and many others.
Only 300 tickets will be available for each day, to keep a truly intimate technical community atmosphere.
Brought to you by the organizers of SF Text, SF Scala, SF Spark, Reactive Systems,
Text By the Bay 2015, Big Data Scala 2015, and Scala By the Bay 2013-2015.
Conference News
Registration is open
Schedule is Published
Life Sciences By the Bay
All our activities depend on good health. And if we need help with it, we need to know that we know as much as humanly and machine learning-ly possible, and that we're doing everything achievable with the state of the art of medicine -- and even pushing that. Better yet, we want to do the right things for our health proactively. In many ways this is knowledge management, predictive analytics, data mining on genomes and annotated medical images, and merging streams of sensor data from medical devices, activity sensors, and historical records. We'll connect the companies in the space with foundational open-source platforms and the leading practitioners using them to build the next level of life and health understanding.
Please see the umbrella Data By the Bay description of a good talk By the Bay.
For Life Sciences By the Bay, some specific topics of interest include:
- Genomics sequencing, data pipelines required for it, algorithms and applications
- Medical devices, sensor data, stream processing and mining, alerting
- Biomedical ontologies, knowledge bases, annotation of images and results
- Deep Learning for medical imaging and sensor data
- And more!
Last year, we started with a two-day, three-track, 50-talk conference. We've put together an inspiring program centered around language, Big Data, text and images, deep learning, UI, social networks, and much more.
This year, we're running the first data grid conference sequence, with seven verticals over five days. Each day's attendance is limited to only 400 seats and it will be full. We hope you join us in May By the Bay!
Data Pipelines By the Bay
May 16, 2016
Building on Big Data Scala, this is the first conference showing end-to-end unity of Data Engineering and Data Science for big, fast, streaming data.
Text By the Bay
May 17-18, 2016 (Day 2 parallel with Democracy By the Bay and Law By the Bay)
The first applied NLP conference for the Bay Area, building on the highly-acclaimed 2015 edition: 50 talks from 50 top companies, all online at functional.tv.
Democracy By the Bay
May 18, 2016 (parallel with Law By the Bay and Text By the Bay)
NLP and Data Science with focus on politics, society, and government.
Law By the Bay
May 18, 2016 (parallel with Democracy By the Bay and Text By the Bay)
NLP and Data Science with focus on legal data and processes.
Legal search (100% recall), case-specific NLP, ambiguity analysis, etc.
AIoT By the Bay
May 19, 2016
Not everything is text. Multiple talks at Text By the Bay dealt with multi-modal data such as images with text. AI and IoT day is all about sensor data streams, images, vision, speech, music.
Life Sciences By the Bay
May 20, 2016 (parallel with Data UX By the Bay)
There are several major categories of data mining related to life and health. First, genomics -- the Bay Area leads with Spark and ADAM. Second, medical sensor and imaging data, with companies like Enlitic.
Data UX By the Bay
May 20, 2016 (parallel with Life Sciences By the Bay)
Data should be visualized, with massive datasets distilled into a clear and actionable display that calls attention to what's really important. And then UX should naturally lead to the appropriate action.
Data By the Bay – Common Thread
May 16-20, 2016
For each conference, we'll have common horizontal themes: platforms and algorithms.
Keynote Speakers

Jeremy Howard
Jeremy Howard is a serial entrepreneur, business strategist, developer, and educator. He is the CEO of Enlitic, a startup he founded to use recent advances in machine learning to transform the practice of medicine, and bring modern medical diagnostics to billions of people in the developing world for the first time. He is the youngest faculty member at Singularity University, where he teaches data science, and is also a Young Global Leader with the World Economic Forum. Previously he was the President and Chief Scientist of Kaggle, a community and competition platform for over 150,000 data scientists. Before working at Kaggle, he was the top ranked participant in data science competitions globally, in 2010 and 2011. He founded two successful Australian startups (the email provider FastMail, and the insurance pricing algorithm company Optimal Decisions Group), both of which grew internationally and were sold to large international companies. He started his career in management consulting, working at the world's most exclusive firms, including McKinsey & Co, and AT Kearney (becoming the youngest engagement manager world-wide, and building a new global practice in what is now called "Big Data"). He is also a keen student, for example developing a new system for learning Chinese, which he used to develop usable Chinese language skills in just one year. Jeremy has mentored and advised many startups, and is also an angel investor. He has contributed to a range of open source projects as a developer, and was a regular expert guest on Australia's most popular TV morning news program.
Our Sponsors
Friend Sponsors
Media Sponsors
Technology Insights and Events
Be a supporting member of San Francisco's premier Data/AI conference. We want to hear from you! Contact us for a prospectus and sponsorship agreement, or to talk about how we can help you be a contributing sponsor for the Data By The Bay conference!

The Agenda
Come to Life Sciences By the Bay well-rested and ready to meet your fellow developers. We'll have a full day of talks (keynotes, full-length, and lightning) and build a startup-centric data engineering community for the Bay Area!
Get Updates
Stay informed with the Life Sciences By the Bay conference news and event updates.
If you'd like to sponsor Life Sciences By the Bay, contact sponsors@bythebay.io
Map
Conference Schedule
Conference Tickets
You can buy tickets for two or more days of the conference as passes. Once you buy a pass, you will receive an email with instructions on how to redeem the days you want. Each day has a capacity of 400, and sales for a day will automatically be disabled once it is full. We'll mark sold-out days on the TICKETS page as soon as they become unavailable.
Currently available days: Day 1, Day 2, Day 3, Day 4, Day 5.
Pricing works as follows: regular admission is $500/day. Very Early Bird is $400/day, Early Bird is $450/day, and Late Bird is $550/day. We will only allocate 100 Very Early/Early Bird tickets for each day, since our capacity is limited and the word is only getting out. The passes are 2/3/4/5-day bundles, discounted $50 for each extra day (so the 2-day Very Early Bird Bundle is $750, the 2-day Early Bird Bundle is $850, the 2-day Regular Admission Bundle is $950, etc.). We use Stripe directly to process all payments.
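For anyone who wants to check the bundle arithmetic, here is a minimal sketch in Python. It assumes the $50 discount applies once for every day beyond the first, which is the reading that matches the three 2-day examples above; prices for 3-, 4-, and 5-day passes are extrapolated under that assumption and should be confirmed on the TICKETS page. The names `DAILY_RATE` and `pass_price` are ours, purely for illustration, and are not part of the ticketing system.

```python
# Per-day prices in USD, taken from the pricing paragraph above.
DAILY_RATE = {
    "very_early_bird": 400,
    "early_bird": 450,
    "regular": 500,
    "late_bird": 550,
}

# Assumption: $50 off for each day beyond the first in a pass.
EXTRA_DAY_DISCOUNT = 50

# The conference runs for five days, so a pass covers at most five.
MAX_DAYS = 5


def pass_price(tier: str, days: int) -> int:
    """Return the total price in USD for `days` days at the given tier.

    A single day is priced at the plain daily rate; passes of two or
    more days get the assumed $50 discount per extra day.
    """
    if tier not in DAILY_RATE:
        raise ValueError(f"unknown pricing tier: {tier!r}")
    if not 1 <= days <= MAX_DAYS:
        raise ValueError(f"days must be between 1 and {MAX_DAYS}")
    return DAILY_RATE[tier] * days - EXTRA_DAY_DISCOUNT * (days - 1)
```

Under this reading, `pass_price("very_early_bird", 2)` gives 750, `pass_price("early_bird", 2)` gives 850, and `pass_price("regular", 2)` gives 950, matching the bundle prices quoted above.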
Full-time students inquiring about discounts: please email proof of enrollment and dates of interest.