Loading…
ApacheCon EU 2014 has ended
Register Now for ApacheCon Europe 2014 - November 17-21 in Budapest, Hungary. 

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Cassandra Days [clear filter]
Tuesday, November 18
 

9:00am

Cassandra (And Hadoop) Case Studies From Finn.No - Mick Semb Wever, FINN.no
FINN.no is the leading classifieds website in Norway and the country's busiest website.

This session will go through various product development where Cassandra has shown to be the best choice. Focusing on the primary use-case: a tracking solution that's collects raw time-series data in c* and aggregates it near real-time using Hadoop into various new datasets from advert-centric statistics to user-centric behavioural analysis.

Mick will cover the final technical design chosen after three years of development iterations, touching on technologies: scribe, thrift, kafka, hadoop, pig, mahout; the hurdles faced along the way, integration improvements done between cassandra and hadoop, and the throughput and performance of today's systems.

Speakers
avatar for Mick Semb Wever

Mick Semb Wever

Team Member, The Last Pickle
Mick Semb Wever works at The Last Pickle helping customers deliver and improve Apache Cassandra based solutions. Prior to TLP he spent seven years at FINN.no building their Microservices platform utilizing Apache Cassandra, Hadoop, Spark and Kafka. He is the PMC Chair for Apache Tiles... Read More →


Tuesday November 18, 2014 9:00am - 9:50am
Tohotom

10:00am

Introduction to Cassandra - Duy Hai, DataStax

During this talk, you'll be given a high level introduction to Cassandra and to the database mechanism. Summary of topics discussed:

  •    Architecture
  •    Data model
  •    Replication
  •    Consistency model
  •    Read/Write path
  •    Failure handling

Speakers
avatar for Duy Hai

Duy Hai

Technical Advocate, Datastax
DuyHai is a Cassandra technical advocate. He spends his time between technical presentations/meetups on Cassandra, coding on open source projects to support the community and helping all companies using Cassandra to make their project successful. Shall you have any question on Cassandra... Read More →


Tuesday November 18, 2014 10:00am - 10:50am
Arany

10:00am

Intro To Apache Usergrid, The BaaS Build On Cassandra - David Johnson, Apigee Corp
Whether you are building a mobile app or a web app, Apache Usergrid (incubating) can provide you with a complete backend that supports authentication, persistence and social features like activities and followers all via a comprehensive REST API — and backed by Cassandra, giving you linear scalability. All that, and Usergrid is open source too.

This session will explain how you can use Usergrid to provide a back-end for your application. We’ll start with an overview of Usergrid features, then explore in depth how to authenticate users, store data and query data with the REST API provided by a Usergrid server. We’ll develop a simple HTML5 app and package it as a native mobile app via Apache Cordova. We'll also cover how to run Usergrid locally for development and testing.

Speakers
avatar for David Johnson

David Johnson

Open Source Developer, Apigee
Dave Johnson is an open source developer who specializes in web technologies, social software and open standards. Dave works for Apigee and is active in several open source projects at the Apache Software Foundation and elsewhere. Dave is an experienced speaker who has spoken at ApacheCon... Read More →


Tuesday November 18, 2014 10:00am - 10:50am
Tas

3:50pm

Time Series Data With Apache Cassandra - Eric Evans, OpenNMS Group
Whether it's statistics, astronomy, finance, or network management, time series data plays a critical role in analytics and forecasting. Yet, while many tools exist, few are able to scale past memory limits; For those challenged by large volumes of data, there is much room for improvement.

Apache Cassandra is a fully distributed second-generation database. Cassandra stores data in key-sorted order making it ideal for time series, and its high throughput and linear scalability make it well suited to very large data sets.

This talk will cover some of the requirements and challenges of large scale time series storage and analysis. Cassandra data and query modeling for this use-case will be discussed, and Newts, an open source Cassandra-based time series store under development at The OpenNMS Group will be introduced.

Speakers
avatar for Eric Evans

Eric Evans

Senior Software Engineer, Wikimedia Foundation
Eric has more than a decade of experience with the engineering and operations of large-scale distributed systems. He joined Rackspace as a startup, and implemented a global DNS infrastructure utilizing IP anycast (possibly the first), and a novel data-center-wide IDS for which a patent... Read More →


Tuesday November 18, 2014 3:50pm - 4:40pm
Arany

4:50pm

Data Migration and Development Cycles Using Cassandra - Matija Gobec, Execom
This talk will focus on learning curve and complex data models and queries. This is a real life example and something we do daily at Execom. It will also cover data migration and development cycles using Cassandra as database.

Speakers
MG

Matija Gobec

Senior Software Engineer
Software professional with solid knowledge in software development and design. Experience with serial and network protocols, Bluetooth, infrared and WiFi communication, medical and prototype devices, printers and smart cards. Advocate of virtualization, continuous integration and... Read More →


Tuesday November 18, 2014 4:50pm - 5:40pm
Huba
 
Wednesday, November 19
 

2:00pm

Old and New Trends in Data Modeling - Arató Bence , BI Consulting
This talk starts with the tradition data modeling rules (and the reasons behind them) then shows how new requirement have brought new modeling approaches, including star schema modeling for DWs, and Data Vault modeling  The final part of talk discuss the requirements of the modern NOSQL database systems and serve as an introduction to NOSQL/Cassandra data modeling.

This talk is a good fit for people who has some background in SQL databases but are new to NOSQL databases and Cassandra.  

Speakers
avatar for Bence Arató

Bence Arató

Managing Director, BI Consulting
Managing Director of BI Consulting Hungary. He has been in the BI industry since 1995 as an analyst, architect and consultant. He advises companies on general BI strategy, project and architecture planning, and vendor and tool selection. Also provides QA and on-the-job mentoring services... Read More →


Wednesday November 19, 2014 2:00pm - 2:50pm
Huba

3:00pm

How SoundCloud Uses Cassandra - Emily Green, SoundCloud
We'll be looking at how SoundCloud uses Cassandra, particularly looking how it supports SoundCloud's new stats product, which provides analytics data for people with sounds on SoundCloud. There are two main systems behind it, one doing real-time counts and timelines, and another doing batch top-k lists. We'll talk about the different problems inherent in both systems, and how we've attempted to overcome them.

Speakers
avatar for Emily Green

Emily Green

Backend Engineer, SoundCloud
Emily is a developer from London. She is a little in love with trying to do easy things but at hard scale, and also with Haskell, and her cat Alby. She works as a Backend Engineer in the Data Team at SoundCloud.


Wednesday November 19, 2014 3:00pm - 3:50pm
Huba
 
Thursday, November 20
 

1:40pm

Workshop: Owning Time Series With Team Apache - Cassandra, Spark, Spark Streaming and Kafka - Patrick McFadin, DataStax
Break out your laptops for this hands-on tutorial is geared around understanding the basics of how Apache Cassandra stores and access time series data. We'll start with an overview of how Cassandra works and how that can be a perfect fit for time series. Then we will add in Apache Spark as a perfect analytics companion. There will be coding as a part of the hands on tutorial. The goal will be to take a example application and code through the different aspects of working with this unique data pattern. The final section will cover the building of an end-to-end data pipeline to ingest, process and store high speed, time series data.  

Speakers
PM

Patrick McFadin

Prior to working for DataStax, Patrick was the Chief Architect at Hobsons, an education services company. His responsibilities included ensuring product availability and scaling for all higher education products. Prior to this position, he was the Director of Engineering at Hobsons... Read More →


Thursday November 20, 2014 1:40pm - 5:40pm
Tohotom