Loading…
ApacheCon EU 2014 has ended
Register Now for ApacheCon Europe 2014 - November 17-21 in Budapest, Hungary. 

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Developer [clear filter]
Monday, November 17
 

10:30am

Apache Tez - A New Chapter In Hadoop Data Processing - Hitesh Shah, Hortonworks
Apache Tez is a modern data processing engine designed for YARN on Apache Hadoop 2. Tez aims to provide high performance and efficiency out of the box, across the spectrum of low latency queries and heavy-weight batch processing. It provides a sophisticated topology API, advanced scheduling and concurrency control & fault tolerance. With a clear separation between the logical app layer and the physical data movement layer, Tez is designed from the ground up to be a platform on top of which a variety of domain specific applications can be built. Tez has pluggable control and data planes that allow users to plug in custom data transfer technologies, concurrency-control and scheduling policies to meet their exact requirements.

The talk will cover real use cases from adopters like Hive, Pig and Cascading and provide data to show the performance of Tez.

Speakers
HS

Hitesh Shah

Hortonworks Inc.
Hitesh Shah currently works on various things related to Apache Hadoop at Hortonworks with his primary focus on Apache Tez and Apache Hadoop YARN. He is a PMC member and committer for the Apache Hadoop, Tez and Ambari projects. Earlier to that, he spent close to a decade at Yahoo... Read More →


Monday November 17, 2014 10:30am - 11:20am
Arany

10:30am

Using Apache Commons Scxml 2.0: A General-Purpose And Standards Based State Machine Engine - Ate Douma, Hippo B.V.
Apache Commons SCXML provides a lightweight, embeddable state machine engine which can easily be configured and customized at runtime within your application.
Since last year the project has been picking up speed with new committers working hard towards a Commons SCXML 2.0 release, targeting full compliance with the latest W3C SCXML specification.

This presentation by Ate Douma, one of these new committers, will give you an overview of the current features and roadmap of Commons SCXML.
He will also discuss typical domains and use-cases for SCXML, and demonstrate how easily it can be used and integrated.
Commons SCXML also is very easy to extend and customize, with a clear API and a very modular structure and you will learn how Commons SCXML works under the hoods.
Finally a demonstration and explanation will be given on using Commons SCXML for a document-oriented workflow engine.

Speakers
avatar for Ate Douma

Ate Douma

Principal Product Architect BloomReach Experience, Hippo B.V., a BloomReach Company
Ate Douma is a Java developer and architect with 20+ years experience. He works as Principal Product Architect at Hippo B.V., Amsterdam, which has strong ties to the Apache Software Foundation. Ate is actively involved in the ASF, more than 12 years as committer and 10 years as ASF... Read More →


Monday November 17, 2014 10:30am - 11:20am
Tohotom

10:30am

Jax-Rs 2.0 With Apache Cxf Continued - Sergey Beryozkin, Talend
JAX-RS helps developers create HTTP-centric applications fast. JAX-RS 2.0, the evolution of the popular JAX-RS 1.1 specification, introduces new features which will make it simpler to build end to end HTTP applications of various complexity. This presentation will touch on Apache CXF philosophy of supporting various styles of developing web services, provide an overview of JAX-RS and finish with describing and analyzing all the new features of JAX-RS 2.0.
It will be based on the ""JAX-RS 2.0 With Apache CXF"" presentation given at Apache Con NA 2014 with new slides introduced. JAX-RS 2.1 plans will also be discussed should they become public by the time Apache Con EU 2014 starts.

Speakers
SB

Sergey Beryozkin

Software Architect, Talend
Sergey Beryozkin is a software architect at Talend. He is a web services expert leading Apache CXF JAX-RS and OAuth2 projects. He is also a committer of Apache Tika, Camel and Aries.


Monday November 17, 2014 10:30am - 11:20am
Grand Ballroom

10:30am

Configurable Configuration With Apache Httpd - Rich Bowen, Apache Software Foundation
In this age of automation, it's only natural that people would want greater scriptability in their web server configuration. From mod_macro to the syntax to various ways to set and use variables, Apache httpd 2.4 makes it much easier to deploy adapatable web server configurations.

In this talk you'll learn how to add intelligence to your server configuration.

Speakers
avatar for Rich Bowen

Rich Bowen

VP, Conferences, Apache Software Foundation
Rich has been involved in Open Source since before it had that name. He's a director at the Apache Software Foundation, and a community architect at Red Hat.


Monday November 17, 2014 10:30am - 11:20am
Elod/Ond

11:30am

ETL Made Simple Using Spark - Mayur Rustagi, Sigmoid Analytics
Apache Spark is growing to be the most active project in Apache Big Data ecosystem. It truly unlocks the ability to perform analytics in-memory & in an iterative fashion. In this talk I will highlight the several customer case studies where we used several aspects of Apache Spark from Streaming, Warehousing & ML. Furthermore I will show how the seamless integration of Streaming, ML & warehousing yields new opportunities for businesses to reach to their data faster.

Speakers
MR

Mayur Rustagi

CTO & Co-Founder, Sigmoid Analytics
Mayur Rustagi is a CTO & Co-founder of Sigmoid Analytics. His areas of expertise include Real Time Big Data Analytics using open source technologies like Apache Spark, Shark and Apache Hadoop. Sigmoid Analytics has worked with over 25 customers in the Big data space including several... Read More →


Monday November 17, 2014 11:30am - 12:20pm
Arany

11:30am

Oak, the Architecture of Apache Jackrabbit 3.0 - Michael Dürig, ASF
Apache Jackrabbit is just about to reach the 3.0 milestone based on a new architecture called Oak. Based on concepts like eventual consistency and multi-version concurrency control, and borrowing ideas from distributed version control systems and cloud-scale databases, the Oak architecture is a major leap ahead for Jackrabbit. This presentation describes the Oak architecture and shows what it means for the scalability and performance of modern content applications. Changes to existing Jackrabbit functionality are described and the migration process is explained.

Speakers
MD

Michael Dürig

Adobe Research
Michael Dürig is a key developer of Jackrabbit Oak and has been deeply involved in the project as of its inception. He is the PMC chair of Apache Jackrabbit and a frequent contributor to other Apache projects. Michael has presented at conferences like Jazoon, ScalaDays, .adaptTo... Read More →


Monday November 17, 2014 11:30am - 12:20pm
Tohotom

11:30am

Introduction to CouchDB - Jan Lehnardt, ASF
his an introduction to CouchDB. We'll tour the user-facing API while looking at the concepts behind it and what to build on top of them.

You will learn basic data storage and retrieval, data-design, querying, replication and various neat features on the edges of CouchDB. If you are coming from the relational world, this talk will help you understand how to "think in CouchDB".

Speakers
avatar for Jan Lehnardt

Jan Lehnardt

CEO, Neighbourhoodie Software
Jan Lehnardt is the PMC Chair and VP of Apache CouchDB, co-creator of the Hoodie web app framework based on CouchDB as well as the founder and CEO of Neighbourhoodie Software. He’s the longest standing contributor to Apache CouchDB.


Monday November 17, 2014 11:30am - 12:20pm
Tas

11:30am

Using Websocket With CXF And Camel - Akitoshi Yoshida, SAP
WebSocket is a full-duplex transport protocol developed as part of HTML5 and has caught attention not only for Browser applications but also also for application integration. This simple HTTP based protocol can operate over a single socket and does not suffer from network restriction such as proxies and firewalls when establishing a duplex channel.

New WebSocket support in Apache CXF can be used to invoke an asynchronous JAX-WS service over a single WebSocket connection instead of using two separate HTTP connections, to establish a direct node connection in a cluster environment, and to asynchronously push data from a JAX-RS service to its client. And Apache Camel now supporting both the client and server side WebSockets can be used to integrate and compose various WebSocket applications with other applications.

Speakers
AY

Akitoshi Yoshida

SAP
Akitoshi Yoshida is an ASF member and committer of Apache CXF and Apache Camel. In addition, he is also a committer of Atmosphere and Swagger-Socket. He works at SAP and has more than 10 years of experience in developing enterprise business applications and application integration... Read More →


Monday November 17, 2014 11:30am - 12:20pm
Grand Ballroom

11:30am

Building Cordova Applications With Apache Flex - Christofer Dutz, Codecentric AG
In this talk I give an overview over the Flex project's efforts in optimizing the new Falcon compiler in a way that it is able to not only produce Flash applications, but also Html and JavaScript applications from the same code-base. I will lay special emphasis on how this hybrid approach makes it easy to build Cordova applications while utilizing the full stack of features and tools developers are usually used to: Type-Safe development, Unit-Testing, Integration-Testing. Code Coverage, Debugging an application from within the IDE.

Speakers
avatar for Christofer Dutz

Christofer Dutz

Senior IT Consultant, codecentric AG
Likes to swim outside the mainstream. He is really passionate about walking new paths. Some times this means making a new technology useable for the masses or by creating new combinations of established strategies. His solutions are usually unconventional but simple, highly efficient... Read More →


Monday November 17, 2014 11:30am - 12:20pm
Petofi

1:40pm

Accelerating Big Data Application Development With Cascading - Supreet Oberoi, Concurrent, Inc.
Cascading is a Java-based application development framework for building Big Data applications on Apache Hadoop. This open source framework allows developers to leverage their existing skillsets such as Java, SQL, etc. to create enterprise=grade applications without having to think in MapReduce. This comprehensive framework separates business logic from integration logic so that developers can quickly build and test data applications locally on their laptop and then deploy them on Hadoop. While typical enterprise data applications must cross through multiple departments and frameworks, Cascading allows multiple departments to seamlessly integrate their application components into one single data processing application. In this presentation, developers will get an introduction to Cascading, how it works, and then dive into how one can build applications with Cascading

Speakers
SO

Supreet Oberoi

VP of Field Engineering, Concurrent, Inc.
Supreet Oberoi is a hands-on, entrepreneurial, technology leader with over two decades of experience in successfully developing transformative information technologies, and working in leadership roles at Concurrent Inc., American Express, Oracle, Microsoft and many privately-held... Read More →


Monday November 17, 2014 1:40pm - 2:30pm
Arany

1:40pm

What's New In Apache Syncope 1.2.0 - Colm O Heigeartaigh, Talend
Apache Syncope is a powerful and flexible open-source Identity Management project at Apache. It provides for the management and orchestration of users, roles, resources and entitlements, functionality which is exposed via a web-based UI console as well as a powerful and flexible REST API powered by Apache CXF.

In this talk, we will provide an overview of Apache Syncope and a ""hands-on"" demonstration of how to use it. We will describe and demonstrate the new features of Apache Syncope 1.2.0, such as:

* Support for binary attributes
* Passthrough authentication
* Non-cleartext password support
* Encrypted Schemas
* REST API powered by Apache CXF

Speakers
CO

Colm O hEigeartaigh

Software Architect, Talend
Dr. Colm O hEigeartaigh is a security architect at Talend, having earned his PhD in the area of cryptography. He is involved in a wide range of Apache projects, primarily in the area of security. He is PMC chair of the Apache Santuario project. He has previously presented at a number... Read More →


Monday November 17, 2014 1:40pm - 2:30pm
Tohotom

1:40pm

Putting The C Back In CouchDB 2.0: Merging Bigcouch - Joan Touzet, Cloudant
Apache CouchDB - an acronym for Cluster Of Unreliable Commodity Hardware - is a database that uses JSON for documents, JavaScript for MapReduce indexes, and regular HTTP for its API. To date, individual servers support multi-master replication, but do not actually act as a server cluster. In this presentation, Joan Touzet will cover the recent merge of the BigCouch fork in advance of the release of CouchDB 2.0. This significant work enhances CouchDB with a native Erlang-based clustering mechanism, supporting high availability, database partitioning and DynamoDB-inspired enhanced read/write semantics. Special attention will be given to new functionality, operational considerations from Cloudant's experience running this code for years in production, upgrade scenarios for existing users and the exciting new Fauxton UI.

Speakers
JT

Joan Touzet

Independent Contracter, Cloudant, an IBM Company
Joan Touzet is an independent contractor, managing commercial and OSS software development since 1993. Based in Toronto, Joan has worked in embedded systems, big data, finance, bio-informatics, advertising, natural resources and tooling for SW/HW systems. Prior to returning to independent... Read More →


Monday November 17, 2014 1:40pm - 2:30pm
Tas

1:40pm

CXF Security And Reliability - Dennis Sosnoski, Sosnoski Software Associates Ltd
Apache CXF supports a full range of SOAP extension technologies. Two of the most important extensions are WS-Security and WS-ReliableMessaging. The WS-Security family of technologies add message-level security to basic SOAP exchanges, while WS-ReliableMessaging adds delivery assurance options. In this presentation Dennis Sosnoski will show you how WS-Security and WS-ReliableMessaging can be added to your CXF SOAP applications with little or no impact on your application code. He'll also discuss how large organizations are increasingly using these SOAP extension technologies in their operations, including how the government of the Netherlands is applying the technologies to educational and other government sectors. Finally, Dennis will summarize the WS-ReliableMessaging and WS-Security enhancements included in the latest CXF 3.0 release.

Speakers
avatar for Dennis Sosnoski

Dennis Sosnoski

Principal Consultant, Sosnoski Software Associates Ltd
Dennis Sosnoski is a CXF committer and PMC member and an expert on web services in Java. His CXF work has included implementing the WS-ReliableMessaging 1.1/1.2 support and making WS-RM work correctly with WS-Security for the new CXF 3.0 release. Dennis has been helping organizations... Read More →


Monday November 17, 2014 1:40pm - 2:30pm
Grand Ballroom

1:40pm

mod_rewrite And Friends: URL Mapping And Manipulation With Apache httpd - Rich Bowen, Apache Software Foundation
mod_rewrite is still seen by many as magical incantations that will make your website rise to the top of the Google results. While this isn't true, it is an important tool to master as a web site administrator. Come learn how to read and write rewrite rules, and also about the many other ways to manipulate and map URLs on your Apache http server.

We'll cover mod_rewrite (of course), mod_speling, mod_substitute, mod_proxy_html, and many others.

Speakers
avatar for Rich Bowen

Rich Bowen

VP, Conferences, Apache Software Foundation
Rich has been involved in Open Source since before it had that name. He's a director at the Apache Software Foundation, and a community architect at Red Hat.


Monday November 17, 2014 1:40pm - 2:30pm
Elod/Ond

1:40pm

Cordova And Firefox OS - Html5 For The Mobile Web - Jason Weathersby, Mozilla
Firefox OS is an open source operating system for mobile phones designed entirely around HTML5, Javascript, CSS and open web standards. Recently Firefox OS has been integrated into Cordova and offers another option for multiplatform mobile developers. By leveraging the inherent advantages of web development and giving developers access to hardware through web standards, we can make an open and affordable mobile web a reality. In this talk Jason Weathersby from Mozilla will show how Firefox OS works, how to build and debug apps for it, and how end users will benefit from this open alternative to other platforms.

Speakers
JW

Jason Weathersby

Technical Evangelist, Mozilla
Jason Weathersby is a Technical Evangelist for Mozilla, evangelizing Firefox OS. He is also a committer on the BIRT project at the Eclipse Foundation, and has co-authored several books and written many articles on BIRT integration. He is a proponent of HTML5, the Open Web and Open... Read More →


Monday November 17, 2014 1:40pm - 2:30pm
Petofi

1:40pm

Introduction To A Groovy Based DSL For Apache OFBiz - Jacopo Cappellato, HotWax Media
Apache OFBiz is a framework and a suite of applications for the automation of enterprise tasks (e.g. ERP, CRM, MRP, E-Commerce). Although usable out of the box, OFBiz often requires the customization of some of its components for managing specific business processes. In order to rapidly and effectively integrate new functionalities, developers need a programming environment that allows to focus on the business logic without being distracted by low level coding details. Up to now, the best practice has been to use Minilang, an xml based Domain Specific Language (DSL) developed by the OFBiz community, together with plain Java and Groovy, when more flexibility is needed. In this presentation, Jacopo Cappellato will outline the limitations of the current approach and will propose a new Groovy based DSL that could become the only language you'll need to know for the customization of OFBiz.

Speakers
avatar for Jacopo Cappellato

Jacopo Cappellato

VP Technology, HotWax Media
Jacopo is a member of the Apache Software Foundation and a committer and chairperson of the Project Management Committee (PMC) for the OFBiz Apache project, to which he has participated since 2003. Jacopo is the CTO and member of the Executive Team at HotWax Systems. Prior to his... Read More →


Monday November 17, 2014 1:40pm - 2:30pm
Huba

2:40pm

Long-Lived Yarn Services: The Future Of Yarn Applications. - Steve Loughran, Hortonworks
Apache Hadoop clusters are generally viewed as data analysis systems, running short-to medium life analysis applications —or installations of a single large application such as Apache HBase & Apache Accumulo.

There's no reason for this to be the case: you can deploy long lived services into a Hadoop cluster, gaining access to the HDFS filesystem, availability from a fault-resilient infrastructure, shared use driven by scheduling -and the ability to integrate with other services running in the YARN cluster.

In this talk I will look at the needs of long-lived services, where YARN is today with supporting them -and where we are going next. In particular, I will explore the JIRA issue YARN-896 is the focal point for evolving YARN's support of long-lived services, addressing needs such as security, logging and service discovery -demonstrating some of this in action.

Speakers
avatar for Steve Loughran

Steve Loughran

Member of Technical Staff, Hortonworks
Steve Loughran is a developer at Hortonworks, where he works on leading-edge Hadoop applications, most recently on Apache Slider and on Apache Spark's integration with Hadoop and YARN, and Hadoop's S3A connector to Amazon S3. He's the author of Ant in Action, a member of the Apache... Read More →


Monday November 17, 2014 2:40pm - 3:30pm
Arany

2:40pm

Patches Welcome - Contributing To Apache Projects In A Nutshell - Isabel Drost-Fromm, Elasticsearch
For an outsider the way open source projects in general and Apache projects in particular work seems complex and hard to understand. Cultural differences not only between corporate work and open source collaboration but also across projects make things even harder.

This talk tries to outline some of the basics of what makes a successful committer at Apache: Why should you understand the culture and get involved if you are using one of the Apache projects? What are valuable contributions? How do you stay committed without over committing yourself in case someone invites you to become official committer?

Speakers
avatar for Isabel Drost-Fromm

Isabel Drost-Fromm

Open Source Strategist, Europace AG
Isabel Drost-Fromm is Open Source Strategist at Europace AG Germany. She's a member of the Apache Software Foundation, co-founder of Apache Mahout and mentored several incubating projects. Isabel is interested in all things FOSS, search and text mining with a decent machine learning... Read More →


Monday November 17, 2014 2:40pm - 3:30pm
Dery/Mikszath

2:40pm

From OAuth1 To OAuth2 With Apache CXF And Hawk - Sergey Beryozkin, Talend
OAuth2 is a set of specifications describing the way resource owners can allow third party applications access some of their resources. OAuth2 owes a lot to the popularity of older OAuth1, the fact which led to the OAuth2 process being initiated. However the concern about OAuth2 being as capable and secure as OAuth1 is still shared by some developers. In this presentation we will briefly introduce Apache CXF OAuth2 implementation and show how OAuth2 and Hawk (Mac) token can be used to create simpler and no less secure classical OAuth applications.

Speakers
SB

Sergey Beryozkin

Software Architect, Talend
Sergey Beryozkin is a software architect at Talend. He is a web services expert leading Apache CXF JAX-RS and OAuth2 projects. He is also a committer of Apache Tika, Camel and Aries.


Monday November 17, 2014 2:40pm - 3:30pm
Grand Ballroom

2:40pm

Apache Traffic Server & Lua - The Perfect Match - Shu Kit Chan, Yahoo!
Apache Traffic Server is a fast, scalable and extensible HTTP/1.1 compliant caching proxy server. There are C APIs to let you write your own plug-ins and do something like modifying HTTP headers or modifying the response with additional content. Lua is a fast, powerful, lightweight and embeddable scripting language.

In this talk, we will introduce the new feature that embeds the power of Lua into the software and allows user to write plugins for ATS in Lua. We will also talk the implementation of this feature and our plan on improving and expanding it it.


Speakers
SK

Shu Kit Chan

Yahoo
This proposal is submitted by Shu Kit Chan and Pushkar Sachdeva. Pushkar is a Senior Software Engineer at Yahoo!. He has been with Yahoo for over 5 years and enjoys working on the proxy layer. He has been part of many interesting projects at Yahoo! like Assembly At The Edge (gave... Read More →


Monday November 17, 2014 2:40pm - 3:30pm
Elod/Ond

2:40pm

Apache DeviceMap - Werner Keil, Creative Arts & Technologies
We experience a growing number of mobile phones, tablets, phablets, smart TV and similar devices flooding the market almost every day.
Capturing the specification of each device is a tough job. If you want to create a comfortable user experience you need dynamic content according to hardware and browser specifications of your device. That’s the reason why Device Description Repositories (DDR) exist.

Apache DeviceMap is a collaborative effort to create a comprehensive open-source and open-data repository of device information, images and other relevant information for all types of mobile devices, smartphones, tablets, smart TV, etc.. Much of the code was donated from the OpenDDR project, and participants come from many different companies.

The project began in January 2012, later that year OpenDDR contributed DDR APis for Java and. NET. Ongoing steps are a common device repository, a storage structure and maintenance of device data by the Apache community.

Speakers
avatar for Werner Keil

Werner Keil

Director, Creative Arts & Technologies
Werner Keil is Agile Coach Java and IoT/Embedded expert. Helping Global 500 Enterprises across industries and leading IT vendors. He worked for over 25 years as Program Manager, Coach, SW architect and consultant for Finance, Mobile, Media, Tansport and Public sector. Werner is Eclipse... Read More →


Monday November 17, 2014 2:40pm - 3:30pm
Petofi

2:40pm

GUI Unit Test For Apache OFBiz - Olivier Heintz, Nereide
Apache OFBiz is a ERP and also a technical and functional framework. It allows to build strong and adapted solution for managing Company.
Currently there are a lot of Junit test for internal business process but there are no test for user interface, because most of time standard GUI is modified in customer project implementation.
This presentation will show how to realize GUI unit test with selenium and webdriver to test Portal page with portlet, and how to integrate test in a continuous integration environment Jenkins(ex Hudson).
With standardized tests like these, it's possible, in a customer project to deploy a complete test coverage to guaranty quality and scalability.

Speakers
avatar for Olivier Heintz

Olivier Heintz

CIO and Business Architect, Apache-OFBiz
Olivier Heintz is CIO and Business Architect dedicated to Apache-OFBiz since 2003. He has over 30 years of experience in IT with core skills in Supply Chain Management, Shop Floor Control, Inventory Integration and Accounting. | | He has managed many OFBiz implementation projects... Read More →


Monday November 17, 2014 2:40pm - 3:30pm
Huba

3:50pm

Building A Better Test Platform: A Case Study Of Improving Apache Hbase Testing With Docker - Aleks Shulman & Dima Spivak, Cloudera
Cloudera Engineering has heavily incorporated Docker, an extension of Linux Containers, into our integration testing framework for Apache HBase, a distributed "NoSQL" datastore. Through the use of Docker images, we have succeeded in parameterizing the Apache Hadoop environment on which our tests are deployed. This allows us to test functionality and compatibility across a wider range of platforms in a much shorter amount of time, resulting in dramatic improvements in utilization of our computational resources. In this talk, we will present how we use Docker during test development to reduce the time it takes to write and run functional tests and to include more test configurations. We will then go in-depth on a particularly novel use case: compatibility testing. Attendees will come away with a perspective on Docker that will help them adopt it into their own test frameworks.

Speakers
AS

Aleks Shulman

Software Engineer, Cloudera
Aleks is a Software Engineer in Test, specializing in Apache HBase and running Apache Hadoop in the Cloud. He has been at Cloudera for two years. Previously, he was at Salesforce.com, working on test automation for the Force.com Platform APIs. Before Salesforce.com, Aleks attended... Read More →
avatar for Dima Spivak

Dima Spivak

Software Engineer, Cloudera
Dima Spivak is a Software Engineer in Test, where he works on Apache HBase in particular and test frameworks in general. Before joining Cloudera, Dima was a Research Assistant in the School of Physics and Astronomy at the University of Minnesota, where he received his MS in Physi... Read More →


Monday November 17, 2014 3:50pm - 4:40pm
Arany

3:50pm

If You Have The Content, Then Apache Has The Technology! - Nick Burch, Quanticate
Within the ASF, there are a wide variety of projects with technologies to help you store, retrieve, host, transform and generate content. This talk will review the landscape of Apache content technologies, provide a quick introduction to the more common and more interesting projects, and flag up new and innovative features within them. It'll also highlight talks from the rest of the week on many of the projects covered, so that you'll know where and when to go to learn more about those projects and technologies which catch your eye!

Speakers
avatar for Nick Burch

Nick Burch

CTO, Quanticate
Nick began contributing to Apache projects in 2003, and hasn't looked back since! Most of the projects Nick has worked in belong in the "Content" space, such as Apache POI (ex-PMC Chair), Apache Tika and Apache Chemistry. As well as coding projects, Nick is also involved in a number... Read More →


Monday November 17, 2014 3:50pm - 4:40pm
Tohotom

3:50pm

Particle Physics On The Couch: Using CouchDB To Help Unravel The Mysteries Of The Universe - Michael Marino, Echnische Universität München
Pushing the envelope of our understanding of the universe requires physics experiments that measure and analyze data from many disparate systems. Such systems generally have widely different requirements and architectures, making it difficult to implement a single, monolithic program for control and readout. CouchDB is an optimal tool to integrate and save data from such systems since its RESTful interface provides a minimal hurdle that almost all devices large and small can achieve. In this presentation, Mike Marino will describe how CouchDB is being used in the nEDM experiment at the Technische Universität München to integrate, control, view and save measurement data from many different subsystems, ranging from those powered by microcontrollers and mini PCs (Arduinos and Raspberry Pis) to those based on the industrial VMEbus.

Speakers
avatar for Michael Marino

Michael Marino

Experimental Physicist, Technische Universität München
Michael Marino is an experimental physicist working at the Technische Universität München (Germany) interested in searching for new physics that could help solve some of the open questions about our universe. He is currently working on two experiments: (1) EXO-200, looking for a... Read More →


Monday November 17, 2014 3:50pm - 4:40pm
Tas

3:50pm

Design REST Services With CXF JAX-RS Implementation: Best Practices And Lessons Learned - Andrei Shakirin, Talend
JAX-RS specification makes creation of REST services easy and intuitive for Java developers. However there are a number of topics to worry about in order to provide clean and maintainable REST APIs.
In this presentation, Andrei will share practical experience to design REST services based on JAX-RS Apache CXF implementation.
The presentation will cover the following:
- Exposing of resources and collections
- Resource methods selection and content negotiation
- Exception processing
- Client and Server asynchronous APIs
- Using of JSON providers
- Crosscutting aspects in JAX-RS filters and interceptors
- Bean validation

The main goals of this presentation are to share the best practices, experience and discuss the most important aspects of designing and implementing REST APIs in your projects.

Speakers
AS

Andrei Shakirin

Software Architect, Talend
Andrei is a software architect in the Talend team developing the open source Application Integration platform based on Apache projects. The areas of his interest are REST design, web services security, Microservices and agile development. Andrei is PMC and committer of Apache CXF... Read More →


Monday November 17, 2014 3:50pm - 4:40pm
Grand Ballroom

3:50pm

An Introduction To Apache Flex - Justin Mclean, Class Software
Apache Flex is an open source framework for easily building applications for mobile devices, the browser and desktop.

In this talk I'll give a brief history of Apache Flex, cover the main features of Apache Flex including recent releases, it's UI XML markup language, component hierarchy, binding, event bubbling, skinning and other major features of the framework.

I'll show a simple Flex application and how easy it is to make MVC architecture web browser application and convert with few simple modifications from a web browser based application to one that runs on the desktop and in a mobile device.

Finally I look at some of the other more recent directions the project is evolving in with FlaconJX a ActionScript to JavaScript cross compiler, FlexJS a new JavaScript based framework and ant for Air an utility for making cross platform user friendly installers from ant scripts.

Speakers
avatar for Justin Mclean

Justin Mclean

Founder, Class Software
Justin Mclean has more than 25 years experience in developing web based applications and is involved in the open source hardware movement. He runs his own consulting company Class Software and has spoken at numerous conferences in Australia and overseas including previous ApacheCon... Read More →


Monday November 17, 2014 3:50pm - 4:40pm
Petofi

4:50pm

A Development Analytics Dashboard For Apache Cloudstack - Jesus M Gonzalez-Barahona, Bitergia
The Apache CloudStack Development Analytics Dashboard helps to better understand how the project is evolving and performing. Using information from git, Jira and mailing lists, several charts and numbers provide some insight on relevant parameters of the project. The talk will explain the meaning of the different panels in the dashboard, and will use them and some complementary quantitative data to analyze specific aspects of the inner life of the project. Those will be used to describe the CloudStack community, its activity, and some of their most important development processes.

CloudStack Dashboard: http://projects.bitergia.com/apache-cloudstack/browser/

Speakers
avatar for Jesus M Gonzalez-Barahona

Jesus M Gonzalez-Barahona

Co-Founder, Bitergia
I'm one of the founders of Bitergia, the software development analytics company, and associate professor at Universidad Rey Juan Carlos. I've been working in quantitative empirical analysis of FOSS development for years, participating in several international R&D projects. I'm currently... Read More →


Monday November 17, 2014 4:50pm - 5:40pm
Dery/Mikszath

4:50pm

RBAC Authorization With Apache Directory Server And Fortress - Emmanuel Lecharny, Symas
Apache Directory Server is a LDAP server written in Java. It offers many interesting features, beside being a fully compliant LDAP server. Fortress, on the other hand, is a full ANSI RBAC (INCITS 359) implementation, which uses a LDAP server as a backend.

We will present how you can leverage both components to build a powerful authorization system that you can include into your application.

Speakers
avatar for Emmanuel Lecharny

Emmanuel Lecharny

Architect, Symas
Emmanuel Lécharny is an Apache Member, former chairman of the Apache Directory project. | | He is currently working for Symas. | | He has been working on Apache Directory project for more than 10 years.


Monday November 17, 2014 4:50pm - 5:40pm
Tohotom

4:50pm

CouchDB-Based System For The Users Data Management In Cms At Lhc: Implementation And Experience - Hassen Riahi, CERN
The Compact Muon Solenoid (CMS) users data management service at The Large Hadron Collider (LHC), named AsyncStageOut, manages more than 300k users' files per day of close to 1000 individual users per month with minimal delays, and providing a real time monitoring to users and service operators, while being highly available. The associated data volume represents a new set of challenges in the areas of database scalability and service performance and efficiency.
The service is designed as a thin application relying only on CouchDB as input and data storage. It has progressed from a limited prototype to a highly adaptable service, which manages and monitors the whole CMS user files steps.
In this presentation, Hassen Riahi will provide an overview of the AsyncStageOut model, present the motivations for using CouchDB, describe the deployment model and discuss the results achieved.

Speakers
HR

Hassen Riahi

CERN
Employer: CERN (http://home.web.cern.ch/about) | | Ongoing projects: | • Design and implementation of CMS users data management service | • Design and implementation of the Worldwide LHC Computing Grid (WLCG) File Transfer Service (FTS) monitoring Dashboard | • Commissioning... Read More →


Monday November 17, 2014 4:50pm - 5:40pm
Tas

4:50pm

Secure Services With Apache CXF - Andrei Shakirin, Talend
Security is the common requirement for almost all of the web service related products and solutions. Apache CXF framework provides a wide range of the security features for REST and SOAP services. It supports a number of security standards like OAuth, WS-Trust, WS-Federation. CXF users can benefit from new functionality in Authorization, Key Management and Security Token Service components.
This presentation will review CXF security features and illustrate them with the code. Andrei will show how the security requirements have been implemented in the real projects and discuss possible alternatives.
Which security options are available for the REST service? How to achieve federated authentication in CXF applications? How to implement authorization in web service? What is the benefit of the public key infrastructure and XKMS? - These questions will be discussed during the session.

Speakers
AS

Andrei Shakirin

Software Architect, Talend
Andrei is a software architect in the Talend team developing the open source Application Integration platform based on Apache projects. The areas of his interest are REST design, web services security, Microservices and agile development. Andrei is PMC and committer of Apache CXF... Read More →


Monday November 17, 2014 4:50pm - 5:40pm
Grand Ballroom

4:50pm

Unit Testing Apache Flex Applications - Justin Mclean, Class Software
FlexUnit has finally cleared Adobe's legal team been donated to the Apache Flex project and made it into it's first release.

In this session I go though how to efficiently unit test an Apache Flex application. I'll start with the basics by showing how to write and run simple Flex Unit tests. Then moving on to how to refactor your code so that it is testable and show several methods to pragmatically test your application. I'll cover a few more advanced topics including asynchronous unit testing, testing event dispatch and UI testing with FlexUnit. Finally for a bit of fun I'll show how FlexUnit can be used in Jenkins and the build status shown via Open Source Hardware Arduino traffic light shield.

Speakers
avatar for Justin Mclean

Justin Mclean

Founder, Class Software
Justin Mclean has more than 25 years experience in developing web based applications and is involved in the open source hardware movement. He runs his own consulting company Class Software and has spoken at numerous conferences in Australia and overseas including previous ApacheCon... Read More →


Monday November 17, 2014 4:50pm - 5:40pm
Petofi

4:50pm

Introduction To A Recommender System For Apache OFBiz - Anahita Goljahani
Recommender Systems are engines aimed at predicting users preferences based, e.g., on knowledge about past behaviors of similar users. Recommender Systems have become very popular in the recent years, e.g., to suggest products to buy, articles to read or music to listen. One of the main advantages of Recommendations is that they allow users to discover items that a simple search might not show up. This is one of the reasons why Recommendation engines have become a desirable, if not essential, component of commercial platforms.

In this presentation, Anahita Goljahani will discuss the design of a Recommender System for Apache OFBiz (http://ofbiz.apache.org/), the popular solution for the automation of business applications (ERP, CRM, MRP, E-Commerce, etc.).

Speakers
avatar for Anahita Goljahani

Anahita Goljahani

Anahita Goljahani is a consultant that has been involved in several research projects and has published articles for international conferences and scientific journals. Her interests include signal processing approaches for telecommunication systems and neuroscience investigations... Read More →


Monday November 17, 2014 4:50pm - 5:40pm
Huba
 
Tuesday, November 18
 

9:00am

Cassandra (And Hadoop) Case Studies From Finn.No - Mick Semb Wever, FINN.no
FINN.no is the leading classifieds website in Norway and the country's busiest website.

This session will go through various product development where Cassandra has shown to be the best choice. Focusing on the primary use-case: a tracking solution that's collects raw time-series data in c* and aggregates it near real-time using Hadoop into various new datasets from advert-centric statistics to user-centric behavioural analysis.

Mick will cover the final technical design chosen after three years of development iterations, touching on technologies: scribe, thrift, kafka, hadoop, pig, mahout; the hurdles faced along the way, integration improvements done between cassandra and hadoop, and the throughput and performance of today's systems.

Speakers
avatar for Mick Semb Wever

Mick Semb Wever

Team Member, The Last Pickle
Mick Semb Wever works at The Last Pickle helping customers deliver and improve Apache Cassandra based solutions. Prior to TLP he spent seven years at FINN.no building their Microservices platform utilizing Apache Cassandra, Hadoop, Spark and Kafka. He is the PMC Chair for Apache Tiles... Read More →


Tuesday November 18, 2014 9:00am - 9:50am
Tohotom

9:00am

State Of Apache Hbase, 1.0 Release - Nick Dimiduk, Hortonworks
The pace of innovation in HBase is rapidly increasing together with its popularity. In this talk, we will take a look at all the development that happened last year for a user level overview of all the recently added features, and releases in HBase. We will talk about the upcoming 1.0 release which is expected to arrive at summer 2014. We will cover which release to choose, binary / wire and source compatibility considerations and how to upgrade between releases. Specifically, we will talk about long list of new features in recent releases including client API changes, new PB based Filter and Coprocessor interfaces, namespaces, per-cell ACLs, region replicas and many other features.

Speakers
avatar for Nick Dimiduk

Nick Dimiduk

Hortonworks
Nick Dimiduk is a committer and PMC member on both Apache HBase and Apache Phoenix. He's Release Manager for the HBase 1.1 branch and an author of the book HBase in Action, on Manning Press. Nick has also contributed to a number of Apache projects around HBase, including, HTrace... Read More →


Tuesday November 18, 2014 9:00am - 9:50am
Arany

9:00am

Apache Brooklyn - What It Is And Why You Might Use It - Richard Downer, Cloudsoft
Apache Brooklyn recently joined the Incubator. Brooklyn can be described using phrases like "automated management of cloud applications" and "simplified deployment and runtime management of enterprise-grade applications", but soundbites like those only scratch the surface of what Brooklyn can do. In this presentation, Richard Downer will show what Brooklyn is, how it can help you, and how to develop with it. Afterwards, whenever you see the tagline "application modelling, monitoring and management", you'll realise that's *WAY* cooler than it sounds.

Speakers
avatar for Richard Downer

Richard Downer

Principal Engineer, Cloudsoft
Richard is an engineer at Cloudsoft, working on the Apache Brooklyn (incubating) project. Richard is a relative n00b at conference speaking having only spoken at one conference so far!


Tuesday November 18, 2014 9:00am - 9:50am
Huba

9:00am

Effective Web Application Development With Apache Sling - Robert Munteanu, Adobe
Apache Sling is an innovative web framework built on top of the Java Content Repository (JCR), that uses OSGi for its component model and fosters RESTful application design.

This talk shows how Sling integrates various standard-based technologies, like OSGi and the Content Repository API for Java to create a coherent framework for web application development. We will walk through the development of a simple application with minimal effort and demonstrate how to productize the resulting application.

We will pay special attention to some approaches which are not yet part of mainstream development, such as using OSGi for dependecy injection and JCR for persistence.

Speakers
avatar for Robert Munteanu

Robert Munteanu

Senior Computer Scientist, Adobe Systems Romania
Robert Munteanu is a software developer with a passion for Open Source. He works as a senior computer scientist on the Adobe Experience Manager. A frequent Open Source contributor, he focuses his efforts on projects such as Apache Sling, MantisBT, the Eclipse plugin for Reviewboard... Read More →


Tuesday November 18, 2014 9:00am - 9:50am
Tas

9:00am

Lucene Spatial - The Forgotten NoSQL Spatial Datastore - Steven Citron-Pousty, Red Hat
Lucene, while a mature and proven technology, has gotten little press in the NoSQL buzz. What started as a full-text search index solution has grown to also be the basis for Solr and ElasticSearch. Most importantly for today's talk they have added the capability to do spatial queries along with full text queries. In this talk I will give a quick intro to Lucene and surrounding projects, give it's sweet spot for use, and then give a quick code sample on indexing and searching of a spatial document collection. The code will use Java, JAX-RS, and CDI to create a geospatial REST service talking to a JavaScript Front-End using Leaflet and OpenStreetMap. The plan after is to have you go home with another geospatial storage and search tool in your toolbox.

Speakers
avatar for Steven Pousty

Steven Pousty

Developer Evangelist, Red Hat
Steve is a Dad, Son, Partner, and Developer Evangelist with OpenShift. He goes around and talks about cool technology that sometimes involves Red Hat Technology. He can teach you about Java, Python, PostgreSQL MongoDB, some JavaScript, Docker, and Kubernetes. He has deep subject area... Read More →


Tuesday November 18, 2014 9:00am - 9:50am
Elod/Ond

10:00am

Intro To Apache Usergrid, The BaaS Build On Cassandra - David Johnson, Apigee Corp
Whether you are building a mobile app or a web app, Apache Usergrid (incubating) can provide you with a complete backend that supports authentication, persistence and social features like activities and followers all via a comprehensive REST API — and backed by Cassandra, giving you linear scalability. All that, and Usergrid is open source too.

This session will explain how you can use Usergrid to provide a back-end for your application. We’ll start with an overview of Usergrid features, then explore in depth how to authenticate users, store data and query data with the REST API provided by a Usergrid server. We’ll develop a simple HTML5 app and package it as a native mobile app via Apache Cordova. We'll also cover how to run Usergrid locally for development and testing.

Speakers
avatar for David Johnson

David Johnson

Open Source Developer, Apigee
Dave Johnson is an open source developer who specializes in web technologies, social software and open standards. Dave works for Apigee and is active in several open source projects at the Apache Software Foundation and elsewhere. Dave is an experienced speaker who has spoken at ApacheCon... Read More →


Tuesday November 18, 2014 10:00am - 10:50am
Tas

10:00am

High Performance Solr - Shalin Shekhar Mangar, LucidWorks Inc.
What makes an Apache Solr installation high performance? Learn about what's keeping that CPU hot, memory tight, disk screaming and network busy on your Solr installation. Optimize CPU usage, reduce memory and disk contention, unclog that network and learn about hidden gotchas of query and filter performance, DocValues, transaction logs etc.

Although, numerous Apache Solr performance tuning tips are available on the world-wide-web but they are distributed between the official Apache Solr reference guide, Apache Lucene/Solr javadocs, code comments, Jira comments, various books, the mailing-list and many blogs. In many cases, such advice is unsubstantiated with numbers or not annotated with the trade-offs.

Shalin will back each piece of advice with statistics and benchmarks as well as inform you about the trade-offs made so you can make more informed decisions.

Speakers
avatar for Shalin Shekhar Mangar

Shalin Shekhar Mangar

Senior Solr Consultant, Lucidworks
I am a committer on Apache Lucene/Solr since 2008 as well as a member of the Lucene/Solr project management committee. I currently work at Lucidworks Inc. on Apache Solr and Lucidworks Search mostly on the SolrCloud side of things. In the past, I've worked at AOL for five years on... Read More →


Tuesday November 18, 2014 10:00am - 10:50am
Elod/Ond

10:00am

The OpenOffice Localization Community - Andrea Pescetti, Apache OpenOffice PMC
The sun never sets on the OpenOffice localization community. Volunteers from all continents help translating OpenOffice, its documentation and its website into dozens of languages.
We'll discuss the localization of OpenOffice both as a community story and a technical story. The community was restarted from scratch at Apache and welcomes new volunteers on a regular basis. Tools were developed and extended at Apache to help translators, with benefits for all Apache projects.

Speakers
AP

Andrea Pescetti

Member, Apache OpenOffice PMC
Andrea Pescetti has been active in free and open source software communities since 2001. He is currently serving as member of the Apache OpenOffice PMC, and helping the project as a volunteer, mainly in the Release Management and Localization fields. In his ordinary work activity... Read More →


Tuesday November 18, 2014 10:00am - 10:50am
Petofi

10:00am

Simple Software Is Hard - Tales From The Trenches - Bertrand Delacretaz, Adobe
Simple software wins every time, but it can be very hard to create. Based on concrete examples from past projects, we'll show how high one can put the bar in terms of simplicity, and how you need to be ready to throw away your work, even many times, to come up with something simple enough. There's no magic recipe for simple software, but our "tales from the trenches” should help you identify the key drivers that help us progress on this hard journey.

Speakers
avatar for Bertrand Delacretaz

Bertrand Delacretaz

Principal Scientist, Adobe
Bertrand Delacretaz works as a Principal Scientist in the AEM R&D team at Adobe’s Basel office, using open source tools to create world-class content management and digital marketing systems. Bertrand is an active member and current Director of the Apache Software Foundation... Read More →


Tuesday November 18, 2014 10:00am - 10:50am
Kond

11:20am

Ndfs: A Native Client For The Hadoop Distributed Filesystem - Colin McCabe, Cloudera
As the main filesystem for Hadoop, the Hadoop Distributed Filesystem an important part of the big data ecosystem. However, previously, non-Java Hadoop clients have had to deal with the JNI interface when communicating with HDFS. NDFS, our new project to create a native client for HDFS, offers many operational, performance, and practical advantages for these clients. In this presentation, I'll talk about the architecture of NDFS, the problems we solved when developing it, and our plans for the future.

Speakers
CM

Colin McCabe

Software Engineer, Cloudera
Colin McCabe is a Platform Software Engineer at Cloudera, where he works on HDFS and related technologies. He is a committer on HDFS. Prior to joining Cloudera, he worked on the Ceph Distributed Filesystem, and the Linux kernel, among other things. He studied Computer Science and... Read More →


Tuesday November 18, 2014 11:20am - 12:10pm
Arany

11:20am

Building Your Own BaaS With Apache Usergrid & Docker: Lessons Learned At Scale - Sungju Jin, Korea Telecom
BaaS(Backends-as-a-Service) and PaaS(Platforms-as-a-Service) are not just for hipsters anymore. 2013 saw many people building their own PaaS like Heroku, Cloudfoundry or BaaS like Parse, Kinvey.

However, many companies and open-source projects only focus on the advantages of building your own *aaS and the advantages of their own solution. Do we really know what works and what should be considered when building such system? Many people do not know whether they should build a PaaS or BaaS, how they should build it, or even if they should build one at all.

Drawing from our experience building both a production BaaS, serving hundreds of millions of requests every month at Korea Telecom, we will explain some of the lessons we learned building large application platforms.

Join us if you are interested in learning more about how to build your own BaaS. The answers may surprise you!

Speakers
avatar for Sungju Jin

Sungju Jin

Software Engineer, Apache Software Foundation
Sungju Jin is a Software Engineer. He is a committer on the Apache Usergrid. He has also presented at conferences such as O’Reilly’s OSCON, Biggest OSS Conf in Korea. He was working for Korea Telecom to develop cloud platform. And he was previously working for Samsung Electronics... Read More →


Tuesday November 18, 2014 11:20am - 12:10pm
Huba

11:20am

Scaling Solrcloud To A Very Large Number Of Collections - Shalin Shekhar Mangar, LucidWorks, Inc.
The traditional and typical search use case is the one large search collection distributed among many nodes and shared by all users. However, there is a class of applications which need a large number of small or medium collections which can be used, managed and scaled separately. This talk will cover Shalin and LucidWorks' efforts in helping a client set up a large scale SolrCloud setup with thousands of collections running on hundreds of nodes. Shalin will describe the bottlenecks that were found in SolrCloud when running a large number of collections and also the many features and optimisations that were contributed to Apache Solr to reduce or remove the choke points in the system. Finally, Shalin will talk about the benchmarking process and the lessons learned from supporting such an installation in production.

Speakers
avatar for Shalin Shekhar Mangar

Shalin Shekhar Mangar

Senior Solr Consultant, Lucidworks
I am a committer on Apache Lucene/Solr since 2008 as well as a member of the Lucene/Solr project management committee. I currently work at Lucidworks Inc. on Apache Solr and Lucidworks Search mostly on the SolrCloud side of things. In the past, I've worked at AOL for five years on... Read More →


Tuesday November 18, 2014 11:20am - 12:10pm
Elod/Ond

11:20am

Continuous Automated Deployment With Apache Ace - Jan Willem Janssen & Marcel Offermans, Luminis Technologies
Nowadays, it is quite common to have build infrastructure that, on every change in a repository, builds your software and runs all your tests. However this is where most development teams stop. This talk demonstrates how you can consistently deploy systems from development to testing, staging and production.
With Bndtools we semantically version bundles in the Eclipse IDE. The continuous build is setup to automatically deploy to a bundle repository and create snapshots of changed bundles compared to a baselined version that is available in a release repository. By hooking up these repositories to Apache ACE, such updates can instantly be deployed to systems in different configurations, allowing you to deploy new features to QA systems, beta users and finally roll them out to all other customers.
We’ll also go into real life scenarios and pitfalls we have encountered setting this up.

Speakers
avatar for Jan Willem Janssen

Jan Willem Janssen

Software Architect, Luminis Technologies
Jan Willem Janssen works as a software architect at Luminis Technologies and has several years of experience in various areas of software development. He has a strong interest in software architecture, modularity, security and creating scalable software systems. He actively participates... Read More →
MO

Marcel Offermans

Director, Luminis Technologies
Marcel Offermans is the Director at Luminis Technologies. He has been involved in lots of commercial OSGi projects for over eight years and is responsible for the architecture, development and support of an OSGi and cloud based software provisioning product. He is also a member of... Read More →


Tuesday November 18, 2014 11:20am - 12:10pm
Kond

11:20am

Apache TomEE, Java Ee Web Profile And More On Tomcat - Jean-Louis Monteiro, Tomitribe
Apache TomEE is the Java EE Web Profile certified version of Apache Tomcat and combines the simplicity of Tomcat with the power of Java EE. The first half of this session introduces TomEE and shows how Tomcat applications leveraging Java EE technologies can become simpler and lighter with a Java EE certified solution built right on Tomcat.
The second half jumps right into action and gives a coding tour of TomEE, including quickly bootstrapping projects, doing proper testing with Arquillian, and setting up environments. If you’re a Tomcat lover or a TomEE enthusiast, this is the session you don’t want to miss!

Speakers
JM

Jean-Louis Monteiro

Director of Engineering, Tomitribe
I'm a Senior Java Enterprise Software Architect and passionate about Open Source. My experience includes banking, insurance, telecommunications and public domain. I teach Java EE 6 at the local University in France. After six years contributing Apache OpenEJB, I was invited to join... Read More →


Tuesday November 18, 2014 11:20am - 12:10pm
Tohotom

1:30pm

Scalable Stream Processing With Apache Samza And Apache Kafka - Martin Kleppmann, LinkedIn
Samza, an Apache Incubator project, is a framework for processing and analysing high-volume data streams. It is built upon Apache Kafka and YARN (Hadoop 2.0). You can think of Samza as a real-time, continuously running version of MapReduce.

In this talk, Martin will show why stream processing is becoming an important part of the architecture of data-intensive applications, alongside storage and batch processing. We will explore how Samza works, and show how it reliably processes millions of messages per second. We will also examine what kinds of applications would benefit from using Samza.

Speakers
avatar for Martin Kleppmann

Martin Kleppmann

Software Engineer, LinkedIn
Martin is committer on Apache Samza and Apache Avro, software engineer at LinkedIn, and author at O'Reilly (currently writing a book on designing data-intensive applications). Previously he co-founded and sold two startups, Rapportive and Go Test It. His technical blog is at http://martin.kleppmann.com... Read More →


Tuesday November 18, 2014 1:30pm - 2:20pm
Arany

1:30pm

Apache Camel In The Belly Of The Docker Whale - Henryk Konsek, Red Hat
Do you enjoy a lightweight messaging with the Apache ActiveMQ? Do you find Apache Camel message routing amusing? Of course, we all do! Do you hate deployment of the messaging solution? Do you hate it with a passion? Of course, we all do...

Is there something we can do to make messaging deployment more fun? Does it have to be so difficult, error prone and annoying?

If you can't sleep at night because you are asking yourself those questions over and over again, this talk is for you. I will tell you what the big blue whale called Docker can do for you, so you can sleep tight again.

Speakers
avatar for Henryk Konsek

Henryk Konsek

Senior software engineer, Red Hat
I create large-scale IoT solutions. Engineer at Red Hat. Rhiot project founder.


Tuesday November 18, 2014 1:30pm - 2:20pm
Tas

1:30pm

Native Code And Off-Heap Data-Structures For Solr - Yonik Seeley, Heliosearch
Off-heap data structures and native code performance improvements for Apache Solr are being developed as part of the Heliosearch project. This presentation will cover the reasons behind these features, implementation details, and performance impacts. Recently developed features will also be covered (i.e. developed after this abstract was written).

Speakers
avatar for Yonik Seeley

Yonik Seeley

Search Engineer, Cloudera
Yonik Seeley is the creator of Solr. He works at Cloudera integrating and leveraging "Big Search" technologies into their advanced platform for machine learning and analytics. Yonik was a co-founder of LucidWorks, and he holds a master's degree in computer science from Stanford U... Read More →


Tuesday November 18, 2014 1:30pm - 2:20pm
Elod/Ond

1:30pm

Anatomy Of An Apache OpenOffice Extension - Pedro Giffuni, Apache Software Foundation
Developing code within the huge, and sometimes daunting, Apache OpenOffice code can sometimes be very difficult, specially if you only want to add a very specific piece of code that few people want.
As part of an ongoing effort to improve the Operation Research capabilities within Apache OpenOffice, I will try to document the different parts of an Apache OpenOffice extension.

Apache OpenOffice supports a framework called UNO, Universal Network Objects, for developing extensions. Extensions can be written in C/C++, Python or Java.

The UNO documentation is somewhat sparse and most of the developers end up taking some existing extension and use it as a template to write their own. This introductory talk will consider the characteristics behind UNO and will provide the basics to write such applications.

Speakers
avatar for Pedro Giffuni

Pedro Giffuni

Developer, Apache OpenOffice - FreeBSD
I am a Mechanical Engineer and I have a M. Sc. in Industrial Engineering but I have always been a hobbyist on computers and Networking. Since my University days (late mid-90's), I have been using FreeBSD actively in system administration. I am a FreeBSD and Apache OpenOffice developer... Read More →


Tuesday November 18, 2014 1:30pm - 2:20pm
Petofi

2:30pm

The Flink Big Data Analytics Platform - Márton Balassi, Hungarian Academy of Sciences & Gyula Fóra
Apache Incubator Flink is a next-generation platform for big data analysis originating from the Stratosphere project (www.stratosphere.eu). Flink offers an alternative runtime engine to Hadoop MapReduce, but uses HDFS for data storage and runs on top of YARN. Flink`s runtime streams data rather than processing them in batch, uses out-of-core implementations for data-parallel processing tasks, degrading to disk if main memory is not sufficient. Flink is programmable via a Java or Scala API that includes functional operators like map, reduce, join, cogroup, and cross. Analysis logic is specified without the need of linking user-defined functions. Flink includes a cost-based program optimizer that picks data shipping strategies. Finally, Flink features support for iterative programs and graph processing programs. As a consequence Flink is currently witnessing its first commercial use cases.

Speakers
avatar for Márton Balassi

Márton Balassi

Solutions Architect, Cloudera
Márton Balassi is a Solution Architect at Cloudera and a PMC member at Apache Flink. He focuses on Big Data application development, especially in the streaming space. Marton is a regular contributor to open source and has been a speaker of a number of Big Data related conferences... Read More →
avatar for Gyula Fóra

Gyula Fóra

Researcher, Distributed Systems, SICS
Gyula is a committer and PMC member for the Apache Flink project, currently working as a researcher at the Swedish Institute of Computer Science. His main expertise and interest is real-time distributed data processing frameworks, and their connections to other big data applications... Read More →


Tuesday November 18, 2014 2:30pm - 3:20pm
Arany

2:30pm

Apache Zookeeper In The Wild - Rakesh Radhakrishnan, Huawei Technologies
A high-performance centralized coordination service is a critical component for any distributed application. Apache Zookeeper is such a component and it has been in the past few years a practical solution to cluster coordination. When running ZooKeeper in production, however, there are many issues that one needs to be aware of, like proper connection management, number of direct children of a single node, herd effect, watcher implementations etc. In this presentation, I will cover some of my experience while running ZooKeeper in production.

Speakers
avatar for Rakesh R

Rakesh R

Technical Lead Engineer, Huawei Technologies
Rakesh is a committer of Apache ZooKeeper project and PMC of Apache BookKeeper project. He is System Architect with Huawei in their Bangalore R&D Center. He works on the enhancement and stabilization of Hadoop platform(especially HDFS), ZooKeeper, BookKeeper components to meet the... Read More →


Tuesday November 18, 2014 2:30pm - 3:20pm
Tas

2:30pm

Apache Bigtop: Saving The Day For Spark And Others - Roman Shaposhnik, Pivotal
Building a working data processing stack using open source or commercial components is a challenging and highly complex task. Multiple often conflicting dependencies, many development teams with different release trains might dictate a substantial coordination effort. A constant flow of new features, bug fixes, and other changes are almost a disaster in making when it comes to the regression and quality control at any stage between development and production environment. Businesses with internal development teams are facing the issues with integration points of their deliverables into the bigger, company wide data platform software. The problem is exaggerated by exponential growth of the standard libraries and transient dependencies. Oftentimes, it is next to impossible to create a well controlled and reproducible system environment in all stages of the platform life-cycle.

Speakers
avatar for Roman Shaposhnik

Roman Shaposhnik

Director of Open Source, Linux Foundation
Apache Software Foundation and Data, oh but also unikernels


Tuesday November 18, 2014 2:30pm - 3:20pm
Tohotom

2:30pm

Web Crawling With Apache Nutch - Sebastian Nagel, Exorbyte GmBH
Apache Nutch is an extensible and scalable web crawler based on Hadoop. This talk gives an overview of the crawler flow of work, its main components, job execution, the underlying data structures, and how it integrates with other Apache projects (Hadoop, Gora, Solr, Tika and HBase). The extensible plugin architecture is demonstrated by giving examples, how plugins help to adapt the crawler to specific use cases.
History, recent, and future developments of the Apache Nutch project are outlined, as well as the two branches under active development: the stable 1.x branch and the 2.x which is based on Apache Gora to abstract from storage back-ends.

Speakers
avatar for Sebastian Nagel

Sebastian Nagel

Crawl Engineer, commoncrawl.org
Sebastian Nagel works as crawl engineer at Common Crawl, a non-profit organization that makes web data freely accessible to everyone. Prior to joining Common Crawl he implemented search and data quality solutions at Exorbyte. Sebastian is a committer and PMC of Apache Nutch, a scalable... Read More →


Tuesday November 18, 2014 2:30pm - 3:20pm
Elod/Ond

3:50pm

Time Series Data With Apache Cassandra - Eric Evans, OpenNMS Group
Whether it's statistics, astronomy, finance, or network management, time series data plays a critical role in analytics and forecasting. Yet, while many tools exist, few are able to scale past memory limits; For those challenged by large volumes of data, there is much room for improvement.

Apache Cassandra is a fully distributed second-generation database. Cassandra stores data in key-sorted order making it ideal for time series, and its high throughput and linear scalability make it well suited to very large data sets.

This talk will cover some of the requirements and challenges of large scale time series storage and analysis. Cassandra data and query modeling for this use-case will be discussed, and Newts, an open source Cassandra-based time series store under development at The OpenNMS Group will be introduced.

Speakers
avatar for Eric Evans

Eric Evans

Senior Software Engineer, Wikimedia Foundation
Eric has more than a decade of experience with the engineering and operations of large-scale distributed systems. He joined Rackspace as a startup, and implemented a global DNS infrastructure utilizing IP anycast (possibly the first), and a novel data-center-wide IDS for which a patent... Read More →


Tuesday November 18, 2014 3:50pm - 4:40pm
Arany

3:50pm

Security As A Service Leveraged By Apache Projects - Oliver Wulff, Talend
This talk illustrates how you can build a Security solution for Web Applications and Web Services by combining different Apache Projects. This solution can be used for internal applications and applications hosted in the cloud by supporting federation.
The talk focus on the Apache Projects CXF/Fediz and Syncope. Apache Fediz is a subproject of the CXF project which provides a Web Services standard with wide range on WS-* support in the security space. Fediz on the one hand provides Web Application SSO and on the other hand integrates with the CXF Web Services security layer. This allows to support impersonation end-to-end based on industry standards. Apache Syncope is a perfect combination to CXF and Fediz to provide a tooling to manage identities and its permissions.
The talk finishes with the demonstration of the stack CXF, Fediz and Syncope.

Speakers
OW

Oliver Wulff

Solution Architect, Talend
Oliver Wulff works as a Solution Architect for Talend. He is actively working with Talend ESB in customer projects, which is based on the Apache projects CXF, Camel and Karaf. Oliver is an active commiter to the Apache CXF project as well. His passion is security where he supports... Read More →


Tuesday November 18, 2014 3:50pm - 4:40pm
Huba

3:50pm

Enabling IoT With Apache Etch And Coap - Giorgio Zoppi, Universitat Pompeu Fabra
The talk introduces the work that we have done about Apache Etch framework and shows the simple steps which have to be done to enable an application to be used as part of the Internet of the things with COAP Protocol.
It shows the features of Etch, such as fully bidirectional client to server communication, synchronous and asynchronous calls, data modelling, error handling and performance and its new binding with COAP as transport protocol. Adding this new protocol to Apache Etch enables its stack to communicate efficiently with Internet Web of Things in language neutral way. The Apache Ecosystem will benefit from those changes to Etch, because it will have a new way to implement network services for constrained resources network (i.e. wireless network or sensor networks).

Speakers
GZ

Giorgio Zoppi

Firmware Engineer, Indipendent
It is a personal project for investigating new Internet of Things world and try to connect it to a board. In my past I worked as consultant in Ericsson Research. My last public speech was a couple of months ago at Python Meetup Barcelona, where I spoke about web2py.


Tuesday November 18, 2014 3:50pm - 4:40pm
Tas

3:50pm

Multi Language Content Discovery Through Entity Driven Search - Antonio Morales, Zaizi
This presentation is about a Semantic Search Engine built on top of an Apache projects stack.
The scope of the system is to extract content from heterogeneus data sources, process and enrich it , index it and search over this content with advanced User Experience.
This result is achieved with the integration of 4 top level Apache projects : ManifoldCF, Stanbol, Marmotta and Solr.

Apache Manifold is used to access different data sources and extract the content from them: it's the engine that moves the main indexer core of the system. The document extracted are processed in a pipe-line manner, here happens the integration with Apache Stanbol and Apache Marmotta that allows the semantic enrichment of the content according to any Linked Data assets.
The last ring of the chain is custom Search API built on top of Apache Solr that allows advanced search features to improve user experience

Speakers
AM

Antonio Morales

R&D Senior Engineer, Zaizi
Senior Software Engineer working at the R&D division of Zaizi. Computer Engineer and M.Sc Software Engineer and Technology with broad experience in Analysis, Design, Development and Integration of enterprise web, mobile and cloud applications. He is one of the most security expert... Read More →


Tuesday November 18, 2014 3:50pm - 4:40pm
Elod/Ond

3:50pm

How to Turn Your Favorite Programming Language Into An AOO Macro Language - Rony Flatscher, WU Vienna
Apache OpenOffice (AOO) defines a scripting framework that can be exploited to add any programming language to Apache OpenOffice as a macro language. This presentation introduces the necessary overview and knowhow to become able to assess the effort to add your own favorite programming language to AOO. Although the AOO scripting framework is implemented in Java it is possible to add non-Java-implemented programming languages as demonstrated with the programming ooRexx which is itself implemented in C++. In addition all scripting languages that support Java's "javax.script" framework could be added to AOO using AOO's scripting framework.

Speakers
RF

Rony Flatscher

University Professor, WU Vienna
Rony G. Flatscher has been working as an Information System ("Wirtschaftsinformatik") professor at the WU Vienna (with 25,000 business students one of the largest of its kind) where he has been trying to empower the students with IT and IS skills to help them be more productive in... Read More →


Tuesday November 18, 2014 3:50pm - 4:40pm
Petofi

3:50pm

OSGi For Mere Mortals - Bertrand Delacretaz, Adobe
In the last few years, OSGi has become ""the"" module system for Java - but is OSGi just for gurus, or are mere mortals actually able to use it?
The simple tutorial RESTful server application presented in this talk aims to demonstrate that the average Java developer can actually use OSGi, and greatly benefit from it. It is built from scratch based on a set of standard and custom OSGi services, in a simple and understandable way.
Taking advantage of a number of build plugins and runtime tools provided by the Apache Felix and Apache Sling projects allows us to write little code in our example application, while exposing the advantages of an OSGi-based architecture in a simple and convincing way.

Speakers
avatar for Bertrand Delacretaz

Bertrand Delacretaz

Principal Scientist, Adobe
Bertrand Delacretaz works as a Principal Scientist in the AEM R&D team at Adobe’s Basel office, using open source tools to create world-class content management and digital marketing systems. Bertrand is an active member and current Director of the Apache Software Foundation... Read More →


Tuesday November 18, 2014 3:50pm - 4:40pm
Tohotom

4:50pm

Apache Giraph: Start Analyzing Graph Relationships In Your Big Data In 45 Minutes (Or Your Money Back)! - Roman Shaposhnik, Pivotal
The genesis of Hadoop was in analyzing massive amounts of data with a mapreduce framework. SQL­-on­Hadoop has followed shortly after that, paving a way to the whole schema-­on­-read notion. Discovering graph relationship in your data is the next logical step. Apache Giraph (modeled on Google’s Pregel) lets you apply the power of BSP approach to the unstructured data. In this talk we will focus on practical advice of how to get up and running with Apache Giraph, start analyzing simple data sets with built­-in algorithms and finally how to implement your own graph processing applications using the APIs provided by the project. We will then dive into how Giraph integrates with the Hadoop ecosystem (Hive, HBase, Accumulo, etc.) and will also provide a whirlwind tour of Giraph architecture.

Speakers
avatar for Roman Shaposhnik

Roman Shaposhnik

Director of Open Source, Linux Foundation
Apache Software Foundation and Data, oh but also unikernels


Tuesday November 18, 2014 4:50pm - 5:40pm
Arany

4:50pm

LDAP Testing: Does It Have To Be A Nightmare? - Emmanuel Lecharny, Symas
When it comes to testing the LDAP part of an application, it's usually a task that is seen as a punishment by many developpers. The JNDI API does not help, nor does the fact most of the time, you have to reset your LDAP server between each test.

We will expose a better way to do LDAP based tests.

Apache Directory Server has some revert mechanisms that spare developers the need of resetting the LDPA server between tests. We also have a full annotation based solution to initialize a LDAP server in unit tests.

Apache LDAP API is way simpler than JNDI and has extended features that makes it easer to use for any Java developper.

We will present those features with simple tests and some more complex.

Speakers
avatar for Emmanuel Lecharny

Emmanuel Lecharny

Architect, Symas
Emmanuel Lécharny is an Apache Member, former chairman of the Apache Directory project. | | He is currently working for Symas. | | He has been working on Apache Directory project for more than 10 years.


Tuesday November 18, 2014 4:50pm - 5:40pm
Petofi

4:50pm

Set Up And Use Apache Openmeetings In 30 Minutes - Maxim Solodovnik, UniPro
Apache Openmeetings is the open source solution to organize appointments, meetings, presentations and online classes. Unfortunately installing and configuring it is one of the hardest tasks potential customers are facing. During tutorial Maxim Solodovnik will show installing and configuring is not "a big deal".

Speakers
avatar for Maxim Solodovnik

Maxim Solodovnik

Senior Software Developer, Unipro
Apache Openmeetings Chair/Commiter/PMC | Apache member | Speaker at ApacheConEU 2014: "Set up and use Apache OpenMeetings in 30 minutes" | | I'm software developer currently working at Unipro (www.unipro.ru), Novosibirsk, Russia. I currently developing both Web-based and... Read More →


Tuesday November 18, 2014 4:50pm - 5:40pm
Kond

4:50pm

JBatch++ With Apache BatchEE - Mark Struberg
Apache BatchEE is an implementation of the JSR-352 JBatch specification.

BatchEE is based on the original reference implementation but includes many bug-fixes and improvements.
In addition to the jbatch core module we also added many useful helpers which do make batch programming a breeze.
We did also add a few ways to control your batch jobs, e.g. via JAX-RS, a simple GUI or even as stand-alone command line client. That way you can use BatchEE in JavaEE 5, 6 and 7 environments and even in Java SE.

After an introduction to writing JBatch batches in general we will showcase fancy features of BatchEE and we surely have some best-practice tips to share.

Speakers
MS

Mark Struberg

Mark Struberg is a software architect with over 20 years of programming experience. He has been working with Java since 1996 and is actively involved in open source projects in the Java and Linux area. He is Apache Software Foundation member and serves as PMC and Committer for Apache... Read More →


Tuesday November 18, 2014 4:50pm - 5:40pm
Elod/Ond

4:50pm

Weaving Web Applications With Ease - Apache Tapestry - Ulrich Staerk, Senacor
Apache Tapestry is a component-oriented Java web application framework with a focus on simplicity, ease of use, and developer productivity. Some of Tapestry’s features include live class reloading in development mode, convention over configuration - eliminating almost all XML configuration, Post/Redirect/Get, and an integrated Inversion of Control (IoC) container, which allows almost all of Tapestry’s behavior to be configured or replaced.
Three years after the last major release of Apache Tapestry, 5.4 is just around the corner. In this talk Uli will give an introduction to Tapestry and speak about the new features of 5.4, such as the new framework agnostic JavaScript layer and Bootstrap integration.

Speakers
US

Ulrich Staerk

Senior Developer, Senacor Technologies
Uli currently is a Senior Developer at Senacor working on IT transformation projects. He's a member of the Apache Tapestry Project Management Committee and leads the Apache’s involvement in Google Summer of Code. Prior to his current job he was a research associate at Freie Universität... Read More →


Tuesday November 18, 2014 4:50pm - 5:40pm
Tohotom
 
Wednesday, November 19
 

9:30am

OSv: Probably The Best OS For Cloud Workloads You've Never Heard Of - Roman Shaposhnik, Pivotal
OSv is the revolutionary new open source technology that combines the power of virtualization and micro-services architecture. This combination allows unmodified applications deployed in a virtualized environment to outperform bare-metal deployments. Yes. You've heard it right: for the first time ever we can stop asking the question of how much performance would I lose if I virtualize. OSv lets you ask a different question: how much would my application gain in performance if I virtualize it. This talk will start by looking into the architecture of OSv and the kind of optimizations it makes possible for native, unmodified applications. We will then focus on JVM-specific optimizations and specifically on speedups available to ASF projects when they are deployed on OSv

Speakers
avatar for Roman Shaposhnik

Roman Shaposhnik

Director of Open Source, Linux Foundation
Apache Software Foundation and Data, oh but also unikernels


Wednesday November 19, 2014 9:30am - 10:20am
Arany

9:30am

Lizard - Clustering An RDF Triplestore - Andy Seaborne, Epimorphics Ltd
This is the story of building a clustered SPARQL database. Demand on
triplestores is growing, both in terms of scale and reliable operation.
Replication and clustering are needed for fault tolerant operation and for
larger datasets. Server replication of a single machine database only get so far.
It is easier to have a system designed to harness the capabilities of
several machines in one system. The goal of ""Lizard"" is to provide a solid
platform for triplestore operation.

In this presentation, Andy will describe how ""Lizard"" takes the Apache Jena
TDB database design and extends it to cluster operation. The talk will
cover the core principles of the TDB design and SPARQL execution, the
choices to be made in the ""Lizard"" design that rework the system for a
medium scale cluster system, and report of the current status.

Speakers
avatar for Andy Seaborne

Andy Seaborne

Andy works on infrastructure for linked data graph systems. He was lead editor for SRARQL, the RDF Query Language. Andy developed the ARQ query engine which is released as part of Apache Jena where he is a committer. He has spoken at conferences and at developer events about linked... Read More →


Wednesday November 19, 2014 9:30am - 10:20am
Petofi

9:30am

Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenko, STYLIGHT
Nowadays there are plenty of solution to build a search subsystem. The question is how to keep such a system flexible and easy to react on data-driven decisions, constantly improve the quality. In talk are presented lessons learned from our experience of building lean ranking infrastructure, that could be used with data-driven approach in product development. With slides we walk through the process of scaling out the search system from a couple to 13 countries around the world, but keeping flexibility, that allows to test hypothesis on different levels and perform a/b testing in different dimensions.

Speakers
avatar for Sergii Khomenko

Sergii Khomenko

Data Scientist, STYLIGHT
Data scientist at one of the biggest fashion communities, STYLIGHT. Data analysis and visualisation hobbyist, working on problems not only in working time but in free time for fun and personal data visualisations. | | Speaker at different conferences: Berlin Buzzwords 2014, ApacheCon... Read More →


Wednesday November 19, 2014 9:30am - 10:20am
Elod/Ond

9:30am

Deploying OSGi On An Army Of Cubietrucks - Frank Lyaruu, Sendrato
Our challenge was to track the location 200k people at a large festival using custom made wristbands, on a tight budget and preferably with only open source software. In this presentation we will briefly outline the overall system architecture, and subsequently focus on the software stack used to make this work. In essence, we need the versatility of cloud computing, but in a ""camping-style data centre"" We will show how Apache Felix, an OSGi runtime, provides the necessary mix of dynamism and efficiency to pull this off.

Speakers
avatar for Frank Lyaruu

Frank Lyaruu

Full-stack Developer and CTO, Dexels
Full-stack developer and CTO at Dexels in Amsterdam and architect at Sendrato. We’re mostly active in the Sports and Entertainment branch, and recently also in Wearable computing. We’ve recently converted the infrastructure of a large client to run OSGi using Apache Karaf, and... Read More →


Wednesday November 19, 2014 9:30am - 10:20am
Tas

10:40am

What's With The 1S And 0S? Making Sense Of Binary Data At Scale With Tika And Friends - Nick Burch, Quanticate
If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are Apache projects out there which can help!

In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to do all of this with Apache Tika, and how to dive down to the underlying libraries (including its Apache friends like POI and PDFBox) for specialist cases. Finally, we'll look a little bit about how to roll this all out on a Big Data or Large-Search case.

Speakers
avatar for Nick Burch

Nick Burch

CTO, Quanticate
Nick began contributing to Apache projects in 2003, and hasn't looked back since! Most of the projects Nick has worked in belong in the "Content" space, such as Apache POI (ex-PMC Chair), Apache Tika and Apache Chemistry. As well as coding projects, Nick is also involved in a number... Read More →


Wednesday November 19, 2014 10:40am - 11:30am
Arany

10:40am

Manage Linked Media With Apache Marmotta - Thomas Kurz, Redlink GmBH
The integration of multimedia assets on the web with structured (linked) data promises further opportunities for digital market places regarding findability and recommendations. The new W3C standards for Media Annotation, Media Fragment UIRs and Linked Data Platforms build a stable base for this purpose. Thomas Kurz shows how to use the Linked Data Platform Apache Marmotta in combination with the Red5 Media Server as a backend for the storage and retrieval of Linked Media. In his talk he is going to show extensions for a seamless integration of media streaming for Non-RDF resources and spatio-regional media fragment retrieval with SPARQL.

Speakers
TK

Thomas Kurz

Co-Founder, Salzburg Research
Thomas Kurz is co-founder of Redlink GmbH, Senior Researcher at Salzburg Research, and contributor to Apache Marmotta. Thomas is working mostly on Open Source projects within large-scale National and European research projects like KiWi, SNML and MICO. He presented his work in many... Read More →


Wednesday November 19, 2014 10:40am - 11:30am
Petofi

10:40am

“Your Search Doesn’t Work.” - How to Find Out Whether or Not the Search Box You Offer Users is Helping - Isabel Drost-Fromm, Elasticsearch
Web sites without search functionality are unimaginable today - you search for comments and code on github, you look for books in your favourite webshop, you use the search box of your favourite blog to find articles.

When offering your search for your own application - how do you know that your search actually provides a benefit to the user instead of causing lots of frustration over results not found? Only checking that the favourite book about witches of your child is ranked top of all children books clearly doesn’t help.

This talk will walk you through the options of determining search quality - from purely offline metrics that work even before deploying version 1.0 to production to online A/B testing to check continuous improvement. I will highlight some Lucene and Elasticsearch features that can tremendously help you deploy your own search quality checks.

Speakers
avatar for Isabel Drost-Fromm

Isabel Drost-Fromm

Open Source Strategist, Europace AG
Isabel Drost-Fromm is Open Source Strategist at Europace AG Germany. She's a member of the Apache Software Foundation, co-founder of Apache Mahout and mentored several incubating projects. Isabel is interested in all things FOSS, search and text mining with a decent machine learning... Read More →


Wednesday November 19, 2014 10:40am - 11:30am
Elod/Ond

10:40am

Collaborated Editing On ODF - Svante Schubert, OASIS ODF Advanced Collaboration Subcommittee
Automated Changes on OpenDocuments (Svante Schubert, Freelancer) - The OpenDocument format (ODF) consists of compressed XML files. ODF is read and written by many Office applications; the Apache ODF Toolkit (incubating) provides the ability of an easy automated access. In this presentation, Svante Schubert will describe how recent changes of the OpenDocument format in the OASIS standard for version 1.3, allow interoperability for real-time editing and how this is being mapped to the ODF Toolkit in a way that even collaboration with OOXML and HTML applications (such as browsers) is possible.

Speakers
avatar for Svante Schubert

Svante Schubert

Chair, Freelancer
Svante works as freelancer on Document Formats, focusing on the OpenDocument Format. He is the submitter of the actual change-tracking proposal of the OASIS ODF Advanced Collaboration Subcomittee, where he recently become the chair. He had been working for Sun Microsystems for twelve... Read More →


Wednesday November 19, 2014 10:40am - 11:30am
Huba

10:40am

OSGi Remote Services In A Microservices World - Marcel Offermans, Luminis Technologies
In a Microservices architecture applications get decomposed into functionally cohesive services that are implemented as separate processes that interact through the network using simple protocols. This allows applications to scale during deployment by distributing and replicating the services across compute resources as required.

However popular, common criticism includes the use of implicit and poorly defined interfaces, the fact that every component must implement remoting concerns and the fact that it introduces significant processing overhead as every service invocation traverses the network, even if the services are on the same box!

In this presentation we discuss how OSGi Remote Services can address these concerns in an OSGi context by providing the developer strongly typed services that may or may not actually be transparently remoted across a scaled-out deployment.

Speakers
MO

Marcel Offermans

Director, Luminis Technologies
Marcel Offermans is the Director at Luminis Technologies. He has been involved in lots of commercial OSGi projects for over eight years and is responsible for the architecture, development and support of an OSGi and cloud based software provisioning product. He is also a member of... Read More →


Wednesday November 19, 2014 10:40am - 11:30am
Tas

10:40am

Wicket And Java EE Sitting In A Tree - Martijn Dashorst, Topicus B.V.
Apache Wicket strives to enable developers to be very productive and craft maintainable web applications. Java EE also enables developers to achieve high productivity. So what happens when you combine both technologies? In this session Martijn Dashorst shows how to leverage the available Java EE technologies such as CDI, JPA, Bean Validation and JAX-RS in your Wicket applications.

Speakers
MD

Martijn Dashorst

Software engineer, Topicus B.V.
Martijn Dashorst has been involved with Apache Wicket since it was made open source over ten years ago. He is a proud developer for over 17 years. At Topicus he helps maintain and create Wicket applications for the majority of educational professionals in the Netherlands. Martijn... Read More →


Wednesday November 19, 2014 10:40am - 11:30am
Tohotom

10:40am

Where Is Apache Infrastructure Going? - David Nalley, Apache Software Foundation
What's going on in ASF Infrastructure? What changes have we seen in the past 6 months and what does the future hold? Is everything moving to the cloud? What exactly is the scope of Infrastructure at the ASF?

Speakers
DN

David Nalley

VP of Infrastructure, ASF
David Nalley is the Vice President, Infrastructure at the Apache Software Foundation


Wednesday November 19, 2014 10:40am - 11:30am
Grand Ballroom

11:40am

Introduction To Apache jclouds - Ignasi Barrera, Abiquo
Apache jclouds is an open source library that helps you get started in the cloud and utilizes your Java or Clojure development skills. The jclouds API gives you the freedom to use portable abstractions or cloud-specific features. It's a multi-cloud toolkit that works with both public and private clouds.

There is a great community behind this toolkit working together to provide a better experience for developers in the cloud. Their goal is to simplify the control of many different clouds while still giving you the freedom to use cloud-specific features. The result is a toolkit that allows developers to write better code, in a shorter period of time, that works with supported clouds.

Learn about what jclouds can do for you.

Speakers
avatar for Ignasi Barrera

Ignasi Barrera

Founding Engineer, Tetrate
Ignasi is an engineer that has been building hybrid multi-cloud platforms and SDN solutions for more than 8 years. He is a long-term open source contributor, especially to the Apache jclouds project (a Java library that provides common APIs for all the major cloud providers). In 2015... Read More →


Wednesday November 19, 2014 11:40am - 12:30pm
Dery/Mikszath

11:40am

Jenkins At Apache: Our Current Status, Future Plans And What We've Learned - Andrew Bayer, Cloudera
Apache’s Jenkins service at builds.apache.org ranks as one of the largest in Jenkins’ publicly available usage statistics, in number of jobs and builds. Providing such a massive scale Jenkins service presents a number of challenges, and we’ve been working hard to improve Apache’s offering. Andrew Bayer will talk about where we were earlier in 2014, what we’ve done to get to the more reliable and useful state we’re in now, and what’s planned for the future. Andrew will also give some pointers on best practices for users of builds.apache.org to get the most out of Apache’s Jenkins service, as well as some lessons we’ve learned that may be helpful to others running massive scale Jenkins setups.

Speakers
AB

Andrew Bayer

Build and Tools Architect, CloudBees
Andrew Bayer is an Apache member, a PMC member on a number of projects and a member of the ASF Infra team.


Wednesday November 19, 2014 11:40am - 12:30pm
Grand Ballroom

11:40am

Open Annotation Support For Apache Stanbol - Rupert Westenthaler, Salzburg Research
Open Annotation is an W3C working draft that uses RDF to (formally) describe - annotate - associations between related resources. In this presentation Rupert Westenthaler will show how Apache Stanbol can be configured to use an Annotation Model compatible to this recommendation; This presentation will first provide an overview on Open Annotation and present strengths and weaknesses of it when used for the Stanbol Enhancer. The second section will present the Open Annotation compatible annotation model for the Stanbol Enhancer. and how this model is generated based on the currently used one. The final section will show how to configure Stanbol to use the new model and conclude with a demonstration.

Speakers
RW

Rupert Westenthaler

Researcher, Salzburg Research
Rupert Westenthaler is co-founder of Redlink GmbH, Researcher at Salzburg Research and contributor to Apache Stanbol as well as some other Apache projects. Rupert is a Software Engineer specialized on applied research. His main interests are Natural Language Processing, Information... Read More →


Wednesday November 19, 2014 11:40am - 12:30pm
Petofi

11:40am

Flexible Search In Apache Jackrabbit Oak - Tommaso Teofili, Adobe Systems
Apache Jackrabbit Oak is the next generation content repository based on the JCR specification, designed to be scalable for high read/write throughput, huge number of nodes in the repository and highly concurrent operations. In this presentation Tommaso Teofili will describe the flexible and pluggable search architecture of Oak which allows to define multiple indices to address specific types of queries with specific constraints for performant indexing and searching. A deeper focus on the Apache Lucene and Apache Solr based index implementations will be given, showing some insights on how they have been integrated to address hierarchical content search together with some performance benchmarks and real life use cases.

Speakers
avatar for Tommaso Teofili

Tommaso Teofili

Software Engineer, Adobe Systems
Open source enthusiast and member at the Apache Software Foundation, working as a software engineer for Adobe Systems on data replication and search. Passionate about natural language processing and machine learning.


Wednesday November 19, 2014 11:40am - 12:30pm
Elod/Ond

11:40am

Enterprise Development With Apache Karaf - Achim Nierbeck, Apache Software Foundation
Apache Karaf is an enterprise grade application server for running OSGi applications. This presentation will give you an idea of how to build Enterprise Applications with Apache Karaf. Achim Nierbeck will cover a broad range of possibilities to use Apache Karaf in an enterprise environment. For example how to combine JPA, web-development and Clustering, all based on the Open-Source tools Apache Karaf, OPS4j Pax Web and Apache Karaf Cellar.

Speakers
avatar for Achim Nierbeck

Achim Nierbeck

Senior IT Consultant, codecentric
Achim Nierbeck is a senior IT Consultant working for codecentric AG in Karlsruhe. He has 15 years experience of working in the field of Java Enterprise. In his Private time the Apache Member works on OpenSource Technology like the OSGi Server Apache Karaf and the OSGi Web-Container... Read More →


Wednesday November 19, 2014 11:40am - 12:30pm
Tas

11:40am

Apache Wicket: 10 Years And Beyond - Martijn Dashorst, Topicus B.V.
With the Tenth Anniversary of Wicket behind us, Wicket is still one of the thriving survivors of the Great Web Framework Wars of the mid 00's. Is there a future for server-side frameworks? In this presentation Martijn Dashorst provides a brief history of Wicket.

With a State of the Wicket, Martijn will look at who is currently using Wicket, the community and current release plans.

The majority of this session will be dedicated to the future of Wicket: does a component oriented, Java web framework have a future in the era of native clients and client side JavaScript frameworks? Martijn will layout the plans of making Wicket more productive for current users, on integrating better with JavaEE technologies and much more.

Speakers
MD

Martijn Dashorst

Software engineer, Topicus B.V.
Martijn Dashorst has been involved with Apache Wicket since it was made open source over ten years ago. He is a proud developer for over 17 years. At Topicus he helps maintain and create Wicket applications for the majority of educational professionals in the Netherlands. Martijn... Read More →


Wednesday November 19, 2014 11:40am - 12:30pm
Tohotom

2:00pm

Introduction To Apache Slider - Steve Loughran, Hortonworks
With YARN, Apache Hadoop can deploy distributed applications —applications which can dynamically expand or contract their size based on demand or other factors. It allows the application to choose the placement of distributed components within the cluster, as well as their resource requirements such as CPUs and memory. As YARN evolves to support long-lived services, YARN applications can become services supporting broader uses.

Taking advantage of these features has required the application to be rewritten as a YARN application, central to which is the Application Master —a process which manages the allocation of components across the cluster, deployment of these components, as well as failure handling.

The Apache Slider project can deploy distributed applications without requiring them to be ported to YARN: Slider provides the Application Master and the allocation, deplo

Speakers
avatar for Steve Loughran

Steve Loughran

Member of Technical Staff, Hortonworks
Steve Loughran is a developer at Hortonworks, where he works on leading-edge Hadoop applications, most recently on Apache Slider and on Apache Spark's integration with Hadoop and YARN, and Hadoop's S3A connector to Amazon S3. He's the author of Ant in Action, a member of the Apache... Read More →


Wednesday November 19, 2014 2:00pm - 2:50pm
Arany

2:00pm

Cloud Storage With Apache jclouds - Andrew Gaul
Object stores provide a flexible, scalable, and cost-effective abstraction to build applications in both the public and private cloud. Apache jclouds abstracts all major providers, allowing Java source-level compatibility while providing access to vendor-specific interfaces. This talk will cover how to integrate jclouds into applications and how to manage containers and objects. It will also discuss differences between providers, scaling out the number of clients and number of objects, and tradeoffs in application design.

Speakers
avatar for Andrew Gaul

Andrew Gaul

Bounce Storage
I am broadly interested in storage technologies and have worked in cloud storage, network file systems, and block storage. I have specific interests in object storage and cross-cloud interoperability. I created S3Proxy and actively contribute to Apache jclouds. I co-founded Bounce... Read More →


Wednesday November 19, 2014 2:00pm - 2:50pm
Dery/Mikszath

2:00pm

Improved OVA Support In CloudStack - Likitha Shetty, Citrix
Improved OVA support in CloudStack’ is a feature targeted for the 4.5 release of CloudStack. With this feature in place, users will be able to on-board an existing Virtual Machine into CloudStack with all of its disks intact.

The talk will focus on the architectural and design changes that will be made and on the new concepts that will be introduced into CloudStack as part of this feature. In addition, the talk will cover what the initial approach to the problem was and how based on community feedback it evolved into a better solution that aligns with how resources are traditionally created and orchestrated in CloudStack. It will also shed some light on how we could leverage this development work to add new functionalities going forward for e.g. adding support for other information encapsulated in an OVA like ISOs, networks, device controller types etc.

Speakers
LS

Likitha Shetty

CITRIX
Likitha has been working on CloudStack since joining Citrix in late 2011, right after finishing her undergraduate studies in Computer Science. Her development work was initially focused on API and network orchestration layer, specifically EC2 integration and dedication of network... Read More →


Wednesday November 19, 2014 2:00pm - 2:50pm
Elod/Ond

2:00pm

Quadrupling Your Elephants - RDF And The Hadoop Ecosystem - Rob Vesse, YarcData
The Hadoop ecosystem is excellent in providing a large variety of data processing paradigms and supports a wide variety of data formats. However support for RDF within this ecosystem is often non-existent or confined to academic research which has yet to make the jump to production ready systems. In this talk I will present several projects I've been involved in both inside and outside the ASF which are looking at bringing production ready RDF support to the Hadoop ecosystem.

Speakers
avatar for Rob Vesse

Rob Vesse

Software Engineer, YarcData
Rob Vesse is a Software Engineer at YarcData (part of Cray Inc) working on their Urika Graph Analytics platform. He is their resident RDF, SPARQL and Semantic Web expert having been heavily involved in these areas for the past 5 years. | | Currently he is a committer and PMC member... Read More →


Wednesday November 19, 2014 2:00pm - 2:50pm
Petofi

2:00pm

Felix Http - Paving The Road To The Future - Jan Willem Janssen & Marcel Offermans, Luminis Technologies
Developing modular web-based applications is easy using the HTTP-service implementation of Apache Felix. It hides the complexity and boilerplate code needed to get your servlets and filters up and running in no-time. With the recent effort in aligning the HTTP-service with the latest servlet specifications, new functionality like asynchronous servlets and filters will be supported out-of-the-box.
In this presentation we will not only talk about the new features provided by the Felix HTTP-service implementation, but also demonstrate how to use them in your project. In addition, we talk about upcoming improvements to the Felix HTTP-service, like native support for SPDY and WebSockets.

Speakers
avatar for Jan Willem Janssen

Jan Willem Janssen

Software Architect, Luminis Technologies
Jan Willem Janssen works as a software architect at Luminis Technologies and has several years of experience in various areas of software development. He has a strong interest in software architecture, modularity, security and creating scalable software systems. He actively participates... Read More →
MO

Marcel Offermans

Director, Luminis Technologies
Marcel Offermans is the Director at Luminis Technologies. He has been involved in lots of commercial OSGi projects for over eight years and is responsible for the architecture, development and support of an OSGi and cloud based software provisioning product. He is also a member of... Read More →


Wednesday November 19, 2014 2:00pm - 2:50pm
Tas

3:00pm

The Other Apache Technologies Your Big Data Solution Needs - Nick Burch, Quanticate
In this talk, we'll take a look at a range of projects from the Apache Software Foundation, looking at those which complement the "headline projects" to build out your big data solution. While we can't cover every project at Apache (there are just too many these days!), we'll take a tour through some of the up-coming and lesser-known established projects out there, those that should prove very helpful to you in building your big data solution. We'll see that Apache is more than just the webserver, Hadoop and Lucene, and with any luck point you at projects that'll save you time and effort!

Speakers
avatar for Nick Burch

Nick Burch

CTO, Quanticate
Nick began contributing to Apache projects in 2003, and hasn't looked back since! Most of the projects Nick has worked in belong in the "Content" space, such as Apache POI (ex-PMC Chair), Apache Tika and Apache Chemistry. As well as coding projects, Nick is also involved in a number... Read More →


Wednesday November 19, 2014 3:00pm - 3:50pm
Arany

3:00pm

Taming The Cloud Database With Apache jclouds - Zack Shoylev, Rackspace
Managing application infrastructure is not what developers want to spend time figuring out. The challenge of quickly automating facets of application development such as database infrastructure setup is still present in many cloud application projects. This presentation will introduce developers to current cloud services (private and public clouds, terminology, service types, OpenStack) and relevant Java SDKs. This will include discussing the advantages of using open source, portable, and provider-agnostic solutions versus more closed systems. The talk will describe high-level jclouds architecture and present code examples to manage cloud-based database infrastructure using OpenStack Trove. Due to popular demand, a short tutorial on implementing abstraction layers and custom APIs for databases in jclouds will also be presented.

Speakers
avatar for Zack Shoylev

Zack Shoylev

Software Developer, Rackspace
Zack is a Software Developer for Rackspace and a contributor to the Apache jclouds project. In addition to working on code, Zack often puts on his Developer Advocate hat and has previously given talks about jclouds and the cloud at ApacheCON and JavaOne.


Wednesday November 19, 2014 3:00pm - 3:50pm
Dery/Mikszath

3:00pm

Towards a Commons RDF Library - Sergio Fernández, Redlink GmbH
In the Java world there is an historical incompatibility issue between the two major RDF toolkits: Apache Jena and OpenRDF Sesame. But behind this effort there is a completely different approach than before. This library does not pretend to be a generic API wrapping those libraries, but a set of interfaces for the RDF 1.1 concepts that can be used to expose cores concepts using common Java interfaces. In the initial phase commons-rdf is focused on a subset of the core concepts defined by RDF 1.1 (URI/IRI, Blank Node, Literal, Triple, and Graph). In particular, Commons RDF aims to provide a type-safe, non-general API that covers RDF 1.1. In a future phase we may define interfaces for Datasets or Quads.

Counts in core contributors from both Jena and Sesame, But it will live or die on its own merits!

The working repository can be found at http://github.com/wikier/commons-rdf

Speakers
avatar for Sergio Fernández

Sergio Fernández

Software Engineer, Redlink GmbH
I'm a Software engineer specialized in innovation, with a focus on Data Architectures. My interests include Distributed Architectures, Data Integration, Linked Data and System Engineering. I've worked as software engineer and project manager in different industries, but always somehow... Read More →


Wednesday November 19, 2014 3:00pm - 3:50pm
Petofi

3:00pm

Reflections on the Design of Business Applications - Christian Schneider, Talend
In this talk I will look into some good and bad examples of how to structure your business applications with a special focus on OSGi. I will show some anonymized examples encountered at customers of good and bad design and reflect on how the bad ones can be improved. Some highlights will be technical vs business focus in the structure, separating business code from technical code, Minimizing dependencies, designing for change

Speakers
avatar for Christian Schneider

Christian Schneider

Open source Architect, Talend
Christian is an Open Source Architect at Talend. He was the responsible architect for the Services Oriented Architecture of EnBW Trading GmbH (Trading floor of one of Germany's largest utility companies). He specializes in OSGi and practical integration solutions.


Wednesday November 19, 2014 3:00pm - 3:50pm
Tas

3:00pm

How Secure Is Your Web Framework? - Lukasz Lenart, ASF
In this talk I would like to present few recently discovered security vulnerabilities in the Apache Struts 2 web framework and based on that try to increase developers' awareness of the overall application security. In my opinion, nowadays developers don’t care about the security too much, they depend on what the framework gives them. Quite often, they even don’t check security bulletins to see if a new vulnerability wasn’t discovered recently in his/her famous framework. I would like show few web attacks and explain how it was possible to harm an application or even entire server, how to prevent such kind of attacks in the future and so on. Examples will base on the Apache Struts 2, but almost all the same problems were pointed out in the Spring MVC or in the Apache Wicket. Each framework has its own weaknesses :-)

Speakers
avatar for Lukasz Lenart

Lukasz Lenart

SoftwareMill
Lukasz Lenart, a programmer with fondness, it has always been his hobby, until became commercial activity. Believes that a good programmer should not be dependent on the language, but rather look to the future and try various languages and technologies, depending on requirements... Read More →


Wednesday November 19, 2014 3:00pm - 3:50pm
Elod/Ond

3:00pm

Tapestry 5.4: Teach An Old Horse New Tricks And Why Full-Stack Frameworks Still Matter - Thilo Tanner, RepRisk AG
The first part of the presentation will give an introduction to the Apache Tapestry framework and its core concepts. The talk will cover the pros and cons of component-based web development and how the builtin IOC can help to structure applications and adjust the internal behavior of Tapestry. The first part of the presentation will also focus on the JavaScript stack, which has been completely re-written for the upcoming 5.4 release. The second part of the presentation will share the experience of how Tapestry helped RepRisk solve everyday development tasks, how it improved our development, testing and deployment process​, and why we think Tapestry increased our software quality.

Speakers
TT

Thilo Tanner

IT Lead, RepRisk AG
Thilo Tanner works as IT Lead for RepRisk AG, a global business intelligence provider on environmental, social and governance risks associated with companies and projects. Thilo has a wide experience in the field of big data processing and integration solutions and can look back on... Read More →


Wednesday November 19, 2014 3:00pm - 3:50pm
Tohotom

3:00pm

Ask Infra - David Nalley, Apache Software Foundation
This panel will involve folks who work on the ASF Infrastructure team. Feel free to ask the hard questions: When will we bring back CVS or RCS support? How soon will Atlassian Stash be implemented? Can we please stop email, again? We welcome these questions and more.

Speakers
DN

David Nalley

VP of Infrastructure, ASF
David Nalley is the Vice President, Infrastructure at the Apache Software Foundation


Wednesday November 19, 2014 3:00pm - 3:50pm
Grand Ballroom