Virtual Conference 2020
Meet the experts from global companies like Cloudera, Google, Godaddy, Netflix, Splunk and more, who have built scalable streaming infrastructure and enterprise-grade applications.
Hear why and how they use Flink as the stream processing engine of choice for large-scale stateful applications, including real-time analytics, real-time search and content ranking, fraud/anomaly/threat detection.
Cloudera
Streaming Engineering Lead / Apache Flink PMC at Cloudera
Marton is a Flink PMC member and one of the first contributors to the streaming API. He has driven big data adoption at around 50 customers as a Senior Solutions Architect at Cloudera during the last four years. He is the manager of the newly formed Streaming Analytics team and focuses on adding Flink to the Cloudera platform.
Apache Flink - Completing Cloudera’s End to End Streaming Platform
Ververica
CTO & Co-founder at Ververica
Stephan Ewen is CTO and co-founder at Ververica where he leads the development of the stream processing platform based on open source Apache Flink. He is also a PMC member and one of the original creators of Apache Flink. Before working on Apache Flink, Stephan worked on in-memory databases, query optimization, and distributed systems. He holds a Ph.D. from the Berlin University of Technology.
Introducing Stateful Functions 2.0: Stream Processing meets Serverless Applications
Ververica
Head of Product at Ververica
As Head of Product for Ververica Platform Konstantin is responsible for Ververica's commercial product, an enterprise-ready stream processing platform based on Apache Flink. Previously, he was leading the solutions architecture team and helping our clients as well as the Open Source community to get the most out of Apache Flink and Ververica Platform. Before joining Ververica he worked as a Senior Consultant with TNG Technology Consulting, where he supported their clients mainly in the areas of Distributed Systems and Automation. Konstantin has studied Mathematics and Computer Science at TU Darmstadt specializing in Stochastics and Algorithmics.
Splunk
Distinguished Engineer at Splunk
Eric was an early employee at Cloudera before he founded Rocana which was acquired by Splunk. Today, he is a Distinguished Engineer at Splunk working on platform services including stream processing. He's the author of "Hadoop Operations" and is involved in a number of open source projects.
Dell EMC
Cloudera
VP Engineering at Cloudera
Joe Witt is Vice President of Engineering at Cloudera focused on the Cloudera Data Flow (CDF) product. Joe spent 10 years at NSA most of which developing what became Apache NiFi. In 2015 Joe and a team of cofounders left NSA and started Onyara which was acquired by Hortonworks which later merged with Cloudera. Joe is a member of the project management committee for Apache NiFi.
Apache Flink - Completing Cloudera’s End to End Streaming Platform
Adobe
Architect, Digital Experience Cloud at Adobe
Fakrudeen is an Architect in Digital Experience Cloud with focus and expertise in Big data and ML technologies. Formerly, he was Senior Manager at Yahoo, managing Yahoo front page content ranking and personalization system.
Software Engineer at Google
Ahmet Altay is a Senior Software Engineer at Google working on Apache Beam (PMC member) and Cloud Dataflow. Previously he worked at Microsoft on operating systems. He has a master's degree from Stanford University.
Distributed Processing for Machine Learning Production Pipelines
Netflix
Senior Software Engineer at Netflix
Abhay Amin is a senior software engineer at Netflix working for Consolidated Logging team. At Netflix, he focuses on building and scaling real time & batch metrics platforms using spark, Flink and Kafka. Prior to Netflix, he has a similar experience of building data products for e-commerce and finance.
Building a metric platform using Flink for massive scale at Netflix
Big Data Institute
Managing Director at Big Data Institute
Jesse Anderson is a Data Engineer, Creative Engineer and Managing Director of Big Data Institute.
He works with companies ranging from startups to Fortune 100 companies on Big Data. This includes training on cutting edge technologies like Apache Kafka, Apache Hadoop and Apache Spark. He has taught over 30,000 people the skills to become data engineers.
He is widely regarded as an expert in the field and for his novel teaching practices. Jesse is published on O’Reilly and Pragmatic Programmers. He has been covered in prestigious publications such as The Wall Street Journal, CNN, BBC, NPR, Engadget, and Wired.
FinTech Studios
Senior Software Engineer at FinTech Studios
Austin Cawley-Edwards is a Senior Software Engineer at FinTech Studios, using real-time data to bring clarity to financial news. He frequently works with Apache Flink, RabbitMQ, and Elasticsearch. He also loves API design, taking part in collaborative communities, and sometimes JavaScript.
Reliable, Real-Time AI with the Ververica Platform + a Kubernetes Operator in a Growing Startup
Cogility
Data Scientist at Cogility
Samantha is a data scientist, working on the Cogynt team in Cogility Software. She helps the team develop rich data science features on Cogynt to ensure customers can meet their data needs swiftly and with ease. Prior to working at Cogility, she worked on building a machine learning and analytics platform for a surgical simulation system at a medical device company in California.
Amazon Web Services
Software Developer at Amazon Web Services
Tirtha Chatterjee is a software developer at the Amazon Managed Streaming for Kafka team. He worked on building the cluster health monitoring system on Apache Flink that monitors the health of Apache Kafka clusters that his team supports.
Tirtha has years of experience on working across the software stack - from core storage and streaming systems to front-end website development. He is a genuine open source enthusiast and has contributed to multiple open source projects such as Pidgin, KDE, Rekonq, Lokalize and Apache Kafka.
Needle in the Haystack - Monitoring health of a huge Kafka fleet with Flink
Tencent
Software Developer at Tencent
Zili Chen is contributing for Apache Flink for over a year and now one of Flink committers. He focuses on consistency in distributed system and flexibility of user-facing interface.
Implement Reliable, Isolated & Unified Job Submission
TensorFlow Developer Advocate at Google
A data scientist and TensorFlow addict, Robert has a passion for helping developers quickly learn what they need to be productive. He's used TensorFlow since the very early days and is excited about how it's evolving quickly to become even better than it already is. Before moving to data science Robert led software engineering teams for both large and small companies, always focusing on clean, elegant solutions to well-defined needs. In his spare time Robert sails, surfs occasionally, and raises a family.
Distributed Processing for Machine Learning Production Pipelines
Product Manager, V.P. of D&I at the ASF at Google
Gris is an experienced tech strategist who's worked with distributed communities for over 8 years. She has a Masters in Operation Research and Data Science from UC Berkeley and is passionate about big data analytics, open source projects, information architecture, diversity and inclusion in tech & Italian wines.
She’s an industrial engineer by formation, and therefore has scaled products and organizations for over 10 years. From her work on oil rigs to online communities, she’s proven to be able to adapt to diverse industries and environments. She also enjoys solving undefined problems and to spearhead solutions no one has designed before.
Equality and inclusion as a way of developing diverse, resilient, and more relevant OSS
Netflix
Director, Data Platform Architecture at Netflix
Justin Cunningham is the Director of Data Platform Architecture at Netflix focused primarily on data movement, schemas, and supporting the Netflix Studio. Previously, Justin was a Group Tech Lead at Yelp, leading efforts centered around experimentation and metrics, real-time data infrastructure, and machine learning. Before Yelp, Justin worked at several small startups that he founded.
Cloudera
Flink Engineer at Cloudera
Gyula is a Software Engineer in the Flink Engineering team at Cloudera working on integrating Flink into the Cloudera platform.
He has been a committer and contributor since the early days of Flink streaming and has used Flink in large scale production at King for almost 4 years delivering innovative real-time applications at a global scale.
Gyula grew up in Budapest where he first started working on distributed stream processing and later became a core contributor to the Apache Flink project. Gyula has been a speaker at numerous big data related conferences and meetups, talking about stream processing technologies and use-cases.
Netflix
Senior Software Engineer at Netflix
I have degrees in Electrical Engineering and Mathematics from CMU. My expertise is in distributed data processing engines. In open source, I have worked on Apache Apex and Apache Drill.
Amazon Web Services
Engineering Head for Amazon Kinesis Data Analytics service at Amazon Web Services
Praveen Gattu heads engineering team for Amazon Kinesis Data Analytics service. Praveen has worked in Amazon Webservices for past 12 years in Kinesis and S3.
Lessons learned on Apache Flink application availability in a hosted Apache Flink service
Eventador.io
Co-Founder and CEO at Eventador.io
Kenny has decades of experience with various database platforms behind some of the busiest companies in the world. He has had roles as Architect, Director, Manager, Developer, and DBA.
He was a key member of the early teams that scaled Paypal and then eBay on Oracle. He ran one of the busiest PostgreSQL installations in the world at Hi5 and was an early adopter of MongoDB using it for various large projects at Shutterfly.
He is an active member in the PostgreSQL community and scaled Hi5 from just a few servers to dozens running multi-terabye workloads on SSD and SAN backends. He has contributed to the early versions of pg_reorg, and wrote the pgstat2 utility as well as other tools and performance techniques. He’s been blogging about databases including PostgreSQL for years.
He has been an active MongoDB community member, speaker, MongoDB evangelist, and now Mongo Master. In 2011 he formed the MongoDB as a Service provider ObjectRocket with colleagues from eBaY. ObjectRocket was acquired by Rackspace in 2012.
He is active in the Apache Kafka and Apache Flink communities - speaking at conferences and participating in community events. SQLStreambuilder, Eventador’s flagship product is built using Apache Flink.
Currently, Kenny is a Founder at Eventador.io, a streaming data platform. He is focused on building innovative data services to power the next generation of applications that must aggregate, mutate, filter, and join data in real time.
Writing an interactive streaming SQL and materialized view engine using Flink
Lyft
Software Engineer at Lyft
Kailash has over 5 years of experience building data infrastructure, search infrastructure and computer vision systems. During his time at Lyft, Kailash has worked in teams which manage kafka cluster, build flink jobs to persist data to S3 and manage platform for real-time distribution of messages / events. In his erstwhile life, he worked as an investment banker advising corporates on IPO / M&A strategies.
Large-scale near-real-time (NRT) data analytics platform empowered by Apache Flink
Uber
Software Engineer at Uber
Teng(Niel) Hu is a software engineer at Uber, previously worked on dynamic pricing for 2 years, now works in Uber AI Lab as a research engineer.
Ververica
Software Engineer, Apache Flink PMC Member at Ververica
Fabian Hueske is a committer and PMC member of Apache Flink. He is one of the three original authors of the Stratosphere research system, from which Apache Flink was forked in 2014. Fabian is a co-founder of data Artisans (now Ververica), a Berlin-based startup devoted to fostering Flink, where he works as a software engineer and contributes to Apache Flink. He holds a PhD in computer science from TU Berlin and is a co-author of "Stream Processing with Apache Flink".
Godaddy
Principal Software Engineer at Godaddy
Ever since I was a kid, I was very passionate about computers. When I was in grade IV, I joined a hardware institute in India to build my own PC. Back then, I would use that hand-built PC with a 486 processor to play games like Dave.
My hardware guru introduced me to my first programming language "Visual Basic". I build a very basic application for my dad that would help him keep track of inventory at his tire shop.
After pursing my masters in computer science at USC and interning at CallFire - A VOIP company and Mindjolt - A gaming company, I started working at Yahoo as a Software Engineer in the Ads and Data Platform team. I learned a lot about building applications at scale and got introduced to Hadoop and the magic of Map Reduce at Yahoo about a decade ago. After 5 years at Yahoo, I joined a 30 person startup called AtScale to help build a BI platform that can make it easy for customer to query terabytes of data really fast. For the last couple of years, I have been with Godaddy and we are building our streaming data platform using beam and flink to make data available to our downstream customers in a low latency fashion. I am an Apache Beam contributor and love spending time with my son and wife.
Alibaba
Senior Director at Alibaba
Xiaowei Jiang is a Senior Director at Alibaba. He currently leads the Hologres team at AliCloud. This product provides unified storage and service for offline and real time data. Previously, he worked as Tech Lead at Facebook and Principal Engineer at Microsoft SQL Server.
Data Warehouse, Data Lakes, What's Next?
Pravega by Dell EMC
Senior Distinguished Engineer at Pravega by Dell EMC
Flavio Junqueira is a Senior Distinguished Engineer at Dell. He holds a PhD in computer science from the University of California, San Diego, and he is interested in various aspects of distributed systems, including distributed algorithms, concurrency, and scalability. His recent work at Dell focuses on stream analytics, and specifically, on the development of a novel storage system for streams called Pravega. Before Dell, Flavio held an engineering position with Confluent and research positions with Yahoo! Research and Microsoft Research. Flavio has co-authored a number of scientific publications (over 4,000 citations according to Google Scholar) and an O’Reilly ZooKeeper book on Apache ZooKeeper. Flavio is an Apache Member and has contributed to projects hosted by the ASF, including Apache ZooKeeper (as PMC and committer), Apache BookKeeper (as PMC and committer), and Apache Kafka.
Everything is connected: How watermarking, scaling, and exactly once impact one another in Pravega
Pravega by Dell EMC
Distinguished Engineer at Pravega by Dell EMC
Tom Kaitchuck is among the original group of developers of the Pravega project and is currently a core contributor employed by Dell. He holds a BS Degree from Valparaiso University. Tom an ardent open source software developer previously held senior software developer positions with Google and Amazon. Tom’s interests include Distributed systems, Asynchronous communication, Concurrency, Scaling systems, Consistency models.
Everything is connected: How watermarking, scaling, and exactly once impact one another in Pravega
DiDi
Staff Engineer at DiDi
Xue Kang is a team lead of real-time computing team at DiDi, providing stable, efficient real-time computing service with low cost. He has a master degree from Zhejiang University, and has has rich experience on batch and steam processing technology.
Flink's application at Didi
Mux
Staff Software Engineer at Mux
I've been building video encoding & delivery platforms for over 12 years (MobiTV, Brightcove/Zencoder, and now Mux). I'm currently a Staff Software Engineer working on the Mux Data service which provides realtime and historical analytics for Internet video playback. I’ve built high-volume stream-processing applications for Mux Data and Mux Video (our full-service video encoding and distribution service) that have served some of the most widely watched video streams on the Internet. Interests include Kafka, Flink, Kubernetes, and Go.
Amazon Web Services
Software Development Engineer @AWS Kinesis at Amazon Web Services
I am software development engineer at AWS Kinesis and working mainly on managed service for Flink.
Lessons learned on Apache Flink application availability in a hosted Apache Flink service
Ververica
Head of Product at Ververica
As Head of Product for Ververica Platform Konstantin is responsible for Ververica's commercial product, an enterprise-ready stream processing platform based on Apache Flink. Previously, he was leading the solutions architecture team and helping our clients as well as the Open Source community to get the most out of Apache Flink and Ververica Platform. Before joining Ververica he worked as a Senior Consultant with TNG Technology Consulting, where he supported their clients mainly in the areas of Distributed Systems and Automation. Konstantin has studied Mathematics and Computer Science at TU Darmstadt specializing in Stochastics and Algorithmics.
Yelp
Software Engineer at Yelp
Catlyn is a software engineer on the stream processing team at Yelp where she builds and maintains infrastructure that makes real-time data processing with Flink easy and reliable. Most recently, she’s been focusing on bringing in Apache Beam into the streaming ecosystem at Yelp.
Alibaba
Senior Engineer, Committer of Apache Flink at Alibaba
Bowen is a committer of Apache Flink and senior engineer at Alibaba. He has been working on Flink for over 3 years, with extended experience on developing and operating Flink in Alibaba at an unprecedented scale.
Besides committing code and reviewing designs, Bowen is a frequent speaker of Flink at conferences and events, evangelizing Flink and stream processing, to make the world a little bit more real-time data driven at a time.
Production-Ready Flink and Hive Integration - what story you can tell now
StreamNative
Staff Software Engineer at StreamNative
Neng Lu is a staff software engineer at StreamNative where he drives the development of Apache Pulsar and the integrations with big data ecosystem. Before that, he was a senior software engineer at Twitter. He was the core committer to the Heron project and the leading engineer for Heron development at Twitter. He also worked on Twitter’s monitoring and key-value storage systems. Before joining Twitter, he got his master's degree from UCLA and a bachelor degree from Zhejiang University.
Build your next-generation stream platform based on Apache Pulsar
Alibaba
Architect of Flink Engine at Alibaba
Yuan Mei is the architect of Flink Engine at Alibaba (joined Sept. 2019). Before that, she led to building Turbine: Facebook’s Service Management Platform for Stream Processing (ICDE2020). She has various experiences building Stream Processing Systems (Puma, VLDB2018) and many other realtime systems at Facebook (SIGMOD2016). She holds a Ph.D. from MIT CSAIL, under the supervision of Prof. Samuel Madden & Prof. Michael Stonebraker.
Netflix
Senior Software Engineer at Netflix
Jagannathrao Mudda is a Senior Software Engineer at Netflix working in Consolidated Logging team. At Netflix, he is building schema-aware data streams of user behavior and application performance data that enables analytics and personalization using technologies such as Flink, Spark, Kafka, Hadoop etc. Prior to Netflix, he has several years of experience in leading software engineering teams for both large and small companies and building large scale, high-performance batch and real-time processing systems in domains such as online advertising and web analytics at Yahoo, data warehousing at BitYota, data platform at LifeLock/Symantec, and continuous data protection services at BMC Software.
High-Quality Performant and Cost Efficient Schema-Aware Data Streams on Flink at Netflix Scale
Criteo
Staff Development Lead at Criteo
Piyush is a Staff Development Lead in Criteo’s AI Lab. His work involves building data infrastructure solutions to accelerate the pace of Machine Learning innovation at Criteo. Previously, he worked on various big-data libraries like Scalding, Algebird and Parquet at Twitter.
In his free time, he's found either trail running or curled up with a book (typically not at the same time).
Building FeatureFlow, Criteo’s feature data generation platform
Ververica
Apache Flink Committer, Software Engineer at Ververica
Piotr Nowojski is a Software Engineer in Ververica and Flink committer working mostly on Flink’s runtime code. Previously, he was a Software Engineer in Teradata working on Presto – distributed batch SQL query engine.
Hyperconnect
Azar, Matchmaking, Recommendation at Hyperconnect
Jacob Oh is a senior software engineer and team leader at Hyperconnect. Data Application Team is designing, developing and maintaining Hyperconnect's Recommendation Engine (Such as Azar's matchmaking system). Hyperconnect is the company serving the services like Azar based on Real-time Communication and ML Tech. With the experiences on researching UX and engineering Applications, He design and develop the products to make a better service experience.
Cloudera
Software Engineer at Cloudera
Matyas has been working at Cloudera and assisting customers on their big data journey since 2016. After being a member of the Support, then the Professional Services team he has joined Engineering as a founding member of the Cloudera Flink team. He focuses on enterprise requirements including security and operations. Before joining Cloudera he was responsible for delivering classical software development projects in the Telecommunication and Financial sectors. He is a wine enthusiast and a hobby winemaker. He is married and proud owner of a 'sausage dog', Ziggy.
Alibaba
Staff Software Engineer & Senior Manager at Alibaba
Jiangjie (Becket) is currently a software engineer at Alibaba where he mostly focus on the development of Apache Flink and its ecosystem. Prior to Alibaba, Becket worked at LinkedIn to build streams infrastructures around Apache Kafka after he received Master degree from Carnegie Mellon University in 2014. Becket is a PMC member of Apache Flink and Apache Kafka.
Senior Software Engineer, Weibo Machine Learning at Weibo
Qian Yu is a senior algorithm engineer in Weibo. She has been working on building real-time data processing and online machine learning framework with Flink for several years. Also, she is experienced in the recommendation system applied in social media, helped to match best content to their users in Weibo.
Ververica
Marketing Manager at Ververica
Markos Sfikas is a Marketing Manager at Ververica. He obtained an MSc in International Marketing from the University of Strathclyde. He previously worked at ResearchGate and LinkedIn in the areas of Product Marketing, Content Marketing & Online Advertising.
Equality and inclusion as a way of developing diverse, resilient, and more relevant OSS
Meeshkan
CEO at Meeshkan
Mike Solomon is the CEO of Meeshkan, a Helsinki-based startup on a mission to help companies build, maintain, and ship great sandboxes and digital twins of their infrastructure.
How Streaming Helps Your Staging Environment and Sandboxes Stay Up To Date
Cloudera
Staff Software Engineer at Cloudera
Wilfred is a Staff Software Engineer from Cloudera in Australia. He has worked on Hadoop for over 6 years mainly on Apache YARN, MapReduce and Spark. Currently, he works on the YuniKorn scheduler project. Before Cloudera, he has worked for SUN Microsystems and Oracle as part of the Identity Management teams as a developer and consultant for over 10 years. Wilfred holds a Master’s degree in Decision Support Systems from Sunderland University.
Alibaba
Apache Flink Committer & PMC Member / Staff Engineer at Alibaba
Jincheng Sun is a PMC member of Apache Flink and ACL Beijing, He is also a committer for Flink, Beam, IoTDB. He is also an engineering lead at Alibaba Group. During 9 years working experiences at Alibaba, he participated and lead some of the critical systems inside the company and started the development of PyFlink.
Ververica
Apache Flink PMC, Software Engineer at Ververica
Tzu-Li (Gordon) Tai is an Apache Flink PMC and Software Engineer at Ververica. He is currently working on the Stateful Functions API (https://statefun.io) in Apache Flink. In the past, he has contributed to various other parts of the Apache Flink project, including some of the more popular streaming connectors for Flink (Apache Kafka, AWS Kinesis, etc.) as well as several topics surrounding evolvability of stateful Flink streaming applications.
Stateful Functions: Polyglot Event-Driven Functions for Stateful Distributed Applications
Cogility
VP of Engineering at Cogility
Aslam Tajwala, VP of Engineering at Cogility Inc. His current work is focused on creating an analytics platform that makes it easy to capture human expertise and reasoning to allow analyzing high-volume, high-velocity data.
Netflix
Senior Software Engineer at Netflix
Ramayan Tiwari is an engineer in the Consolidated Logging team at Netflix, where he oversees the ingestion and processing of user behavior and app analytics events. Ramayan's interest and expertise lie in building and operating large scale distributed systems, distributed data stores, and messaging systems. Before Netflix, Ramayan worked Cruise to develop storage solutions for events generated from self-driving cars, Salesforce in their distributed job scheduler, and Amazon's metadata service.
High-Quality Performant and Cost Efficient Schema-Aware Data Streams on Flink at Netflix Scale
Salesforce
Principal Software Engineer at Salesforce
Andrew Torson is a Principal Data Engineer with Salesforce. His current work is focused on real-time ML based anomaly detection and application performance monitoring for the Salesforce cloud software. Before joining Salesforce, he was a data engineering lead working on the Smart Pricing platform in the Walmart Labs, generating real-time algorithmic price decisions for the global Walmart e-commerce catalog. Andrew is a Scala enthusiast and an active Flink developer with a long industry track-record. He holds a PhD degree in Operations Management from the New York University and M.Sci in Applied Mathematics from the Moscow Institute of Physics and Technology.
Google Dataflow
Dev Advocate at Google Dataflow
In his 7 years @Google, Reza has been lucky enough to work with developers from many industries from Gaming to Banking, applying Google's Data Analytics technologies to new domains. Currently as developer advocate for Google Cloud Dataflow and Apache Beam, he gets to have fun talking about stream processing all day long, or should that be stream processing unbounded... :-)
Distributed Processing for Machine Learning Production Pipelines
Ververica
Software Engineer, Apache Flink PMC Member at Ververica
Timo Walther is a committer and PMC member of the Apache Flink project. He studied Computer Science at TU Berlin. Alongside his studies, he participated in the Database Systems and Information Management Group there and worked at IBM Germany. Timo works as a software engineer at Ververica. In Flink, he is mainly working on the Table & SQL API.
Alibaba
Big Data Senior Software Engineer at Alibaba
Yang Wang, more than 5 years Senior Software Engineer @ Alibaba. He is interested in big data, cloud native technologies and have lots experiences of large scale cluster resource management. Now mainly focus on Flink deployment, to make Flink run everywhere (e.g. on-premise cluster, cloud, serverless, IOT, etc.).
Ververica
Solutions Architect at Ververica
Seth Wiesman is a Solutions Architect at Ververica, where he works with engineering teams inside of various organizations to build the best possible stream processing architecture for their use cases.
Alibaba
Software Engineer at Alibaba
Jark Wu is a committer and PMC member of Apache Flink. He works as a software engineer at Alibaba and contributes to Apache Flink since 4 years ago. In Flink, he is mainly working on the Table & SQL API. Prior to Flink, Jark worked on JStorm which is a Java-version Apache Storm in Alibaba.
Lyft
Software Engineer, Streaming Platform at Lyft
Micah Wylde is a software engineer on the streaming compute team at Lyft, focused on the development of Apache Flink and Apache Beam. Previously, he built data infrastructure for fighting internet fraud at Sift and real-time bidding infrastructure for ads at Quantcast.
How Lyft built a streaming data platform with Flink on Kubernetes
Lyft
Staff Engineer, Streaming Platform at Lyft
Ying currently works in Lyft's streaming platform team, where he investigates large-scale near real-time data ingestion and streaming pubsub infrastructure. Prior to Lyft Inc, he worked at Linkedin where he designed kafka-driven cross data center replication for Espresso -- Linkedin's scalable, time-line consistent source-of-truth NoSQL database.
Large-scale near-real-time (NRT) data analytics platform empowered by Apache Flink
Cloudera
Staff Software Engineer, Apache Hadoop Committer and PMC member at Cloudera
Weiwei Yang is a Staff Software Engineer from Cloudera, an Apache Hadoop committer and PMC member. He has been working on big data areas, especially Hadoop, over 8 years. He is focused on technology around large scale, hybrid computation systems. Before Cloudera, he worked in Alibaba’s realtime computation infrastructure team, focused on enhancing the platform to be more efficient, stable and scalable. Prior to that, he worked in the IBM big data organization for several years. Weiwei holds a master’s degree from Peking University.
Hyperconnect
Software Engineer, Match Unit Lead at Hyperconnect
Gihoon Yeom has worked as a software engineer for 4 years at HyperConnect. He is interested in using data to create new value and has focused on real-time distributed data engineering projects using Spark and Flink.
Data driven matchmaking streaming at Hyperconnect
Alibaba
Engineering Lead at Alibaba
Kurt Young is a PMC member of Apache Flink and Apache Druid. He is also an engineering lead at Alibaba Group. During 9 years working experiences at Alibaba, he participated and lead some of the critical systems inside company, such as search engine, scheduling system, monitoring and online analytics system. He also lead a SQL engine team which heavily based on Apache Flink, serving nearly all business units of Alibaba group.
Baidu
Technical Lead, Senior R&D Engineer at Baidu
Lead of Real Time processing platform of IoT department of Baidu Cloud.
Alibaba
Staff Engineer at Alibaba
Jeff has 11 years of experience in big data industry. He is an open source veteran, start to use Hadoop since 2009 and is PMC of several Apache projects Tez/Livy/Zeppelin and committer of Apache Pig. His past experience is not only on big data infrastructure, but also on how to leverage these big data tools to get insight. He speaks several times on big data conferences like Hadoop summit, Strata + Hadoop world. Now he works in Alibaba Group as a staff engineer. Prior that he works in Hortonworks where he developed these popular big data tools.
Apache Flink, Flink and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event.