About the speakers
Meet the experts from global companies like Apple, Pinterest, Splunk, Netflix, Shopify, and more, who have built scalable streaming infrastructure and enterprise-grade applications.
Hear why and how they use Flink as the stream processing engine of choice for large-scale stateful applications, including real-time analytics, real-time search and content ranking, and fraud, anomaly, and threat detection.
Speakers
Abhay Amin is a senior software engineer at Netflix on the Consolidated Logging team. At Netflix, he focuses on building and scaling real-time and batch metrics platforms using Spark, Flink, and Kafka. Prior to Netflix, he built data products for e-commerce and finance.
Ahmet Altay is a Senior Software Engineer at Google working on Apache Beam (PMC member) and Cloud Dataflow. Previously he worked at Microsoft on operating systems. He has a master's degree from Stanford University.
Andrew Torson is a Principal Data Engineer with Salesforce. His current work is focused on real-time ML-based anomaly detection and application performance monitoring for the Salesforce cloud software. Before joining Salesforce, he was a data engineering lead working on the Smart Pricing platform at Walmart Labs, generating real-time algorithmic price decisions for the global Walmart e-commerce catalog. Andrew is a Scala enthusiast and an active Flink developer with a long industry track record. He holds a PhD in Operations Management from New York University and an M.Sci. in Applied Mathematics from the Moscow Institute of Physics and Technology.
Ever since I was a kid, I have been passionate about computers. When I was in grade IV, I joined a hardware institute in India to build my own PC. Back then, I would use that hand-built PC with a 486 processor to play games like Dave. My hardware guru introduced me to my first programming language, Visual Basic. I built a very basic application for my dad that helped him keep track of inventory at his tire shop. After pursuing my master's in computer science at USC and interning at CallFire (a VoIP company) and Mindjolt (a gaming company), I started working at Yahoo as a Software Engineer on the Ads and Data Platform team. I learned a lot about building applications at scale and was introduced to Hadoop and the magic of MapReduce at Yahoo about a decade ago. After 5 years at Yahoo, I joined a 30-person startup called AtScale to help build a BI platform that makes it easy for customers to query terabytes of data really fast. For the last couple of years, I have been with GoDaddy, where we are building our streaming data platform using Beam and Flink to make data available to our downstream customers with low latency. I am an Apache Beam contributor and love spending time with my son and wife.
Austin Cawley-Edwards is a Senior Software Engineer at FinTech Studios, using real-time data to bring clarity to financial news. He frequently works with Apache Flink, RabbitMQ, and Elasticsearch. He also loves API design, taking part in collaborative communities, and sometimes JavaScript.
Jiangjie (Becket) is currently a software engineer at Alibaba, where he mostly focuses on the development of Apache Flink and its ecosystem. Prior to Alibaba, Becket worked at LinkedIn building streaming infrastructure around Apache Kafka, after receiving a master's degree from Carnegie Mellon University in 2014. Becket is a PMC member of Apache Flink and Apache Kafka.
Bowen is a committer of Apache Flink and a senior engineer at Alibaba. He has been working on Flink for over 3 years, with extensive experience developing and operating Flink at Alibaba at an unprecedented scale. Besides committing code and reviewing designs, Bowen is a frequent speaker on Flink at conferences and events, evangelizing Flink and stream processing to make the world a little more real-time and data-driven.
Catlyn is a software engineer on the stream processing team at Yelp, where she builds and maintains infrastructure that makes real-time data processing with Flink easy and reliable. Most recently, she’s been focusing on bringing Apache Beam into the streaming ecosystem at Yelp.
Fabian Hueske is a committer and PMC member of Apache Flink. He is one of the three original authors of the Stratosphere research system, from which Apache Flink was forked in 2014. Fabian is a co-founder of data Artisans (now Ververica), a Berlin-based startup devoted to fostering Flink, where he works as a software engineer and contributes to Apache Flink. He holds a PhD in computer science from TU Berlin and is a co-author of "Stream Processing with Apache Flink".
Fakrudeen is an Architect in Digital Experience Cloud with a focus on and expertise in big data and ML technologies. Formerly, he was a Senior Manager at Yahoo, managing Yahoo's front page content ranking and personalization system.
Flavio Junqueira is a Senior Distinguished Engineer at Dell. He holds a PhD in computer science from the University of California, San Diego, and he is interested in various aspects of distributed systems, including distributed algorithms, concurrency, and scalability. His recent work at Dell focuses on stream analytics, and specifically on the development of a novel storage system for streams called Pravega. Before Dell, Flavio held an engineering position with Confluent and research positions with Yahoo! Research and Microsoft Research. Flavio has co-authored a number of scientific publications (over 4,000 citations according to Google Scholar) and an O’Reilly book on Apache ZooKeeper. Flavio is an Apache Member and has contributed to projects hosted by the ASF, including Apache ZooKeeper (as PMC member and committer), Apache BookKeeper (as PMC member and committer), and Apache Kafka.
Gihoon Yeom has worked as a software engineer for 4 years at Hyperconnect. He is interested in using data to create new value and has focused on real-time distributed data engineering projects using Spark and Flink.
Gris is an experienced tech strategist who has worked with distributed communities for over 8 years. She has a master's degree in Operations Research and Data Science from UC Berkeley and is passionate about big data analytics, open source projects, information architecture, diversity and inclusion in tech, and Italian wines. An industrial engineer by training, she has scaled products and organizations for over 10 years. From her work on oil rigs to online communities, she has proven able to adapt to diverse industries and environments. She also enjoys solving undefined problems and spearheading solutions no one has designed before.
Gyula is a Software Engineer in the Flink Engineering team at Cloudera working on integrating Flink into the Cloudera platform. He has been a committer and contributor since the early days of Flink streaming and has used Flink in large scale production at King for almost 4 years delivering innovative real-time applications at a global scale. Gyula grew up in Budapest where he first started working on distributed stream processing and later became a core contributor to the Apache Flink project. Gyula has been a speaker at numerous big data related conferences and meetups, talking about stream processing technologies and use-cases.
I am a software development engineer at AWS Kinesis, working mainly on the managed service for Flink.
Jacob Oh is a senior software engineer and team leader at Hyperconnect. His Data Application Team designs, develops, and maintains Hyperconnect's recommendation engine (such as Azar's matchmaking system). Hyperconnect provides services such as Azar, built on real-time communication and ML technology. Drawing on his experience in UX research and application engineering, he designs and develops products that deliver a better service experience.
Jagannathrao Mudda is a Senior Software Engineer at Netflix on the Consolidated Logging team. At Netflix, he is building schema-aware data streams of user behavior and application performance data that enable analytics and personalization, using technologies such as Flink, Spark, Kafka, and Hadoop. Prior to Netflix, he spent several years leading software engineering teams at both large and small companies and building large-scale, high-performance batch and real-time processing systems in domains such as online advertising and web analytics at Yahoo, data warehousing at BitYota, data platform at LifeLock/Symantec, and continuous data protection services at BMC Software.
Jesse Anderson is a Data Engineer, Creative Engineer and Managing Director of Big Data Institute. He works with companies ranging from startups to Fortune 100 companies on Big Data. This includes training on cutting edge technologies like Apache Kafka, Apache Hadoop and Apache Spark. He has taught over 30,000 people the skills to become data engineers. He is widely regarded as an expert in the field and for his novel teaching practices. Jesse is published on O’Reilly and Pragmatic Programmers. He has been covered in prestigious publications such as The Wall Street Journal, CNN, BBC, NPR, Engadget, and Wired.
Jincheng Sun is a PMC member of Apache Flink and ACL Beijing. He is also a committer for Flink, Beam, and IoTDB, and an engineering lead at Alibaba Group. During his 9 years at Alibaba, he has participated in and led some of the critical systems inside the company and started the development of PyFlink.
Justin Cunningham is the Director of Data Platform Architecture at Netflix focused primarily on data movement, schemas, and supporting the Netflix Studio. Previously, Justin was a Group Tech Lead at Yelp, leading efforts centered around experimentation and metrics, real-time data infrastructure, and machine learning. Before Yelp, Justin worked at several small startups that he founded.
Kenny has decades of experience with various database platforms behind some of the busiest companies in the world. He has had roles as Architect, Director, Manager, Developer, and DBA. He was a key member of the early teams that scaled PayPal and then eBay on Oracle. He ran one of the busiest PostgreSQL installations in the world at Hi5 and was an early adopter of MongoDB, using it for various large projects at Shutterfly. He is an active member of the PostgreSQL community and scaled Hi5 from just a few servers to dozens running multi-terabyte workloads on SSD and SAN backends. He contributed to the early versions of pg_reorg and wrote the pgstat2 utility as well as other tools and performance techniques. He has been blogging about databases, including PostgreSQL, for years. He has been an active MongoDB community member, speaker, MongoDB evangelist, and now Mongo Master. In 2011 he formed the MongoDB-as-a-Service provider ObjectRocket with colleagues from eBay. ObjectRocket was acquired by Rackspace in 2012. He is active in the Apache Kafka and Apache Flink communities, speaking at conferences and participating in community events. SQLStreambuilder, Eventador’s flagship product, is built using Apache Flink. Currently, Kenny is a Founder at Eventador.io, a streaming data platform. He is focused on building innovative data services to power the next generation of applications that must aggregate, mutate, filter, and join data in real time.
As Head of Product for Ververica Platform, Konstantin is responsible for Ververica's commercial product, an enterprise-ready stream processing platform based on Apache Flink. Previously, he led the solutions architecture team, helping our clients as well as the open source community get the most out of Apache Flink and Ververica Platform. Before joining Ververica, he worked as a Senior Consultant with TNG Technology Consulting, where he supported their clients mainly in the areas of distributed systems and automation. Konstantin studied Mathematics and Computer Science at TU Darmstadt, specializing in Stochastics and Algorithmics.
Markos Sfikas is a Marketing Manager at Ververica. He obtained an MSc in International Marketing from the University of Strathclyde. He previously worked at ResearchGate and LinkedIn in the areas of Product Marketing, Content Marketing & Online Advertising.
Micah Wylde is a software engineer on the streaming compute team at Lyft, focused on the development of Apache Flink and Apache Beam. Previously, he built data infrastructure for fighting internet fraud at Sift and real-time bidding infrastructure for ads at Quantcast.
Mike Solomon is the CEO of Meeshkan, a Helsinki-based startup on a mission to help companies build, maintain, and ship great sandboxes and digital twins of their infrastructure.
Neng Lu is a staff software engineer at StreamNative, where he drives the development of Apache Pulsar and its integrations with the big data ecosystem. Before that, he was a senior software engineer at Twitter. He was a core committer to the Heron project and the leading engineer for Heron development at Twitter. He also worked on Twitter’s monitoring and key-value storage systems. Before joining Twitter, he received a master's degree from UCLA and a bachelor's degree from Zhejiang University.
Piotr Nowojski is a Software Engineer at Ververica and a Flink committer working mostly on Flink’s runtime code. Previously, he was a Software Engineer at Teradata working on Presto, a distributed batch SQL query engine.
Piyush is a Staff Development Lead in Criteo’s AI Lab. His work involves building data infrastructure solutions to accelerate the pace of Machine Learning innovation at Criteo. Previously, he worked on various big-data libraries like Scalding, Algebird and Parquet at Twitter. In his free time, he's found either trail running or curled up with a book (typically not at the same time).
Praveen Gattu heads the engineering team for the Amazon Kinesis Data Analytics service. Praveen has worked at Amazon Web Services for the past 12 years, on Kinesis and S3.
Ramayan Tiwari is an engineer on the Consolidated Logging team at Netflix, where he oversees the ingestion and processing of user behavior and app analytics events. Ramayan's interest and expertise lie in building and operating large-scale distributed systems, distributed data stores, and messaging systems. Before Netflix, Ramayan worked at Cruise developing storage solutions for events generated by self-driving cars, at Salesforce on their distributed job scheduler, and at Amazon on its metadata service.
In his 7 years @Google, Reza has been lucky enough to work with developers from many industries, from Gaming to Banking, applying Google's Data Analytics technologies to new domains. Currently, as a developer advocate for Google Cloud Dataflow and Apache Beam, he gets to have fun talking about stream processing all day long, or should that be stream processing unbounded... :-)
A data scientist and TensorFlow addict, Robert has a passion for helping developers quickly learn what they need to be productive. He's used TensorFlow since the very early days and is excited about how it's evolving quickly to become even better than it already is. Before moving to data science Robert led software engineering teams for both large and small companies, always focusing on clean, elegant solutions to well-defined needs. In his spare time Robert sails, surfs occasionally, and raises a family.
Samantha is a data scientist on the Cogynt team at Cogility Software. She helps the team develop rich data science features in Cogynt so that customers can meet their data needs swiftly and with ease. Prior to working at Cogility, she worked on building a machine learning and analytics platform for a surgical simulation system at a medical device company in California.
I've been building video encoding & delivery platforms for over 12 years (MobiTV, Brightcove/Zencoder, and now Mux). I'm currently a Staff Software Engineer working on the Mux Data service which provides realtime and historical analytics for Internet video playback. I’ve built high-volume stream-processing applications for Mux Data and Mux Video (our full-service video encoding and distribution service) that have served some of the most widely watched video streams on the Internet. Interests include Kafka, Flink, Kubernetes, and Go.
Seth Wiesman is a Solutions Architect at Ververica, where he works with engineering teams inside of various organizations to build the best possible stream processing architecture for their use cases.
Teng (Niel) Hu is a software engineer at Uber. He previously worked on dynamic pricing for 2 years and now works in Uber AI Lab as a research engineer.
Timo Walther is a committer and PMC member of the Apache Flink project. He studied Computer Science at TU Berlin. Alongside his studies, he participated in the Database Systems and Information Management Group there and worked at IBM Germany. Timo works as a software engineer at Ververica. In Flink, he is mainly working on the Table & SQL API.
I have degrees in Electrical Engineering and Mathematics from CMU. My expertise is in distributed data processing engines. In open source, I have worked on Apache Apex and Apache Drill.
Tirtha Chatterjee is a software developer on the Amazon Managed Streaming for Kafka team. He worked on building the cluster health monitoring system on Apache Flink that monitors the health of the Apache Kafka clusters his team supports. Tirtha has years of experience working across the software stack, from core storage and streaming systems to front-end website development. He is a genuine open source enthusiast and has contributed to multiple open source projects such as Pidgin, KDE, Rekonq, Lokalize, and Apache Kafka.
Tom Kaitchuck is among the original group of developers of the Pravega project and is currently a core contributor employed by Dell. He holds a BS degree from Valparaiso University. Tom, an ardent open source software developer, previously held senior software developer positions with Google and Amazon. Tom’s interests include distributed systems, asynchronous communication, concurrency, scaling systems, and consistency models.
Tzu-Li (Gordon) Tai is an Apache Flink PMC member and Software Engineer at Ververica. He is currently working on the Stateful Functions API (https://statefun.io) in Apache Flink. In the past, he has contributed to various other parts of the Apache Flink project, including some of the more popular streaming connectors for Flink (Apache Kafka, AWS Kinesis, etc.) as well as several topics surrounding the evolvability of stateful Flink streaming applications.
Weiwei Yang is a Staff Software Engineer at Cloudera and an Apache Hadoop committer and PMC member. He has been working in big data, especially Hadoop, for over 8 years. He is focused on technology around large-scale, hybrid computation systems. Before Cloudera, he worked on Alibaba’s real-time computation infrastructure team, focused on making the platform more efficient, stable, and scalable. Prior to that, he worked in the IBM big data organization for several years. Weiwei holds a master’s degree from Peking University.
Wilfred is a Staff Software Engineer at Cloudera in Australia. He has worked on Hadoop for over 6 years, mainly on Apache YARN, MapReduce, and Spark. Currently, he works on the YuniKorn scheduler project. Before Cloudera, he worked for Sun Microsystems and Oracle as part of the Identity Management teams as a developer and consultant for over 10 years. Wilfred holds a master’s degree in Decision Support Systems from Sunderland University.
Xue Kang is the team lead of the real-time computing team at DiDi, providing a stable, efficient real-time computing service at low cost. He has a master's degree from Zhejiang University and has extensive experience with batch and stream processing technology.
Yang Wang has been a Senior Software Engineer at Alibaba for more than 5 years. He is interested in big data and cloud native technologies and has extensive experience in large-scale cluster resource management. He now focuses mainly on Flink deployment, to make Flink run everywhere (e.g., on-premise clusters, cloud, serverless, IoT).
Ying Xu: Ying currently works on Lyft's streaming platform team, where he investigates large-scale near real-time data ingestion and streaming pubsub infrastructure. Prior to Lyft, he worked at LinkedIn, where he designed Kafka-driven cross data center replication for Espresso, LinkedIn's scalable, timeline-consistent source-of-truth NoSQL database.
Kailash HD: Kailash has over 5 years of experience building data infrastructure, search infrastructure, and computer vision systems. During his time at Lyft, Kailash has worked on teams that manage Kafka clusters, build Flink jobs to persist data to S3, and manage the platform for real-time distribution of messages and events. In his erstwhile life, he worked as an investment banker advising corporates on IPO and M&A strategies.
Lead of the real-time processing platform in the IoT department of Baidu Cloud.
Qian Yu is a senior algorithm engineer at Weibo. She has been building real-time data processing and online machine learning frameworks with Flink for several years. She is also experienced in recommendation systems for social media, helping match the best content to Weibo users.
Zili Chen has been contributing to Apache Flink for over a year and is now a Flink committer. He focuses on consistency in distributed systems and the flexibility of user-facing interfaces.
Marton is a Flink PMC member and one of the first contributors to the streaming API. He has driven big data adoption at around 50 customers as a Senior Solutions Architect at Cloudera during the last four years. He is the manager of the newly formed Streaming Analytics team and focuses on adding Flink to the Cloudera platform.
Eric was an early employee at Cloudera before he founded Rocana, which was acquired by Splunk. Today, he is a Distinguished Engineer at Splunk working on platform services, including stream processing. He's the author of "Hadoop Operations" and is involved in a number of open source projects.
Joe Witt is Vice President of Engineering at Cloudera, focused on the Cloudera Data Flow (CDF) product. Joe spent 10 years at the NSA, most of them developing what became Apache NiFi. In 2015, Joe and a team of cofounders left the NSA and started Onyara, which was acquired by Hortonworks, which later merged with Cloudera. Joe is a member of the project management committee for Apache NiFi.
Stephan Ewen is CTO and co-founder at Ververica where he leads the development of the stream processing platform based on open source Apache Flink. He is also a PMC member and one of the original creators of Apache Flink. Before working on Apache Flink, Stephan worked on in-memory databases, query optimization, and distributed systems. He holds a Ph.D. from the Berlin University of Technology.
Jeff has 11 years of experience in the big data industry. He is an open source veteran who has used Hadoop since 2009, a PMC member of several Apache projects (Tez, Livy, Zeppelin), and a committer of Apache Pig. His experience covers not only big data infrastructure but also how to leverage big data tools to gain insight. He has spoken several times at big data conferences such as Hadoop Summit and Strata + Hadoop World. He now works at Alibaba Group as a staff engineer. Before that, he worked at Hortonworks, where he developed these popular big data tools.
Matyas has been working at Cloudera and assisting customers on their big data journey since 2016. After serving on the Support and then the Professional Services team, he joined Engineering as a founding member of the Cloudera Flink team. He focuses on enterprise requirements, including security and operations. Before joining Cloudera, he was responsible for delivering classical software development projects in the telecommunication and financial sectors. He is a wine enthusiast and a hobby winemaker. He is married and the proud owner of a 'sausage dog', Ziggy.
Jark Wu is a committer and PMC member of Apache Flink. He works as a software engineer at Alibaba and has been contributing to Apache Flink for 4 years. In Flink, he is mainly working on the Table & SQL API. Prior to Flink, Jark worked on JStorm, a Java version of Apache Storm, at Alibaba.
Kurt Young is a PMC member of Apache Flink and Apache Druid. He is also an engineering lead at Alibaba Group. During his 9 years at Alibaba, he has participated in and led some of the critical systems inside the company, such as the search engine, scheduling system, and monitoring and online analytics systems. He also led a SQL engine team whose work is heavily based on Apache Flink, serving nearly all business units of Alibaba Group.
Yuan Mei is the architect of Flink Engine at Alibaba (joined Sept. 2019). Before that, she led the building of Turbine, Facebook’s Service Management Platform for Stream Processing (ICDE2020). She has broad experience building stream processing systems (Puma, VLDB2018) and many other real-time systems at Facebook (SIGMOD2016). She holds a Ph.D. from MIT CSAIL, under the supervision of Prof. Samuel Madden and Prof. Michael Stonebraker.
Aslam Tajwala is VP of Engineering at Cogility Inc. His current work is focused on creating an analytics platform that makes it easy to capture human expertise and reasoning for analyzing high-volume, high-velocity data.
Xiaowei Jiang is a Senior Director at Alibaba. He currently leads the Hologres team at AliCloud. This product provides unified storage and service for offline and real time data. Previously, he worked as Tech Lead at Facebook and Principal Engineer at Microsoft SQL Server.