The explosion of interest in Machine Learning in recent years has also brought forward many aspects of computing that are required to support the grand vision of what ML can deliver. At the infrastructure layer, we need to handle high-frequency data ingestion with low latency, and one of the best mechanisms we can leverage is streaming. So what is streaming, and what platform choices do we have? We will look at a few options, then zoom in on what a true next-generation, cloud-native streaming platform such as Apache Pulsar is capable of, beyond what the more common messaging platforms of today offer.
Mary is a Java Champion and a passionate Streaming Developer Advocate at DataStax, a leading data management company that champions Open Source software and specializes in Big Data, DB-as-a-service, Streaming, and Cloud-Native systems. She previously spent 3.5 years as a very effective advocate at IBM, focusing on Java, Jakarta EE, OpenJ9, Open Source, Cloud, and Distributed Systems. She transitioned from Unix/C to Java around 2000 and has never looked back. She considers herself a polyglot and loves to keep learning new and better ways to solve real-life problems. She is an active tech community builder outside of her day job and is currently the President of the Chicago Java Users Group (CJUG), as well as a co-organizer of several IBM-sponsored meetup groups in the Greater Chicago area.