PyData Tel Aviv 2024

Alon Oring

Alon Oring is the Head of Research at Dynamic Infrastructure, a predictive maintenance startup focused on using computer vision to identify defects and risks in critical infrastructure before they evolve into large-scale failures. Since joining Dynamic Infrastructure in 2019, Alon has led the development of several core technologies that obtained state-of-the-art performance and are currently serving multiple customers worldwide. Additionally, Alon is an active lecturer on deep learning, machine learning, and data science at Reichman University (IDC Herzliya), international coding boot camps, and an active mentor for up-and-coming data scientists.

  • A Shallow Introduction to Self-Attention
Aviv Vromen

Aviv Vromen is an experienced ML and data infrastructure engineer with a strong background in Python. He is currently working at bluevine, where he has played a key role in the company's success in the financial technology sector. Prior to that, Aviv made a contributions as an algorithm developer at Rafael, focusing on complex multi-agent systems.
In his conference talk, Aviv aims to share his approach to using aggregated data in order to improve feature calculation.

  • Optimizing Data-Driven Decisions: Introducing an Aggregation Engine for Efficient Feature Creation
Daniel Goldfarb

Daniel is an engineer at Bloomberg with experience developing Trading Systems, Risk Analytics, and applications for Financial Analysis of Equities and Fixed Income securities. He holds a Ph.D. in Molecular Biophysics from the University of Virginia, and was a CFA charter holder and member of the Chartered Financial Analyst Institute for more than 10 years. He is the Open Source maintainer of Matplotlib’s MPLFINANCE package, and the author of McGraw-Hill’s “Biophysics Demystified.”

  • Adding Your Own Data Apps to JupyterLab
David Katz

David is a Software Engineer​ at Mobileye, specializing in self-service big data analytics platforms.
With extensive experience in big data engineering, he has a strong background in developing and optimizing large-scale data pipelines using Python, PySpark, and AWS.
David has also contributed to open-source projects in the Python data ecosystem. (https://github.com/DavidKatz-il)

  • Empowering ML Developers with Self Serve Data Analytics
Ehud Karavani

Ehud is a research staff member at IBM Research, marrying machine learning with causal inference to address questions in medicine and healthcare.
He combines applied research with tool development for research, having created and currently maintaining Causallib—an open-source Python package for flexible causal inference modeling—used by many practitioners in both academia and industry. Over his 8 years at IBM, he has led the causality strategy for the company's global efforts in drug discovery, consulted to many of its research labs worldwide, lectured on causality to staff and clients, developed novel methodologies and published his research.
He holds an MSc. in computer science and computational biology from the Hebrew University, where he worked on trait prediction using DNA and assessed its potential consequences for population genetics and embryo selection. A musician and hiker, but mostly a parent.

  • Causal inference with Causallib
Eran Krakovsky

Biography

  • APL-Inspired Techniques for Advanced NumPy
Eyal Gruss

Dr. Eyal Gruss - Code/media/text artist, algorithms researcher, teaches computational creativity at the Holon institute of Technology. https://eyalgruss.com

  • Let our optima combine!
Geva Kipper

I'm an M.Sc graduate of Tel-Aviv University in Computer Science and I work for Google on products that aim to make phone calls more tolerable.
I've been excited about data and algorithms since I was young, and what I like even more is trying to get other to be as excited about them as I am.
I always try to pick projects that will interest the general public, or at least make them laugh.

  • Identifying Repetitive Songs using LZ Compression
Isan Rivkin

Isan Rivkin is R&D Team Leader in Treeverse, the company behind lakeFS, an open source platform that delivers resilience and manageability to object-storage based data lakes. Isan engineered and maintained petabyte-scale data infrastructure at analytics giant SmilarWeb.

  • Building a Reproducible RAG Pipeline for a Q&A ChatBot with LangChain and Ollama
Jonathan Harel

Jonathan is a digital comedian - touching on everything that involves humor, technology and creativity. He is the cofounder of Fine, a company that helps developers build better software, faster; and the creator of "Dark{mode}": a docu-comedy web series that covers developer experience topics, trends, and best practices.

  • Standup Comedy
Menachem Kluft

Menachem Kluft - Senior Backend Developer in Mobileye.
Uri Mogilevsky - Technical lead in Mobileye.
We're part of a group creating a data processing infrastructure. This system serves different groups, addresses various needs, and tackles challenging data processing tasks across the entire company.

  • Using Row Groups for fast filtering of large parquet files
Michael Ethan Levinger

Hello! I'm Michael, a Data Scientist with over 4 years of experience, specializing in developing advanced algorithms for fraud prevention in the fintech industry.

Currently, I work as a Data Scientist within the risk department at Melio, a rapidly growing fintech company.

Additionally, I'm a mentor at Masterschool, where I work closely with my mentees to help them achieve their goals, stay motivated and on track.

Alongside my work in data science, I'm also an avid ultra-marathon runner and a former coach. I believe that maintaining a healthy mind and body is essential for a fulfilling life and enjoy pushing myself to new physical and mental limits.

I'm always looking for opportunities to collaborate and make a positive impact in the world. If you're interested in connecting with me or learning more about my work, feel free to send me a message!

  • Securing Language Models Against Prompt Injection with the Powerful LangChain Framework
Mor Hananovitz

Head of Data and Data scientists at Parazero, IoT and signal processing expert.
Community lead and Mentor in WiDS.
MSc mechanical engineering, researching fluid dynamic models.

  • The TL;DR of EDA
Moran Reznik

2019-2020 data science intern at CheckPoint
2020-2022 data analyst at Ebay
2022-2023 senior data analyst at Lusha

  • BertTopic: From Free-Text feedbacks to Calls for Action.
Omri Fima

Omri is a Data Hacker, Maker, and LEGO builder. Currently, He is a Prinicipal Engineer at Walmart Global Tech.

  • Ibis framework - Making data science work at any scale.
Ortal Ashkenazi

Ortal Ashkenazi is a seasoned NLP researcher with over four years of experience at Gong, where she specialized in transforming data into actionable insights using advanced language models. Prior to that, she worked as a software engineer for 2.5 years at Algotec, developing information systems and processing solutions in the field of medical imaging. Holding an MSc from the Technion, her thesis focused on generating medical screening questionnaires by analyzing social media data, bridging artificial intelligence with practical healthcare solutions. Her expertise lies in model evaluation and deploying production-ready NLP solutions that address real-world challenges.

  • Unveiling the Journey of Natural Language Processing (NLP): Milestones, Limitations, and Practical Applications
Ran Bar Zik

Senior software architect at CyberArk, journalist at The Marker, lecturer at Haifa university, author of 6 software development books, blogger at internet-israel.com

  • Keynote: The Dangerous Data Anonymization
Reuven M. Lerner

Reuven is a full-time trainer in Python and data science, teaching companies around the world via in-person, online, and recorded courses. He is the author of both "Python Workout" and "Pandas Workout" (Manning), and writes both "Better Developers" (weekly articles about Python) and "Bamboo Weekly" (weekly Pandas puzzles based on current events). Reuven lives with his wife and children in Modi'in, Israel.

  • Times and Dates in Pandas
Roman Olshanskiy

With over 10 years of experience as a software engineer and leader in various domains, including backend, frontend, cloud computing, big data, and infrastructure, I am passionate about building innovative and efficient solutions, fostering collaboration, and bringing teams together to reach new heights.

  • Empowering ML Developers with Self Serve Data Analytics
Shirli Di-Castro Shashua

Dr. Shirli Di-Castro Shashua is a professional in machine learning and AI technologies. She earned her PhD from the Technion in the Faculty of Electrical and Computer Engineering, specializing in reinforcement learning, following her BSc in Biomedical Engineering from Ben Gurion University. Currently, Shirli holds the role of Senior Data Scientist at Embie, where she develops innovative solutions to fertility clinics using advanced generative AI capabilities.

  • AI, SQL, and GraphQL Walk into a Fertility Clinic… LLM-based Medical feature development
Shuki Cohen

Shiki is a seasoned Data Scientist with an emphasis on NLP, classical ML, visualization, and experimentation.

Driven by a great passion for the field, I am inspired by unintuitive insights and inferences made by smart algorithms. In my talks, I try to convey my typical spirit and enthusiasm while delivering crisp takeaways.

  • Live Coding: ChatGPT Goes Beyond Its Knowledge Cut-Off With External Database Integration
Tomer Doitshman

Tomer is a security research team lead in Cato Research Labs at Cato Networks, with a keen interest in various aspects of cybersecurity, including reverse engineering, network protocol analysis, and detecting malicious traffic. Additionally, Tomer is enthusiastic about machine learning and thrives on tackling intricate challenges within this field. Presently, his main area of focus is network-based security research, where he endeavors to devise innovative approaches for detecting threats in corporate network
settings.

  • AIOps for Security: SaaS Compliance Automation with a Python Stack
Uri Mogilevsky-Schay

.

  • Using Row Groups for fast filtering of large parquet files
Yoav Nordmann

Yoav Nordmann is a Backend & Data Architect and Tech Lead with over 20 years of experience. At Tikal he holds the position of a Group Leader mentoring fellow workers. He is passionate about new and emerging technologies, knowledge sharing and a fierce advocate for open source. Being in the industry for so long gives him a sense of perspective on different languages, architectures, and hypes.

  • Processing Biggish Data with DuckDB and Python
Yoel Zeldes

Over 14 years of experience as a software engineer and algorithm developer in various domains, including NLP, recommender systems, vision, and cybersecurity.

I am passionate about good quality code, interesting ideas and sophisticated algorithms. I love encountering elegant equations while trying to solve real-life problems.

  • Live Coding: ChatGPT Goes Beyond Its Knowledge Cut-Off With External Database Integration
Yonathan Guttel

Yonathan Guttel is a Data Scientist at Lightricks, serving in the Business DS team. In this role, he aids the marketing and finance sectors by crafting models, tools, and pipelines, refining revenue forecasts and marketing strategies.

  • 2D ARIMA: Capturing New Trends for Distant Time Horizons in Cohort Revenue Forecasting