Job Description
Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit www.redditinc.com . Location: Reddit has a flexible first workforce. Don't live near our office? No worries: you can work remotely from anywhere in the UK or the Netherlands. Team Overview We’re building a scalable feature platform that powers Ads ML by making high-quality features and training datasets easy to build, share, and maintain. Our small but growing team works on projects like batch & realtime feature management platform, training set generation platform, sequence features platform and, agentic and automated ML workflows for feature lifecycle management. We are looking for an engineer with experience in building high-scale data infrastructure and exposure to ML platforms to help evolve and scale our feature management systems. This is not a pure ML modeling role. The ideal candidate is excited about building reliable infrastructure, data pipelines, and developer-facing tools that make ML engineers more productive. What You’ll Do Design and build data infrastructure that supports large-scale feature and training set computation, transformation, and storage. Develop frameworks for batch and real-time features with a focus on reliability, scalability, and ease of use. Build platform capabilities for feature governance, including lineage tracking, validation, drift detection, anomaly monitoring, reproducibility, and versioning Partner with ML engineers to ensure smooth integration of feature engineering workflows into ML production systems. Build systems that support agentic ML workflows, including automated feature discovery, featu