Job Description
This is a remote position with in-person onboarding and retreats a few times a year. About Vana Vana is the leading decentralized AI network powered by user-owned data. Data is the most valuable resource in the digital world and it has been furthered by AI. Vana taps into the power of every individual to contribute data to leading AI models. Our guiding use case is a user-owned AI foundation model, collectively owned by 100 million users who've all contributed their data to it. Projects like the Reddit data DAO that onboarded 140k users are built on Vana. By building an open ecosystem to break down the walled gardens of tech giants, we are accelerating AI progress and giving users ownership in the AI models they create. We got started out of MIT and have raised $20M from leading VCs including Paradigm and Polychain. Role Overview We’re looking for a Distributed Systems Engineer to help design and build the scalable infrastructure powering data access and compute in the Vana network. You’ll be designing and building systems that allow users to connect their data in a privacy-preserving way, as well as pool and monetize it. As the backbone of Vana’s decentralized data layer, these systems move encrypted data across nodes, verify permissions through cryptographic proofs, and support ML workflows such as AI training and fine-tuning. They leverage Trusted Execution Environments (TEEs) to ensure that no entity can ever access raw user data. Here are some of the teams and projects that are already building on our infrastructure: An EV startup collects video data of edge cases like near-collision events to train better driving models A robotics-startup wants to fine-tune a model for grasping irregular objects A fintech works on improving the correctness of their risk-modeling by analyzing transaction records in a GDPR-compliant way Key Responsibilities Lead the development of data query and compute layer with the goal of scaling it to process data required to train AI model