Senior Data Engineer

Tavus

Completely RemoteFull TimeEngineering & Architecture
Posted Today

Job description

Responsibilities

  • Own Tavus's entire data strategy from sourcing to optimization
  • Source and curate multimodal datasets including text, video, and images
  • Build and scale data pipelines that power AI model training
  • Master video data challenges specific to ML training workflows
  • Optimize labeling and automation workflows for efficiency
  • Unlock value from internal platform data to drive product improvements
  • Define best practices and standards in emerging data problem spaces

Requirements

  • Extreme ownership mentality with end-to-end accountability for data systems
  • ML-first mindset with experience supporting AI model development
  • Hands-on experience with LLMs and multimodal datasets
  • Strong automation skills and infrastructure-as-code practices
  • Expert-level Python, SQL, and large-scale data processing
  • Ability to operate in ambiguous problem spaces and establish new standards

About the Company

Tavus builds the human layer of AI, pioneering multi-modal models for human perception and state-of-the-art avatar rendering. Their technology powers text-to-video AI avatars and real-time conversational video experiences across healthcare, recruiting, sales, and education. Backed by Sequoia, Y Combinator, and Scale VC, Tavus is creating the foundation for the next generation of AI employees, assistants, and companions.

Skills & tools

PythonSQLLLMMachine LearningData Analysis

What the team is looking for

Use this list as a quick fit check before you apply.

  1. 01Extreme ownership of data strategy
  2. 02ML-first mindset
  3. 03Experience with LLMs and multimodal datasets
  4. 04Strong automation skills
  5. 05Strong Python, SQL, and large-scale data processing
  6. 06Ability to define best practices in new problem spaces