Senior Backend Engineer, Data Mining

5 days ago


Singapore MOTIONAL SINGAPORE PTE. LIMITED Full time

Roles & Responsibilities Mission Summary: At Motional, we're transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving stack depends on finding the rare edge cases, long-tail scenarios, and model errors that matter most. OmniTag , our ML-powered multimodal data mining framework, is the engine that powers this discovery.As a Senior Backend Engineer on the Data Mining team, you'll architect and own the production systems that enable data scientists and ML engineers to rapidly mine, analyze, and extract insights from billions of data points across cameras, LiDAR, radar, and other modalities. You won't maintain a platform, you'll evolve its core foundation, ensuring OmniTag scales to support Motional's most ambitious autonomy challenges. Your work directly impacts the quality and speed at which we improve our perception and planning models. What You'll Do: Architect the OmniTag Engine: Design and build the high-throughput, low-latency backend systems that execute billion-scale inference across Ray/Spark, transforming raw sensor data into unified multimodal representations. You'll optimize for both query latency and resource efficiency in a cost-sensitive, cloud-based environment. Scale Multimodal Data Pipelines : Own the complete data journey - from ingestion, normalization, and preprocessing of heterogeneous modalities (image, video, LiDAR, audio) through encoding, indexing, and cached embedding storage. Ensure pipelines are robust, observable, and meet the SLOs expected by downstream ML teams. Evolve the Vector Search and Retrieval Engine : Enhance our in-house billion-scale vector search engine to power RAG-driven few-shot dataset creation. Optimize embedding storage, retrieval performance, and filtering across billions of examples to enable rapid interactive mining workflows. Own Data Quality and Observability : Build comprehensive monitoring, logging, and alerting for multimodal data preprocessing pipelines. Develop data validation frameworks that catch regressions in data alignment, normalization, or encoding quality—critical for maintaining model performance. Collaborate on Encoder-Decoder Adaptation : Work closely with ML engineers to support domain-specific fine-tuning workflows, model versioning, and A/B testing of new encoders and decoders. Ensure the backend infrastructure enables rapid experimentation with emerging open-source multimodal foundation models. Drive Production Reliability : Establish patterns for graceful degradation, fault tolerance, and cost optimization. Operate OmniTag as a mission-critical data platform serving the entire ML organization, with a focus on reliability, debuggability, and operational excellence. What We're Looking For: BS in Computer Science or a related field, or equivalent professional experience 6+ years designing, building, and operating large-scale distributed systems in production environments Deep, hands-on expertise with Ray or Spark (or both) for distributed data processing and large-scale inference workloads Expert-level Python proficiency with strong software engineering fundamentals: testing (unit, integration, and end-to-end), CI/CD pipelines, containerization, and code review practices Proven experience optimizing and scaling production data pipelines that process terabytes or petabytes of data Strong SQL and data manipulation skills; comfort with both structured and semi-structured data Experience with cloud infrastructure (AWS preferred: S3, EC2, EKS, EMR, IAM) and infrastructure-as-code patterns Demonstrated track record of shipping robust, well-tested, production-grade systems and mentoring junior engineers Bonus Points: MS/PhD in Computer Science, Machine Learning, or a related field. Experience building or scaling vector databases, large-scale information retrieval systems, or similarity search engines. Hands-on work with multimodal machine learning models, foundation models (LLMs/VLMs), or embeddings-based systems. Familiarity with ML frameworks (PyTorch, JAX) and the ecosystem around multimodal models. Production experience with workflow orchestration (Airflow, Kubeflow, Dagster) and stream processing (Kafka, Flink). Understanding of model serving patterns, feature stores, or ML ops infrastructure. Domain knowledge in autonomous driving, computer vision, or sensor fusion. Experience with ML-based data mining, active learning, or contrastive learning approaches. Tell employers what skills you have Information RetrievalMentoringMachine LearningAirflowAutonomyPipelinesArchitectSoftware EngineeringReliabilitySQLDistributed SystemsPythonOrchestrationS3DatabasesShipping



  • Singapore MOTIONAL SINGAPORE PTE. LIMITED Full time

    Mission Summary At Motional, we're transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving stack depends on finding the rare edge cases, long-tail scenarios, and model errors that matter most. OmniTag, our ML-powered multimodal data mining framework, is the...


  • Singapore, Central, Singapore Motional Full time $120,000 - $180,000 per year

    Mission Summary:At Motional, we're transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving stack depends on finding the rare edge cases, long-tail scenarios, and model errors that matter most. OmniTag, our ML-powered multimodal data mining framework, is the...


  • Singapore DayOne Data Centers Full time

    Senior Software Engineer (Backend) page is loaded## Senior Software Engineer (Backend)locations: Corporate Office-Singaporetime type: Full timeposted on: Posted Todayjob requisition id: JR266Join DayOne – Shaping the Future of Data Infrastructure DayOne is a global leader in the development and operation of high-performance data centers. As one of the...


  • Singapore H2 GAMES PTE. LTD. Full time

    We are looking for a Senior Data Engineer to build and manage the data pipelines and infrastructure supporting the cutting-edge user growth platform. This role will ensure seamless data integration and real-time analytics capabilities. **Responsibilities**: - Design, develop, and maintain scalable data pipelines to process and analyze data from multiple...

  • Senior Data Engineer

    2 weeks ago


    Singapore H2 GAMES PTE. LTD. Full time

    We are looking for a Senior Data Engineer to build and manage the data pipelines and infrastructure supporting the cutting-edge user growth platform. This role will ensure seamless data integration and real-time analytics capabilities. Job Responsibilities: Design, develop, and maintain scalable data pipelines to process and analyze data from multiple...

  • Research Engineer I

    5 days ago


    Singapore Nanyang Technological University Full time

    Key Responsibilities: The successful applicant will be responsible for the development of data mining algorithms, and building systems for managing data streams. This includes: - Designing and developing scalable algorithms for processing data of large scale. - Wring research papers of high quality based on research results - Building deployable systems...


  • Singapore MANPOWER STAFFING SERVICES (SINGAPORE) PTE LTD Full time

    About the job We are looking for a Senior / Data Engineer to build and manage the data pipelines and infrastructure supporting the cutting‐edge user growth platform. This role will ensure seamless data integration and real‐time analytics capabilities. Job Responsibilities Design, develop, and maintain scalable data pipelines to process and analyze data...


  • Singapore MANPOWER STAFFING SERVICES (SINGAPORE) PTE LTD Full time

    Roles & Responsibilities About the job We are looking for a Senior / Data Engineer to build and manage the data pipelines and infrastructure supporting the cutting-edge user growth platform. This role will ensure seamless data integration and real-time analytics capabilities. Job Responsibilities: Design, develop, and maintain scalable data pipelines to...


  • Singapore Looking for a new job? Full time $120,000 - $180,000 per year

    Senior Backend EngineerAbout the RoleWere a fast-growing, VC-backed cybersecurity startup based in Singapore, building cutting-edge technology to help some of the worlds most targeted organisations understand and reduce their real-time security exposure.Were looking for a (Senior) Backend Engineer to join our high-impact engineering team. You'll play a...


  • Singapore NodeFlair Full time

    We are working with a fast-growing cybersecurity startup company, and as part of their continued growth, NodeFlair has been engaged to search for a Senior Backend Engineer to join their Singapore team. **The package is competitive at SGD 84k-156k (excluding bonus).** We're looking for a Senior Backend Engineer to help shape and develop the core backend...