Lead Machine Learning Engineer

23 hours ago


Singapore Thoughtworks Full time $150,000 - $250,000 per year

Machine Learning Engineers specializing in Inference Optimization focus on maximizing the efficiency, speed, and cost-effectiveness of deploying AI models across diverse environments. They apply advanced optimization techniques to improve runtime inference and application performance. Their work ensures that clients can scale AI solutions sustainably, whether in the cloud, on-premises, or at the edge.

As a Lead Machine Learning Engineer at Thoughtworks, you'll combine deep technical capability with team leadership and architectural thinking. You'll guide teams through complex optimization challenges, design scalable inference systems, and ensure AI solutions are not only high-performing but operationally sustainable. You'll act as a bridge between hands-on engineering and strategic technical direction, mentoring others while shaping the standards and practices that define excellence in inference engineering.

(Tips: Thoughtworks Singapore will be shortlisting applicants who have a current right to work in Singapore i.e. Singapore Citizens and Singapore Permanent Residents only.)

Job responsibilities
  • Lead the design and implementation of advanced model optimization pipelines, including quantization, pruning, and distillation.Architect and tune inference runtimes and serving frameworks to achieve optimal performance across deployments.
  • Guide teams in implementing high-throughput serving strategies (continuous batching, KV caching, speculative decoding, asynchronous scheduling).
  • Develop benchmarks and performance dashboards to measure and communicate system-level efficiency improvements (throughput, latency, GPU utilization, cost).
  • Evaluate trade-offs across accuracy, performance, and cost, and design architectures to meet target SLAs across varied hardware environments (cloud, on-prem, edge).
  • Collaborate with infrastructure, MLOps, and product teams to embed inference optimization into production workflows and platform designs.
  • Provide technical leadership and mentorship to engineers, fostering a culture of experimentation, rigor, and continuous performance improvement.
  • Contribute to the development of internal frameworks, reference architectures, and playbooks for scalable and cost-efficient inference.
  • Engage with clients to translate optimization outcomes into business value and articulate the ROI of technical improvements.
Job qualifications Technical Skills
  • Deep practical expertise in model and runtime optimization techniques (quantization, pruning, distillation, batching, caching).
  • Proven experience optimizing inference workloads using frameworks such as vLLM, NVIDIA Triton/Dynamo.
  • Strong proficiency in deep learning frameworks (e.g. PyTorch, TensorFlow) with production deployment experience.
  • Ability to diagnose and optimize performance using profiling tools (e.g. Nsight, PyTorch/TensorFlow profilers).
  • Solid understanding of GPU and accelerator architectures, and experience tuning workloads for cost and performance efficiency.
  • Experience designing and benchmarking scalable inference systems across heterogeneous environments (GPU clusters, serverless, edge).
  • Familiarity with observability stacks, telemetry, and cost instrumentation for AI workloads.
Professional Skills
  • Demonstrated ability to lead small-to-medium engineering teams or technical workstreams.
  • Skilled at balancing hands-on delivery with architectural oversight and mentorship.
  • Strong communication and stakeholder engagement skills and are able to connect low-level optimizations with business impact.
  • Comfortable in ambiguous and fast-evolving technology landscapes, with a passion for applied innovation.
  • Commitment to continuous learning and knowledge sharing across teams and communities.
Other things to know Learning & Development

There is no one-size-fits-all career path at Thoughtworks: however you want to develop your career is entirely up to you. But we also balance autonomy with the strength of our cultivation culture. This means your career is supported by interactive tools, numerous development programs and teammates who want to help you grow. We see value in helping each other be our best and that extends to empowering our employees in their career journeys.

About Thoughtworks

Thoughtworks is a dynamic and inclusive community of bright and supportive colleagues who are revolutionizing tech. As a leading technology consultancy, we're pushing boundaries through our purposeful and impactful work. For 30 years, we've delivered extraordinary impact together with our clients by helping them solve complex business problems with technology as the differentiator. Bring your brilliant expertise and commitment for continuous learning to Thoughtworks. Together, let's be extraordinary.

#LI-Onsite

See here our AI policy.



  • Singapore NEWBRIDGE ALLIANCE PTE. LTD. Full time

    Our Client is a leading Adtech company specializing in digital products. With a strong focus on innovation and cutting-edge technology, we are committed to revolutionizing the advertising industry and providing unparalleled solutions to our clients. **Role Overview: As a Machine Learning Engineer with our client, you will play a pivotal role in developing...


  • Singapore minden.ai Full time

    **Who we are**: minden.ai is a technology venture founded by Temasek in strategic partnership with DFI Retail Group and coalition partners Breadtalk Group, DBS Bank, PAssion Card, Mandai Wildlife Group, Singtel, Great Eastern, FoodPanda and GoJek. We are on a mission to redefine how brands engage with their customers through the power of machine learning and...


  • Singapore TikTok Full time $120,000 - $240,000 per year

    Machine learningLead Machine Learning Engineer - User Growth - Recommendation - SingaporeLocation:SingaporeEmployment Type:RegularJob Code:A248801AResponsibilitiesAbout the TeamThe TikTok User Growth (UG) algorithm team is the core engine propelling the rapid growth of TikTok. Facing real-world challenges in manifold scenarios, the team skillfully leverages...


  • Singapore Thoughtworks Full time

    Lead Machine Learning Engineers at Thoughtworks use modern architectures to develop end-to-end scalable machine learning systems and applications. They use their specialized depth and breadth of knowledge to impact the achievement of client, project or service objectives and advocate for ways of working to promote and deliver excellence. They operate within...


  • Singapore Thoughtworks Full time

    Lead Machine Learning Engineers at Thoughtworks use modern architectures to develop end-to-end scalable machine learning systems and applications. They use their specialized depth and breadth of knowledge to impact the achievement of client, project or service objectives and advocate for ways of working to promote and deliver excellence. They operate within...


  • Singapore TikTok Pte. Ltd. Full time $100,000 - $250,000 per year

    Responsibilities TikTok Core Feed Recommendation team sits in the center of TikTok, designs, implements and improves the core recommendation algorithm that powers the "for you" feed, "following" feed, etc. of the TikTok app. The recommendation system we built connects hundreds of millions of users with relevant content out of billions of videos in...


  • Singapore TikTok Pte. Ltd. Full time $104,000 - $130,878 per year

    ResponsibilitiesTikTok-Data Video Recommendation Team is responsible for the personalized recommendation algorithms for TikTok's hundreds of millions of global users. Here, you will collaborate with top algorithm engineers in the industry, leveraging your expertise in deep learning, recommendation algorithms, and large models to continuously transform and...


  • Singapore Thoughtworks Inc. Full time

    # Lead Machine Learning EngineerSingapore, Singapore## Job responsibilities* Lead the design and implementation of advanced model optimization pipelines, including quantization, pruning, and distillation.Architect and tune inference runtimes and serving frameworks to achieve optimal performance across deployments.* Guide teams in implementing high-throughput...


  • Singapore Standard Chartered Full time

    Job ID: 26008 Location: Singapore, SG Area of interest: Technology Job type: Regular Employee Work style: Office Working Opening date: 16 Apr 2025 **Key Responsibilities** **Strategy** - Delivering, building and maintaining the Data Driven solutions that are core to Group Operations strategy **Business** - Continually look at the environment to...


  • Singapore Snaphunt Pte Ltd Full time

    Company Snaphunt Pte Ltd Designation Machine Learning Engineer Date Listed 17 Feb 2025 Job Type Entry Level / Junior Executive - Full/Perm Job Period Immediate Start, Permanent Profession IT / Information Technology Industry Computer and IT Location Name Singapore Allowance / Remuneration $5,000 - 6,000 monthly Company Profile Our client...