Senior Machine Learning Engineer

7 hours ago


Singapore Thoughtworks Full time

Join to apply for the Senior Machine Learning Engineer role at Thoughtworks Machine Learning Engineers specializing in Inference Optimization focus on maximizing the efficiency, speed, and cost-effectiveness of deploying AI models across diverse environments. They apply advanced techniques at every stage of the model lifecycle from training through runtime inference to application logic and observability. Their work ensures that clients can scale AI solutions sustainably, whether in the cloud, on-premises, or at the edge. By combining deep expertise in model compression, runtime acceleration, and serving frameworks with an understanding of real-world business needs, they directly influence system performance and operational cost. They design, implement, and benchmark cutting‐edge optimization strategies to deliver measurable gains in throughput, latency, and GPU utilization. As a Senior Machine Learning Engineer at Thoughtworks, you'll bring both engineering rigor and creative problem‐solving to one of AI's fastest‐evolving domains. Job Responsibilities Implement and tune advanced model optimization techniques such as post‐training quantization, pruning, and knowledge distillation. Configure and optimize inference runtimes and serving frameworks (e.g., NVIDIA Triton, vLLM, TensorRT‐LLM, DeepSpeed, SGLang). Enable high‐throughput serving using continuous batching, KV caching, speculative decoding, and asynchronous scheduling. Apply kernel fusion strategies to reduce latency and memory overhead. Evaluate trade‐offs across accuracy, throughput, latency, and GPU/accelerator utilization for different hardware footprints (cloud, on‐prem, serverless, edge). Develop and maintain performance benchmarks using profiling tools (e.g., PyTorch/TensorFlow profilers, Nsight) to identify bottlenecks. Collaborate with AI delivery teams to embed inference best practices into application logic (e.g., prompt optimization, caching, model routing). Contribute to internal knowledge sharing, technical playbooks, and enablement material to uplift inference engineering capabilities across teams. Job Qualifications Technical Skills Strong foundation in machine learning with expertise in inference optimization techniques (quantization, pruning, distillation, batching, KV caching, etc.). Hands‐on experience with modern inference runtimes and compilers (eg. TensorRT, ONNX Runtime, vLLM, Triton, DeepSpeed). Familiarity in deep learning frameworks with production‐ready engineering practices. Understanding of benchmarking and profiling workloads to guide optimization decisions. Familiarity with GPU/accelerator architectures and cloud inference APIs. Understanding of trade‐offs between model accuracy, performance, and cost, and ability to tune accordingly. Comfort working across multiple model types (eg. LLM, VLM, SLM) and deployment environments (cloud, on‐prem, edge). Professional Skills Ability to translate technical optimizations into tangible business outcomes (e.g., lower cost per token). Comfortable in fast‐moving, ambiguous environments and motivated to explore new research directions. Good communication skills to explain performance trade‐offs and recommendations to both technical and non‐technical stakeholders. A mindset of continuous learning and sharing, eager to mentor peers and contribute to a culture of technical excellence. Other things to know Learning & Development There is no one‐size‐fits‐all career path at Thoughtworks: however you want to develop your career is entirely up to you. But we also balance autonomy with the strength of our cultivation culture. This means your career is supported by interactive tools, numerous development programs and teammates who want to help you grow. We see value in helping each other be our best and that extends to empowering our employees in their career journeys. About Thoughtworks Thoughtworks is a dynamic and inclusive community of bright and supportive colleagues who are revolutionizing tech. As a leading technology consultancy, we're pushing boundaries through our purposeful and impactful work. For 30+ years, we've delivered extraordinary impact together with our clients by helping them solve complex business problems with technology as the differentiator. Bring your brilliant expertise and commitment for continuous learning to Thoughtworks. Together, let's be extraordinary. See here our AI policy. Referrals increase your chances of interviewing at Thoughtworks by 2x Seniority level Not Applicable Employment type Full-time Job function Engineering and Information Technology Industries Software Development and IT Services and IT Consulting #J-18808-Ljbffr



  • Singapore Portcast Full time

    Description Join to apply for the Senior Machine Learning Engineer role at Portcast 2 days ago Be among the first 25 applicants Join to apply for the Senior Machine Learning Engineer role at Portcast About Us: Portcast is a venture-backed startup which predicts global trade flows to help logistics and shipping companies become more profitable. We are a...


  • Singapore Portcast Full time

    Description Join to apply for the Senior Machine Learning Engineer role at Portcast 2 days ago Be among the first 25 applicants Join to apply for the Senior Machine Learning Engineer role at Portcast About Us: Portcast is a venture-backed startup which predicts global trade flows to help logistics and shipping companies become more profitable. We are a...


  • Singapore Portcast Full time

    Join to apply for the Senior Machine Learning Engineer role at Portcast 2 days ago Be among the first 25 applicants Join to apply for the Senior Machine Learning Engineer role at Portcast About Us: Portcast is a venture-backed startup which predicts global trade flows to help logistics and shipping companies become more profitable. We are a predictive...


  • Singapore PLUANG TECHNOLOGIES PTE. LTD. Full time

    Roles & Responsibilities As a Machine Learning Engineer (Trading & Financial Intelligence) , you will contribute to the development of AI-powered systems and autonomous agents that transform how financial analysis and decision-making are conducted. Working under the guidance of senior team members, you will help build intelligent solutions that analyze...


  • Singapore PLUANG TECHNOLOGIES PTE. LTD. Full time

    As a Machine Learning Engineer (Trading & Financial Intelligence) , you will contribute to the development of AI‑powered systems and autonomous agents that transform how financial analysis and decision‑making are conducted. Working under the guidance of senior team members, you will help build intelligent solutions that analyze markets, extract insights...


  • Singapore SGX Group Full time

    Overview SGX is looking for a Machine Learning Engineer who is passionate about building scalable data/machine learning platforms and pioneering solutions. As a Machine Learning Engineer, you will play a crucial role in transforming how we run and deploy AI/ML models. Your work will directly impact our ability to build and deliver AI/ML use cases that will...

  • Software Engineer

    2 weeks ago


    Singapore NodeFlair Full time

    **Job Summary**: **Job Type** Permanent **Seniority** Junior **Years of Experience** Information not provided **Tech Stacks** TensorFlow AWS LoRa SpaCy PyTorch Python **Responsibilities** - In this role, you will: - Collaborate with senior engineers to develop and implement machine learning models and algorithms for our AI answer engine. - Assist in...


  • Singapore Pluang Technologies Full time $80,000 - $120,000 per year

    As a Machine Learning Engineer (Trading & Financial Intelligence) , you will contribute to the development of AI-powered systems and autonomous agents that transform how financial analysis and decision-making are conducted. Working under the guidance of senior team members, you will help build intelligent solutions that analyze markets, extract insights from...


  • Singapore SHIELD Full time $120,000 - $200,000 per year

    SHIELD is a device-first fraud intelligence platform that helps digital businesses worldwide eliminate fake accounts and stop all fraudulent activity. Powered by SHIELD AI, we identify the root of fraud with the global standard for device identification (SHIELD Device ID) and actionable fraud intelligence, empowering businesses to stay ahead of new and...


  • Singapore ST Engineering Full time

    Direct message the job poster from ST Engineering Responsibilities Design, build and deploy Generative AI solutions on AWS Provide L2/L3 level support for Generative AI solutions in production Develop predictive models for failure forecasting, quality, and inventory planning Collaborate across engineering, supply chain, and MRO teams to embed data-driven...