Big Data AI Engineer

3 days ago


Singapore, Singapore Pixalate, Inc Full time

AI Engineer - Big Data

Employment Type: Full-Time
Location: Remote, Singapore
Level: Entry to Mid Level (PhD Required)

Bridge Cutting-Edge AI Research with Petabyte-Scale Data Systems

About the Role

Work at the intersection of big data and AI, where you'll develop intelligent, self-healing data systems processing trillions of data points daily. You'll have autonomy to pursue research in distributed ML systems and AI-enhanced data optimization, with your innovations deployed at unprecedented scale within months, not years.

This isn't traditional data engineering - you'll implement agentic AI for autonomous pipeline management, leverage LLMs for data quality assurance, and create ML-optimized architectures that redefine what's possible at petabyte scale.

Key Research Areas & Responsibilities

AI-Enhanced Data Infrastructure
  • Design intelligent pipelines with autonomous optimization and self-healing capabilities using agentic AI
  • Implement ML-driven anomaly detection for terabyte-scale datasets

Distributed Machine Learning at Scale
  • Build distributed ML pipelines
  • Develop real-time feature stores for billions of transactions
  • Optimize feature engineering with AutoML and neural architecture search

Required Qualifications

Education & Research
  • PhD in Computer Science, Data Science, or Distributed Systems (exceptional Master's with research experience considered)
  • Published research or expertise in distributed computing, ML infrastructure, or stream processing

Technical Expertise
  • Core Languages: Expert SQL (window functions, CTEs), Python (Pandas, Polars, PyArrow), Scala/Java
  • Big Data Stack: Spark 3.5+, Flink, Kafka, Ray, Dask
  • Storage & Orchestration: Delta Lake, Iceberg, Airflow, Dagster, Temporal
  • Cloud Platforms: GCP (BigQuery, Dataflow, Vertex AI), AWS (EMR, SageMaker), Azure (Databricks)
  • ML Systems: MLflow, Kubeflow, Feature Stores, Vector Databases, scikit-learn + search CV, H2O AutoML, auto-sklearn, GCP Vertex AI AutoML Tables
  • Neural Architecture Search: KerasTuner, AutoKeras, Ray Tune, Optuna, PyTorch Lightning + Hydra

Research Skills
  • Track record with 100TB+ datasets
  • Experience with lakehouse architectures, streaming ML, and graph processing at scale
  • Understanding of distributed systems theory and ML algorithm implementation

Preferred Qualifications
  • Experience applying LLMs to data engineering challenges
  • Ability to translate complex AutoML/NAS research into practical production workflows
  • Hands-on project examples of feature engineering automation or NAS experiments
  • Proven success in automating ML pipelines, from raw data to an optimized model architecture
  • Contributions to Apache projects (Spark, Flink, Kafka)
  • Knowledge of privacy-preserving techniques and data mesh architectures

What Makes This Role Unique

You'll work with one of the few truly petabyte-scale production datasets outside of major tech companies, with the freedom to experiment with cutting-edge approaches. Unlike traditional big data roles, you'll apply the latest AI research to fundamental data challenges - from using LLMs to understand data quality issues to implementing agentic systems that autonomously optimize and heal data pipelines.

About us

Pixalate is an online trust and safety platform that protects businesses, consumers and children from deceptive, fraudulent and non-compliant mobile and CTV apps and websites. We're seeking a PhD-level AI Engineer to lead cutting-edge research in agentic AI systems, multimodal analysis, and advanced reasoning architectures that will directly impact millions of users worldwide.

Our software and data have been used to unearth multiple high-profile criminal and illegal surveillance cases, including:
  • UNICEF: Pixalate was recently recognized by UNICEF as a Key Innovator for protecting children's online privacy.
  • Gizmodo: An iCloud Feature Is Enabling a $65 Million Scam, New Research Says
  • Adweek: A 7-Figure Ad Fraud Scheme Running on Roku Underlines Murkiness of CTV
  • Washington Post: Your kids' apps are spying on them
  • ProPublica: Porn, Piracy, Fraud: What Lurks Inside Google's Black Box Ad Empire
  • ABC7 News: The State of Children's Privacy Online
  • NBC News: How many apps are tracking your children

Our team of lawyers, data scientists, engineers, economists and researchers spans the globe, with a presence in California, New York, Washington DC, London and Singapore.

Pixalate is an equal opportunity employer committed to building a diverse team.

Benefits

At Pixalate, we offer an extremely competitive salary, outstanding benefits, and a dynamic work environment. You will have the opportunity to work on pioneering technologies alongside some of the brightest minds in the industry. If you're passionate about maintaining high software quality and thrive in a fast-paced, challenging environment, you'll fit right in.
  • Monthly internet reimbursement
  • Casual, remote work environment
  • Hybrid, flexible hours
  • Opportunity for advancement
  • Fun annual team events
  • Being part of a high-performing team that wants to win and have fun doing it

We particularly encourage applications from underrepresented groups in AI research.



  • Singapore, Singapore Pixalate Full time

    5 days ago

    Level: Entry to Mid Level (PhD Required). Bridge Cutting-Edge AI Research with Petabyte-Scale Data Systems. Pixalate is an online trust and safety platform that protects businesses, consumers and children from deceptive, fraudulent and non-compliant...


  • Singapore, Singapore Patsnap Full time

    About PatSnap: PatSnap empowers IP and R&D teams by providing better answers, so they can make faster decisions with more confidence. Founded in 2007, Patsnap is the global leader in AI-powered IP and R&D intelligence. Our domain-specific LLM, trained on our extensive proprietary innovation data, coupled with Hiro, our AI assistant, delivers actionable...

  • Big Data Engineer

    4 weeks ago


    Singapore, Singapore ALMR CONSULTING PTE. LTD. Full time

    Job Summary: We are looking for an experienced Big Data Engineer with at least 5 years of experience in managing data pipelines and processing within Big Data environments (e.g. Cloudera Data Platform). The role involves designing, developing, and maintaining data ingestion and transformation jobs to support analytics and reporting needs. Key Responsibilities...

  • Data Engineer

    5 days ago


    Singapore, Singapore Tap Growth ai Full time

    We're Hiring: Data Engineer! We are seeking a skilled and detail-oriented Data Engineer to join our growing team. The ideal candidate will be responsible for designing, building, and maintaining scalable data pipelines and infrastructure to support analytics, reporting, and data-driven decision-making...

  • Big Data Engineer

    1 week ago


    Singapore, Singapore Tata Consultancy Services Limited Full time

    Requirements: Proven experience as a Big Data Engineer or similar role. Strong proficiency in Apache Spark and Scala programming. Experience with big data technologies such as Hadoop, Hive, Kafka, etc. Familiarity with data warehousing concepts and tools. Knowledge of SQL and NoSQL databases. Excellent problem-solving skills and attention to detail. Strong...


  • Singapore, Singapore Binance Full time

    Senior Data Warehouse Engineer, AI Data Services. Singapore (Hybrid work environment), Full-time. About the Role: As a Senior Data Warehouse Engineer, you'll be at the core of Binance's AI Data Services team, tasked with designing scalable, high-performance data warehouse solutions. This includes: Building ETL pipelines, data...

  • AI Data Engineer

    1 day ago


    Singapore, Singapore InnoCellence Full time

    We are looking for a skilled and experienced Data Science Engineer to join our team. The ideal candidate will be responsible for designing, building, and maintaining robust data pipelines to support the processing and analysis of clinical study and digital device sensor data. As a Data Science Engineer, you will work closely with data scientists and software...

  • Big Data Engineer

    1 week ago


    Singapore, Singapore OCBC Full time

    Who We Are: As Singapore's longest established bank, we have been dedicated to enabling individuals and businesses to achieve their aspirations since 1932. How? By taking the time to truly understand people. From there, we provide support, services, solutions, and career paths that meet their individual needs and desires. Today, we're on a journey of...

  • AI Data Engineer

    4 weeks ago


    Singapore, Singapore InnoCellence Full time

    Overview: We are looking for a skilled and experienced Data Science Engineer to join our team. The ideal candidate will be responsible for designing, building, and maintaining robust data pipelines to support the processing and analysis of clinical study and digital device sensor data. You will work closely with data scientists and software engineers to...

  • Data Engineer

    2 weeks ago


    Singapore, Singapore QuantumBlack, AI by McKinsey Full time

    Data Engineer - QuantumBlack, AI by McKinsey. Who You'll Work With: Driving lasting impact and building long-term capabilities with our clients is not easy work. You are the kind of person who thrives in a high performance/high reward culture - doing hard things, picking yourself up when you stumble, and having the resilience to try another way forward. Your...