Get the latest updates on AI-powered hiring, career growth, and technical deep-dives delivered to your inbox.
Dyson
At Dyson, we’re driven by a relentless pursuit of innovation—pushing boundaries in engineering, AI, and robotics. Our new Data Intelligence team sits at the heart of this mission: shaping Dyson’s future through data. Here, we blend creativity, precision, and audacity to power intelligent products. We craft data strategies and pipelines that fuel the next generation of connected devices.
You’ll work alongside brilliant minds from Dyson global engineering team and external software/hardware partners in an environment built for exploration, discovery, delivery and impact.
We are looking for a specialized Lead Data Intelligence Machine Learning Engineer to design and implement in-house tools that automate our data labelling pipelines. Your primary goal will be to reduce our reliance on manual annotation by leveraging techniques like Active Learning, Weak Supervision, and Synthetic Data Generation.
You will bridge the gap between raw data collection and model-ready datasets, ensuring high-quality labels at scale.
At least 8+ years of professional experience in Machine Learning engineering, specifically focused on data centric-AI or computer vision/NLP pipelines.
Proficiency in Python: Mastery of the Machine Learning stack (PyTorch or TensorFlow, NumPy, Pandas, Scikit-learn).
Automated Labelling Expertise: Proven experience with Weak Supervision (labelling functions) or Active Learning strategies (uncertainty sampling, diversity sampling).
Data Engineering: Experience with SQL and NoSQL databases, and managing large-scale unstructured data (images, text, or audio).
Cloud Infrastructure: Familiarity with AWS (SageMaker Ground Truth), GCP (Vertex AI), or Azure ML labelling services.
Version Control for Data: Experience with DVC (Data Version Control) or similar tools to track dataset iterations.
Hands-on expertise building auto-labelling solutions or working with large-scale data annotation workflows.
Advanced skills in Python (and/or other relevant languages), and experience with key ML/data science libraries (e.g. TensorFlow, PyTorch, scikit-learn, pandas).
Experience designing, deploying, and maintaining scalable data pipelines, including data cleansing, transformation, and storage (cloud, on-prem, or hybrid).
Strong background in feature engineering, data analysis, and data visualization—comfortable using tools like Jupyter, Tableau, or Power BI.Great communicator who documents solutions clearly and collaborates effortlessly across technical and non-technical teams.
Able to balance speed and quality, stay curious about new developments, and deliver results in a fast-moving environment.
Bachelor’s or Master's degree in computer science, Engineering, Mathematics, Data Science, or a related field.
Dyson is an equal opportunity employer. We know that great minds don’t think alike, and it takes all kinds of minds to make our technology so unique.
We welcome applications from all backgrounds and employment decisions are made without regard to race, colour, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other any other dimension of diversity.
Verified Listing
This role has been verified for authenticity, market-rate compensation, and remote eligibility.