Senior Applied Scientist, AI Data Platform (CoreAI)

Microsoft
United States, Washington, Redmond
Sep 21, 2025
Overview

Join Microsoft's CoreAI team to build the AI Data Platform, the foundation for secure, scalable, reusable datasets that power model development. The AI Data Platform team's mission is to build a central AI data platform that breaks down Microsoft's data silos and manages the full lifecycle of first-party, third-party, synthetic, and human-labeled data, accelerating AI model development with secure, reusable, and compliant datasets. The AI Data Platform team is responsible for large-scale data infrastructure, automation tools, and intelligence services to transform how Microsoft collects, generates, manages, and shares AI training data.

We are seeking Applied Scientists to drive scientific innovation in data generation, validation, evaluation, and automation. You will set the vision for intelligent, ML-driven services that manage the end-to-end data lifecycle, and partner with leaders across Microsoft to ensure Microsoft's data investments deliver maximum AI impact.

Responsibilities

Advance machine learning and data science to improve data quality, automate dataset generation, and design intelligent agent-driven services that manage the end-to-end data lifecycle.

Develop ML-based pipelines for data generation, validation, augmentation, and discovery (e.g., synthetic data, human-in-the-loop workflows).
Design and train intelligent agents to automate key parts of the dataset lifecycle, including ingestion, validation, PII detection and handling, governance, discovery, and feedback loops.
Build evaluation methods to measure dataset quality, coverage, and usefulness for large-scale model training.
Leverage AI/ML techniques (e.g., classification, clustering, anomaly detection, embeddings, LLM-based evaluation) to improve data discovery, curation, and governance.
Collaborate with engineers to integrate scientific methods and models into scalable pipelines and platform services.
Partner with AI product and research teams (CoreAI, MAI, M365, GitHub, MSR, and more) to align datasets with model training needs and identify new opportunities.
Contribute thought leadership by publishing or sharing insights internally and externally to shape Microsoft's data-centric AI practices.