Job Description
**Role Number:** 200622800-0836
**Summary**
As part of our work on machine-generated dialog, we are developing novel measurements of its quality. These include cutting-edge LLM judges for aspects such as groundedness (lack of hallucinations), Siri Tone and Style (a suite of Design requirements), Safety, and others.
**Description**
To measure our progress on this front, we need to track the state of our dataset composition, the accuracy of our LLM judges, and the results of human expert review in a central, visual representation. A DRI for Metrics and Reporting will:
**Minimum Qualifications**
+ M.S. or Ph.D. in Computer Science, Data Science, or Data Engineering
+ 3+ years in data science and/or data engineering (Iceberg, pandas, Python, Tableau or equivalent, data collection and visualization)
+ 2+ years of Python coding
+ Good understanding of metrics, crowd science, annotation analysis, and statistics
+ Ability to work independently and cross-functionally ...
Ready to Apply?
Take the next step in your AI career. Submit your application to Apple today.