PREP Research Associate - Researching Scenarios for AI Evaluations and Metrics

PREP0003671

July 17, 2025

This position is part of the National Institute of Standards and Technology (NIST) Professional Research Experience Program (PREP). NIST recognizes that its research staff may wish to collaborate with researchers at academic institutions on specific projects of mutual interest, and therefore requires that such institutions be recipients of a PREP award. PREP requires staff from a wide range of backgrounds to work on scientific research in many areas. Employees in this position will perform technical work that underpins the scientific research of the collaboration.

Research Title:

Researching Scenarios for AI Evaluations and Metrics

The work will entail:

The National Institute of Standards and Technology (NIST) is building a library of sector-specific scenarios to facilitate AI evaluations and measurements. These scenarios are grounded in real-world AI use cases and developed with input from the broader AI community and sector stakeholders. Well-defined key elements of scenarios will ensure AI evaluations are both realistic and effective. The PREP candidate will be responsible for assisting in refining and documenting the AI scenario collection and generation process. The candidate will actively participate in NIST measurement science and be involved in human-centered research and evaluations of AI technologies.

Key responsibilities will include but are not limited to:

Developing methodology to assess various dimensions of the “goodness” of scenarios for AI evaluations, such as, but not limited to:
- Measurability of risks and benefits in AI scenarios, both at the individual and organizational levels.
- Applicability of scenarios to various evaluation types, such as evaluations to elicit negative impacts (e.g., risks), evaluations to elicit positive impacts (e.g., benefits), and evaluations of human-AI interaction.
Operationalizing higher-level key performance indicators (KPIs) and metrics of AI scenarios into meaningful measures of risks and benefits.
Refining and documenting the AI scenario collection and generation process for replicability and efficient scenario library development.
Presenting results at internal meetings and occasional meetings with external stakeholders.
Ensuring that results, protocols, and documentation have been archived or otherwise transmitted to the larger organization.

Qualifications

Background in any of the following or comparable fields: Computer Science, Human-Computer Interaction (HCI), Industrial/Organizational (I/O) Psychology, Cognitive Psychology, Human Factors/Engineering Psychology, Psychometrics, Economics.
Education level: graduate student or higher (postdoc preferred).
Strong background in research methodology.
Competency in quantitative and/or qualitative research and data analysis.
Knowledge/interest in human-computer interaction and human-AI interaction.
Knowledge/interest in machine learning and AI test and evaluation.
Ability to work both in teams and independently.
Strong oral and written communication skills.

Number of Positions and hours/week

1 position; if postdoctoral (preferred), full-time at 40 hours/week.
The details of the work arrangement are negotiable, but the position is envisioned to be telework, with the possibility of occasional in-person meetings on the NIST Gaithersburg campus.

Apply Here

The university is an Equal Employment Opportunity/Affirmative Action employer that does not unlawfully discriminate in any of its programs or activities on the basis of race, color, religion, sex, national origin, age, disability, veteran status, sexual orientation, gender identity or expression, or on any other basis prohibited by applicable law.