
AI/ML Engineer – Assessment Systems
Northeastern University College of Engineering
part-time
Posted on:
Location Type: Office
Location: Boston • Massachusetts • United States
Visit company websiteExplore more
Salary
💰 $25 per hour
Tech Stack
About the role
- Develop AI scoring architecture combining rule-based and LLM components
- Develop prompt engineering strategies for rubric-aligned evaluation
- Implement quantitative indicator scoring (frequency counts, behavioral thresholds)
- Create LLM-based qualitative judgment systems for complex indicators
- Build data processing pipelines for transcript preparation and analysis
- Compare AI ratings against human expert ground truth (n=200+ transcripts)
- Calculate agreement statistics (correlation, Cohen's kappa, confusion matrices)
- Identify systematic rating discrepancies and refine prompts/algorithms
- Implement confidence scoring to flag uncertain AI judgments
- Conduct bias audits across demographic subgroups (gender, race/ethnicity)
- Deploy validated scoring system for 300+ student transcripts
- Create technical documentation for scoring methodology
- Develop user-facing explanations of AI ratings for educators
Requirements
- Graduate student or professional in Computer Science, Data Science, Computational Linguistics, or related field
- Strong Python programming skills (pandas, scikit-learn, numpy)
- Experience with large language model APIs (OpenAI GPT-4, Claude, or similar)
- Demonstrated ability to design and implement NLP pipelines
- Understanding of evaluation metrics (precision, recall, F1, correlation, agreement statistics)
- Experience with version control (Git) and collaborative coding practices
- Experience with prompt engineering and LLM optimization
- Background in natural language processing or computational linguistics
- Familiarity with educational assessment or psychometrics
- Knowledge of inter-annotator agreement statistics (Cohen's kappa, Krippendorff's alpha)
- Experience with retrieval-augmented generation (RAG) systems
- Understanding of bias detection and fairness metrics in AI systems
Benefits
- medical
- vision
- dental
- paid time off
- tuition assistance
- wellness & life
- retirement
- commuting & transportation
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Pythonpandasscikit-learnnumpyNLP pipelinesprompt engineeringLLM optimizationevaluation metricsinter-annotator agreement statisticsretrieval-augmented generation
Soft Skills
collaborative codingtechnical documentationuser-facing explanations