Upwork
Senior Lead Research Scientist, Agentic AI
At a Glance
- Location: Canada
- Posted: 2026-02-09T17:59:44-05:00
Key Requirements
Required Skills
- CI/CD
- Machine Learning
- PyTorch
- Python
Certifications
- SAFe
Domain Knowledge
- Accounting
- Engineering
- Finance
- Marketing
- Medical
Responsibilities
50/50 split between research and engineering/productionization.
Advance agentic benchmarking.
Define and maintain a rigorous evaluation suite for agents (task success, reliability, recovery, safety, latency, and cost).
Establish protocols, datasets, and reproducible metrics aligned to best practices in agentic evaluation; continuously harden benchmarks against loopholes and overfitting.
Lead novel studies on agent planning, tool use, reflection/memory, safety, and multi‑agent coordination.
Publish at top venues (e.g., NeurIPS/ICML/ICLR/ACL) and present learnings internally and externally.