upwork

Senior Lead Research Scientist, Agentic AI

At a Glance

Location: Canada
Posted: 2026-02-09T17:59:44-05:00

Key Requirements

Required Skills

  • CI/CD
  • Machine Learning
  • PyTorch
  • Python

Certifications

  • SAFe

Domain Knowledge

  • Accounting
  • Engineering
  • Finance
  • Marketing
  • Medical

Responsibilities

The role is a 50/50 split between research and engineering/productionization.

Advance agentic benchmarking.

Define and maintain a rigorous evaluation suite for agents (task success, reliability, recovery, safety, latency, and cost).

Establish protocols, datasets, and reproducible metrics aligned to best practices in agentic evaluation; continuously harden benchmarks against loopholes and overfitting.

Lead novel studies on agent planning, tool use, reflection/memory, safety, and multi‑agent coordination.

Publish at top venues (e.g., NeurIPS/ICML/ICLR/ACL) and present learnings internally and externally.