upwork

Senior Lead Research Scientist, Agentic AI

Preview — apply on company site for full detailsApply Now

At a Glance

Location: Canada
Posted: 2026-02-09T17:59:44-05:00

Key Requirements

Required Skills

CI/CDMachine LearningPyTorchPython

Certifications

SAFe

Domain Knowledge

Accounting
Engineering
Finance
Marketing
Medical

Responsibilities

50/50 Split between research and engineering/productionalization.

Advance agentic benchmarking.

Define and maintain a rigorous evaluation suite for agents (task success, reliability, recovery, safety, latency, and cost).

Establish protocols, datasets, and reproducible metrics aligned to best practices in agentic evaluation; continuously harden benchmarks against loopholes and overfitting.

Lead novel studies on agent planning, tool use, reflection/memory, safety, and multi‑agent coordination.

Publish at top venues (e.g., NeurIPS/ICML/ICLR/ACL) and present learnings internally and externally.