Description

LLM Research Engineer

Montreal (preferred) · Eastern Canada · Remote flexibility

A fast growing AI company is pushing the boundaries of software development by combining compiler technology with next-generation agentic AI. They are seeking a Research Engineer who can bridge research and engineering, someone excited to explore new methods, validate them quickly, and help transform ideas into production-ready systems.

This role suits someone hands-on, curious, and motivated by the challenge of moving beyond academic research to build technology that works in the real world.

What you’ll do

  • Research and prototype agentic AI / LLM methods for problem-solving, planning, and code generation.
  • Evaluate and benchmark optimization approaches to ensure strong real-world performance.
  • Track the latest developments in AI research and open-source projects, and translate them into applied solutions.
  • Write production-aware code that can be integrated into live systems.
  • Work alongside a highly technical team to ship breakthrough features.

What’s needed

  • Advanced degree (Master’s/PhD) in Computer Science, Mathematics, Physics, or equivalent hands-on experience.
  • Prior work in AI/ML, ideally applying LLMs or agents to complex tasks.
  • Strong grounding in ML fundamentals and ability to apply them in new contexts.
  • Fluency in Python and modern ML frameworks (PyTorch, TensorFlow, JAX), plus familiarity with LLM APIs, agentic frameworks, and cloud platforms.
  • Balance of research creativity with practical engineering discipline.

Nice to have

  • Experience fine-tuning LLMs or applying reinforcement learning in applied settings.
  • Knowledge of compilers, program synthesis, or code optimization tools.
  • Contributions to open-source AI initiatives.

The offer

  • Competitive compensation (approx. CAD 125K–190K, with flexibility for standout talent).
  • Health and dental benefits.
  • Professional growth opportunities in a scaling AI environment.
  • Flexible work arrangements.
  • Access to state-of-the-art compute and development resources.