← all jobs

[Remote] Reinforcement Learning Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. They are seeking a skilled Reinforcement Learning Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems. The role involves implementing modern reinforcement learning algorithms and collaborating with teams to identify valuable use cases.

Responsibilities

  • Design and implement reinforcement learning solutions for sequential decision-making problems in real and simulated environments
  • Develop, calibrate, and maintain simulation environments suitable for large-scale agent training
  • Implement and evaluate modern RL algorithms including policy gradient, actor-critic, off-policy, and offline RL methods
  • Engineer reward functions and shaping strategies that align agent behavior with desired outcomes and safety constraints
  • Apply offline RL and imitation learning techniques where exploration is costly or unsafe
  • Use RLHF, DPO, and related techniques for fine-tuning large language models when relevant
  • Build scalable training infrastructure for distributed RL, including efficient experience collection and replay systems
  • Optimize training stability and sample efficiency through algorithmic and engineering improvements
  • Design rigorous evaluation protocols, including out-of-distribution and adversarial test cases
  • Implement safety mechanisms such as constraint enforcement, conservative policies, and human-in-the-loop oversight
  • Collaborate with applied scientists and product teams to identify high-value RL use cases
  • Monitor deployed policies and models in production for drift, regression, and unintended behaviors, building the alerting and dashboards that surface issues before they meaningfully affect users
  • Document methodology, design decisions, and operational characteristics for internal stakeholders
  • Stay current with RL research and translate promising techniques into production-ready solutions

Skills

  • Master's or PhD in Computer Science, Machine Learning, or a related field; or equivalent applied experience
  • Six or more years of combined RL research and engineering experience
  • Strong proficiency in Python and modern deep learning frameworks
  • Hands-on experience with at least one major RL library or in-house RL stack
  • Solid understanding of probability, optimization, and the theoretical foundations of RL
  • Experience designing and tuning reward functions in non-trivial environments
  • Familiarity with simulation environments and large-scale experience collection
  • Experience training neural network policies on GPU clusters
  • Strong written and verbal communication skills
  • Track record of shipping or publishing impactful RL work
  • Experience with RLHF for large language models
  • Familiarity with multi-agent RL or hierarchical RL
  • Exposure to robotics, control systems, or autonomous driving
  • Publications in RL or related research venues
  • Open-source contributions to RL libraries or environments

Benefits

  • Competitive base salary commensurate with experience, plus benefits.
  • Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party).
  • We will support H1B transfers for qualified candidates.

Company Overview

  • Bright Vision Technologies is an information technology company that offers software development, AI, and cybersecurity services. It was founded in 2020, and is headquartered in Bridgewater, New Jersey, USA, with a workforce of 51-200 employees. Its website is https://bvteck.com.
  • More open positions

    [Remote] Account Executive

    Work from home Full-time role

    [Remote] AI Data Infrastructure Engineer

    Work from home Full-time role

    [Remote] Agentic AI Engineer - Burlington/Boston, MA OR Princeton, NJ

    Work from home Full-time role

    [Remote] Senior Data Analyst

    Work from home Full-time role

    [Remote] Edge AI Engineer

    Work from home Full-time role

    Research Advisor - Human Frontier Collective (US)

    Work from home Full-time role

    Senior Application Developer (CA Plex) - Remote

    Work from home Full-time role

    Experienced Mandarin-English Bilingual Healthcare Customer Service Representative – Remote Opportunity in California

    Work from home Full-time role

    [Work From Home] Remote Receptionist- Santa Fe, NM

    Work from home Full-time role

    Remote Customer Service Representative – Premium Financial Services Support at careerzynith

    Work from home Full-time role

    Payments Architecture & Engineering Specialist

    Work from home Full-time role

    [Hiring] Manager, Solution Delivery @Quest Diagnostics

    Work from home Full-time role

    Cloud & DevOps Engineer

    Work from home Full-time role

    Fractional CMO Needed to Drive Lead Gen for $1.5M Mastermind (Direct Response + Facebook Ads)

    Work from home Full-time role

    Part-Time (34 hours/week) Data Entry Claims Intake Processor

    Work from home Full-time role

    [Remote] Software Engineer, Stablecoin

    Work from home Full-time role

    Remote Live Chat Customer Support Specialist – careerzynith – Full‑Time Work‑From‑Home Role with Competitive Salary & Benefits

    Work from home Full-time role

    Recruiter Specialist - Health Care Provider

    Work from home Full-time role

    GSA Schedule & Federal Contracts Manager

    Work from home Full-time role

    Patient Care Coordinator - Dermatology

    Work from home Full-time role

    [Remote] Mechanical Engineering Analyst Intern - Summer 2026

    Work from home Full-time role