Description
This role leads multifunctional teams developing groundbreaking approaches to AI assessment, including automated evaluation systems. You'll work with ML researchers, engineers, and domain experts to pioneer new methods for scalable, high-quality AI evaluation. We are looking for an outstanding, hands-on manager who will thrive in a fast-paced environment. We believe the most exciting problems in machine learning research arise at the intersection with real-world use cases, and this is also where the most critical breakthroughs come from. KEY RESPONSIBILITIES: Lead R&D in automated AI evaluation, including development of LLM-based assessment systems that can reliably evaluate model outputs Drive research and implementation of novel approaches to measure and improve AI system quality, safety, and alignment Build and scale evaluation infrastructure that combines human expertise with ML-powered automation Work with cross-functional partners to integrate evaluation systems into production workflows
Minimum Qualifications
Key Qualifications
Preferred Qualifications
Education & Experience
Additional Requirements
Pay & Benefits
...Sundays* 4 day work week* 5 day work week during training* Weekly guarantee of $650 for 8 weeks* Entry Level tools provided * No experience necessary, all Toyota training will be done in house.This person will be responsible for inspecting, diagnosing and repairing...
Class B Entry Level No Experience Needed CDL B Local Driver JobStakebed driving between various construction locations in the Greater Denver Metro... ...03-867-2567 Tell em' Gary's Job Board sent you.This truck driving job may have an alternate application method. Look...
Overview: Ensure that all guests are served to the Golden standard in the Restaurant. Display highest standards of hospitality and welcome are always demonstrated within all food and beverage areas. Responsibilities: The exhibit is conducted in accordance with all...
**Job Description Summary**As Staff Specialist Regulatory Affairs, you will have a deep understanding of Risk Based frameworks, Agile SDLC... ...support recognition of associates' progress, ranging from entry level to experts in their field, and talent mobility. There are...
...potential of humans to do great things. We believe that great food can fuel any lifestyle--whether you choose to participate in a Netflix marathon or an actual marathon. THE ROLE We are in need of a part-time delivery driver to take food from Roots and deliver to...