Turing
About Us:
Based in San Francisco, California, Turing is the world's premier research accelerator for leading AI labs and a trusted partner to global enterprises deploying advanced AI systems. Turing provides extensive support to clients by accelerating breakthrough research through high-quality datasets, cutting-edge training pipelines, and expert AI researchers specializing in software engineering, logical reasoning, STEM, multilingualism, multimodality, and agents. Additionally, we leverage our expertise to help enterprises transition AI from proof of concept into proprietary intelligence, ensuring reliable performance that delivers measurable impact and contributes positively to the bottom line.
Ideal Background:
This role is perfect for engineers with experience in developing production systems at major tech companies like Google, Microsoft, Apple, Amazon, or Meta, as well as similar high-scale engineering organizations. We particularly value candidates from institutions renowned for their computer science programs, such as the University of Washington, University of Illinois Urbana-Champaign, UT Austin, University of Michigan, Purdue, and comparable universities. That said, exceptional skills and experience will always be prioritized over educational background.
Project Overview:
As a Software Engineering evaluator, you will create innovative datasets aimed at training, benchmarking, and advancing large language models in close collaboration with our research teams. Your tasks will include curating code examples, providing precise coding solutions, and correcting code in languages such as Python, C/C++, Rust, Go, Java, and JavaScript (including ReactJS). A key focus will be on systems-level code, performance-critical applications, and infrastructure. You will also assess and enhance AI-generated code for efficiency, scalability, and reliability, collaborating with cross-functional teams to improve enterprise-level AI-driven coding solutions.
What Does a Typical Day Look Like?
Engaging in AI model training initiatives by curating code examples, developing solutions, and correcting code across multiple programming languages.
Evaluating and enhancing AI-generated code with a focus on systems-level correctness, performance, and dependability.
Collaborating with cross-disciplinary teams to improve AI-driven coding solutions against industry performance standards.
Building agents tasked with verifying the quality of systems-level and infrastructure code, while identifying error patterns.
Hypothesizing about the stages in the software engineering lifecycle (such as prototyping, architecture design, API design, production implementation, launch, monitoring, and maintenance) and assessing model capabilities at each stage.
Designing mechanisms that automatically verify solutions to software engineering tasks.
Required Skills:
At least 3 years of software engineering experience.
Strong expertise in systems programming, infrastructure, or backend development using languages like Python, C/C++, Rust, and Go.
Experience in building and deploying scalable, production-grade software with contemporary programming languages and tools.
A deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
Excellent verbal and written communication skills for delivering clear and structured evaluation rationales.
Engagement Details:
Commitment: Flexible engagement, from a minimum of 10 hours up to 40 hours per week.
Type: Contractor (no medical/paid leave).
Duration: 1 month (potential for extensions based on performance and fit).
Location: Candidates must be based in the United States.
Evaluation Process:
The application process takes approximately 15-30 minutes.
As part of the assessment, completing an AI video interview is mandatory.
After applying, you will receive an email with a login link. Please use that link to access the portal and complete your profile.
Know amazing talent? Refer them to Turing and earn rewards from your network.