Remote AI Quality Analyst (Arabic)📣 Job Ad
| Contract Type | Seasonal | |
| Workplace type | Remote | |
| Location | Saudi Arabia |
About the Role
Turing, a leading research accelerator for advanced AI labs and a trusted partner for global enterprises, announces its need for a **Remote AI Quality Analyst** proficient in the Arabic language. This contract role, requiring a commitment of at least 30 hours per week, focuses on evaluating a new personalization feature for Gemini, aiming to ensure the AI model's effectiveness in utilizing user data from past conversations, Gmail, Google Search, and YouTube activity to provide more relevant and helpful responses. The role demands a unique blend of creativity in prompt engineering and analytical rigor in evaluating AI outputs.
Key Tasks and Responsibilities
- Design and execute multi-turn conversational prompts (typically 1-5 turns) that require the AI to use your personal information and experiences.
- Evaluate the model's responses based on your intent from the initial prompt, verifying if personalization has been applied appropriately.
- Analyze responses for "Grounding" issues, ensuring claims about you are supported by evidence and not false inferences or hallucinations.
- Assess the quality of "Integration" to ensure personal data is seamlessly incorporated into the response without robotic "over-narration."
- Accurately rate and compare two model responses side-by-side (SxS) to determine which is more helpful, usable, and enjoyable.
- Write clear and defensible justifications for your comparisons, explicitly referencing where issues or positive aspects occurred in the conversation.
- Extract and verify "Debug Info" from the model to confirm correct usage of chat summaries and data sources.
- Maintain strict data hygiene by deleting evaluation conversations to prevent contamination of your future chat history.
Qualifications and Requirements
- High proficiency in reading and writing Arabic, as Arabic is the focus language for this project.
- Willingness to use your primary personal Google account (not a test account) and enable personal data sources for genuine evaluation.
- Full-time availability within your local time zone is required, as we are building a 24-hour global operations team.
- Demonstrate exceptional analytical thinking and the ability to evaluate nuanced and rich AI responses, with a particular focus on personalization quality.
- Experience in designing creative multi-turn starter prompts based on personal context to thoroughly test model capabilities.
- Strong evaluation skills, including the ability to identify incorrect personalization, weak inferences, and forced connections.
- Meticulous attention to detail, with the ability to review model responses side-by-side (SxS) and detect subtle differences in naturalness and over-narration.
- Superior ability to write clear, concise, and structured justifications for model rankings, explicitly referencing specific turn numbers.
- Ability to provide constructive feedback and detailed commentary.
- Excellent communication and collaboration skills.
- Self-motivated and able to work independently in a remote environment.
- Possess a desktop/laptop computer with a good internet connection.
- Bachelor's degree or equivalent experience in a relevant field (*, Politics, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field).
- Experience in data annotation, AI quality evaluation, content moderation, or a related role is highly preferred.
Core Skills
- Arabic Language Proficiency
- Personal Account Usage
- Schedule Flexibility
- Analytical Thinking
- Creative Prompt Engineering
- Evaluation Capability
- Attention to Detail
- Written Communication
- Feedback Provision
- Communication and Collaboration
- Self-Reliance and Motivation
- Technical Setup (Desktop/Laptop with good internet)
Work Details
This is a **full-time contract** role, requiring a commitment of no less than 30 hours per week, with the option to commit to either 30 or 40 hours per week. The work must include 4 hours of overlap with Pacific Standard Time (PST). The engagement duration is 3 months. Work is remote, with a focus on the Saudi Arabia region.
Requirements
- No experience required
Similar Jobs
You may also like
- Related Remote AI Quality Analyst (Arabic) Opportunities
- Business Development Manager Jobs in Riyadh
- Sales Manager Jobs in Riyadh
- Digital Marketing Specialist Jobs in Riyadh
- Sales Representative Jobs in Riyadh
- Marketing Specialist Jobs in Riyadh
- Other Job Fields in
- Business Development Manager Jobs in Riyadh
- Sales Manager Jobs in Riyadh
- Digital Marketing Specialist Jobs in Riyadh
- Sales Representative Jobs in Riyadh
- Marketing Specialist Jobs in Riyadh
- Executive Assistant Jobs in Riyadh
- Human Resources Specialist Jobs in Riyadh
- Truck Driver Jobs in Riyadh
- Logistics Pricing & Sales Support Coordinator Jobs in Riyadh
- Sales Specialist Jobs in Riyadh
- Explore Jobs Across Saudi Arabia
- Data Collector Jobs in Al Ula
- Car Driver Jobs in Jeddah
- Promoter Jobs in Jeddah
- Human Resources Specialist Jobs in Al Jubail
- Customer Service Representative Jobs in Dammam
- Clothes Seller Jobs in Riyadh
- Car Driver Jobs in Tabuk
- Cleaning and Housekeeping Supervisor Jobs in Medina
- eCommerce Specialist Jobs in Riyadh
- Hotel Specialist Jobs in Medina