Remote AI Quality Analyst (Arabic)📣 Job Ad
| Contract Type | Seasonal | |
| Workplace type | Remote | |
| Location | Saudi Arabia |
About the Role
Turing, a leading research accelerator for cutting-edge AI labs and a trusted partner for global enterprises, announces the need to hire an AI Quality Analyst (Arabic) for remote work. This contract role requires 0-1 years of experience and focuses on evaluating a new personalization feature for Gemini. The analyst will assess how effectively the AI model uses information from past Gemini conversations, Gmail activity, Google Search, and YouTube activity to provide relevant and helpful responses. This position requires a blend of creativity in designing prompts based on personal experiences and analytical rigor in evaluating the outputs of personalized AI.
Key Tasks and Responsibilities
- Design and implement multi-turn conversational prompts (typically 1-5 turns) that require the AI to use personal information and experiences.
- Evaluate model responses based on the intent of the initial prompt, verifying that personalization is applied appropriately.
- Analyze responses for "Grounding" issues, ensuring that claims related to the user are supported by evidence and are not false inferences or hallucinations.
- Assess the quality of integration to ensure personal data is seamlessly incorporated into the response without robotic exaggeration.
- Rigorously evaluate and compare two model responses side-by-side (SxS) to determine which is more helpful, usable, and enjoyable.
- Write clear and defensible justifications for comparisons, explicitly referencing where issues or positive aspects occurred in the conversation.
- Extract and verify "Debug Info" from the model to ensure chat summaries and data sources are used correctly.
- Maintain strict data hygiene by deleting evaluation conversations to prevent contamination of future chat logs.
Qualifications and Requirements
- Ability to read and write Arabic at a high level of proficiency, as Arabic is the focus language for this project.
- Willingness to use a primary personal Google account (not a test account) and enable personal data sources for genuine evaluation.
- Full-time availability within the local time zone.
- Proven ability to evaluate nuanced and ambiguous AI responses, particularly assessing the quality of personalization.
- Experience in designing creative, multi-turn starting prompts based on personal context to thoroughly test model capabilities.
- Understanding of personalization concepts, including the ability to identify incorrect personalization, weak inferences, and forced connections.
- Meticulous attention to detail, with the ability to review model responses side-by-side (SxS) and identify nuances in naturalness and exaggeration.
- Superior ability to write clear, concise, and structured justifications for model ratings, explicitly referencing specific turn numbers.
- Ability to provide constructive feedback and detailed commentary.
- Excellent communication and collaboration skills.
- Self-motivated and able to work independently in a remote work environment.
- Requires a desktop/laptop setup with a good internet connection.
- Bachelor's degree or equivalent experience in a relevant field (*, Politics, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field).
- Strongly preferred experience in data annotation, AI quality evaluation, content moderation, or a related role.
Core Skills
- Native proficiency in Arabic
- Personal account usage
- Schedule flexibility
- Exceptional analytical thinking
- Creative prompt engineering
- Strong evaluation capability
- Meticulous attention to detail
- Excellent writing skills
- Feedback provision
- Communication and collaboration
- Independence
- Technical setup (desktop/laptop with good internet connection)
Additional Role Details
This role is a remote contract with Turing, based in Saudi Arabia. The role requires a full-time commitment, with a minimum of 4 hours per day and 30 hours per week, including 4 hours of overlap with the Pacific Standard Time (PST) zone. 30 or 40 hours per week commitment options are available. The contract duration is 3 months.
Requirements
- No experience required
Similar Jobs
You may also like
- Related Remote AI Quality Analyst (Arabic) Opportunities
- Business Development Manager Jobs in Riyadh
- Sales Manager Jobs in Riyadh
- Digital Marketing Specialist Jobs in Riyadh
- Sales Representative Jobs in Riyadh
- Marketing Specialist Jobs in Riyadh
- Other Job Fields in
- Business Development Manager Jobs in Riyadh
- Sales Manager Jobs in Riyadh
- Digital Marketing Specialist Jobs in Riyadh
- Sales Representative Jobs in Riyadh
- Marketing Specialist Jobs in Riyadh
- Executive Assistant Jobs in Riyadh
- Human Resources Specialist Jobs in Riyadh
- Truck Driver Jobs in Riyadh
- Logistics Pricing & Sales Support Coordinator Jobs in Riyadh
- Sales Specialist Jobs in Riyadh
- Explore Jobs Across Saudi Arabia
- Marketing Manager Jobs in Medina
- Sales Representative Jobs in Buraydah
- Legal Advisor Jobs in Riyadh
- Certified Trainer Jobs in Tabuk
- Warehouse Manager Jobs in Jeddah
- Cashier Jobs in Tabuk
- Maintenance Supervisor Jobs in Dammam
- Copywriter Jobs in Medina
- Graphic Designer Jobs in Riyadh
- Sales Accountant Jobs in Riyadh