Remote AI Quality Analyst (Arabic)📣 Job Ad
| Contract Type | Seasonal | |
| Workplace type | Remote | |
| Location | Saudi Arabia |
About the Role
Turing, a leading research accelerator for cutting-edge AI labs and a trusted partner for global enterprises, announces the need for a "Remote AI Quality Analyst (Arabic)" for a contract position in Saudi Arabia. This role focuses on evaluating a new personalization feature for Gemini, with an emphasis on how effectively AI uses user data from past conversations, Gmail, Google Search, and YouTube activity to provide relevant and helpful responses. The position requires a unique blend of creativity in prompt engineering and analytical rigor in evaluating AI outputs.
Key Tasks and Responsibilities
- Design and implement multi-turn conversational prompts (typically 1-5 turns) that require the AI to use your personal information and experiences.
- Evaluate model responses based on your intent from the initial prompt, verifying if personalization has been applied appropriately.
- Analyze responses for "Grounding" issues, ensuring that claims about you are supported by evidence and are not false inferences or hallucinations.
- Assess the quality of "Integration" to ensure personal data is naturally incorporated into the response without robotic "over-narration".
- Accurately rate and rank two model responses side-by-side (SxS) to determine which is more helpful, user-friendly, and generally enjoyable.
- Write clear and defensible justifications for your comparisons, explicitly referencing where issues or positive aspects occurred in the conversation.
- Extract and verify "Debug Info" from the model to confirm that chat summaries and data sources are being used correctly.
- Maintain strict data hygiene by deleting evaluation conversations to prevent them from polluting your future chat history.
Qualifications and Requirements
- Ability to read and write Arabic at a highly proficient level, as Arabic is the pivotal language for this project.
- Willingness to use your primary personal Google account (not a test account) and enable personal data sources for genuine evaluation.
- Full-time availability in your local time zone is required, as the team operates globally 24/7.
- Proven ability to evaluate nuanced and absent AI responses, with a particular assessment of personalization quality.
- Experience in designing creative, multi-turn starting prompts based on personal context to thoroughly test model capabilities.
- Understanding of personalization concepts, including the ability to identify incorrect personalization, weak inferences, and forced connections.
- Meticulous attention to detail, with the ability to review model responses side-by-side (SxS) and detect nuances in naturalness and over-narration.
- Superior ability to write clear, concise, and structured justifications for model ratings, explicitly referencing specific turn numbers.
- Ability to provide constructive feedback and detailed commentary.
- Excellent communication and collaboration skills.
- Self-motivated and able to work independently in a remote work environment.
- Desktop/laptop setup with a good internet connection is required.
- Bachelor's degree or equivalent experience in a relevant field such as Politics, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field.
- Experience in data annotation, AI quality evaluation, content moderation, or a related role is highly preferred.
Required Skills
- Native proficiency in Arabic
- Personal account usage
- Schedule flexibility
- Exceptional analytical thinking
- Creative prompt engineering
- Strong evaluation capability
- Meticulous attention to detail
- Excellent written communication
- Feedback provision
- Communication and collaboration
- Independence
- Technical setup
Contract and Commitment Details
This is a full-time contract position requiring a commitment of at least 30 hours per week. Options for 30 or 40 hours of work per week are available. The contract duration is 3 months. The role requires 4 hours of overlap with Pacific Standard Time (PST).
Requirements
- No experience required
Similar Jobs
You may also like
- Related Remote AI Quality Analyst (Arabic) Opportunities
- Business Development Manager Jobs in Riyadh
- Sales Manager Jobs in Riyadh
- Digital Marketing Specialist Jobs in Riyadh
- Sales Representative Jobs in Riyadh
- Marketing Specialist Jobs in Riyadh
- Other Job Fields in
- Business Development Manager Jobs in Riyadh
- Sales Manager Jobs in Riyadh
- Digital Marketing Specialist Jobs in Riyadh
- Sales Representative Jobs in Riyadh
- Marketing Specialist Jobs in Riyadh
- Executive Assistant Jobs in Riyadh
- Human Resources Specialist Jobs in Riyadh
- Truck Driver Jobs in Riyadh
- Logistics Pricing & Sales Support Coordinator Jobs in Riyadh
- Sales Specialist Jobs in Riyadh
- Explore Jobs Across Saudi Arabia
- Host Jobs in Riyadh
- Irrigation Engineer Jobs in Riyadh
- Administrative Assistant Jobs in Makkah
- Head Chef Jobs in Al-Ahsa
- Cafe Manager Jobs in Al-Ahsa
- Barista Jobs in Abha
- Promoter Jobs in Khamis Mushayt
- Store Keeper Jobs in Yanbu
- Customer Services Manager Jobs in Al Uyainah
- Cashier Jobs in Al Khobar