AI & Data Engineer

Remote · USA · Full-time

Location: Remote-first
Job type: Full-time

About the job:

Can you imagine a world where business and digital solutions are truly seamless and users help companies co-create them? Do you want to help us shape this human-centred world? Welcome to UNGUESS.

UNGUESS is the crowdsourcing platform for effective testing and real insights that enable tech, digital and business leaders to make smarter decisions, faster. How? By unleashing the power of the crowd: a community of highly engaged people all over the world who allow us to bring end-customer insights into the design, development, and testing phases of a product.

Why work at UNGUESS:

At UNGUESS, you’ll have the chance to make an immediate impact in a fast-paced and dynamic environment. We’re growing rapidly and strengthening our market position. Joining us now means stepping into an exciting challenge: one that won’t always be easy, but will undoubtedly be among the most rewarding and fulfilling experiences of your career. You’ll constantly learn, grow, and apply your full skill set across diverse and stimulating projects.

This is not a traditional data engineering position. Around 60–70% of your work will focus on GenAI, RAG systems, vector search, and natural language understanding (NLU). The remainder will cover classic data engineering responsibilities such as ETL pipelines and data modeling. You won’t just maintain existing systems: you’ll be the first building block of something new, laying the foundations for a knowledge base that transforms raw testing data into intelligent, queryable insights.

Your mission:

We are a rapidly growing tech company with the ambition of building an LLM-queryable Knowledge Base from existing but currently unstructured data sources. We do not yet have a dedicated data team, so as our first dedicated Data Engineer you will be the architect of the infrastructure that makes this vision possible. You’ll have full ownership of the design, implementation, and scalability of our data stack, working closely with the product and development teams.

Responsibilities:

  • Design and implement data ingestion and normalization pipelines from heterogeneous sources (APIs, files, databases, streams).

  • Build a data lake on AWS (S3, Glue, Athena) and orchestrate data flows using CDK.

  • Implement RAG (Retrieval-Augmented Generation) systems using vector databases, LLMs (Bedrock, OpenAI), and frameworks such as LangChain.

  • Model metadata and define chunking strategies for NLU-queryable documents.

  • Ensure data security, governance, monitoring, and cost optimization.

  • Collaborate with the Product team to integrate the knowledge base into the existing platform.

Requirements:

  • GenAI & Vector Search: Hands-on experience with RAG systems in production, embedding models (OpenAI, Cohere, Amazon Titan), and vector databases (OpenSearch, Pinecone, pgvector).

  • Strong grasp of chunking strategies and retrieval optimization (precision, recall, reranking).

  • Proven expertise with AWS CDK, data services (S3, Glue, Athena, Lambda, Step Functions), and ML/AI workloads (Bedrock, SageMaker). Solid understanding of IAM, KMS, and VPC for security and compliance.

  • A builder's mindset and enjoyment of designing robust, scalable solutions.

Nice to have:

  • Hands-on experience with serverless architectures and cost-optimized scaling strategies.

  • Experience in cloud-native environments and CI/CD (AWS).

  • Familiarity with monitoring and alerting (CloudWatch, X-Ray).

We set high expectations, but we also offer great rewards:

  • Compensation: €45,000 to €50,000/year gross salary plus a competitive MBO bonus. This range is a guideline: we’re first and foremost looking for the right person, and the final offer will be shaped around you and reflect your skills and experience.

  • A remote-first setup for remote work lovers

  • Fast-track growth opportunities

  • Access to group and personal training programs

Please note that this job advertisement is open to applicants of all genders, in accordance with Laws 903/77 and 125/91.
