Zum Hauptinhalt springen

Data Engineer (m/f/d)

Remote, Austria, Croatia, France, Germany, Italy, Poland...
Full-time
Permanent employee

Your team

As a Data Engineer (f/d/m), you will play a key role in our Data Expansion Squad, which is responsible for integrating and operationalizing legal data from multiple jurisdictions. The team transforms heterogeneous source data into a unified, high-quality foundation that powers search, retrieval, and AI-supported workflows across our products. 


Felix, our VP AI & Data Engineering, will guide you through your journey at Noxtua. With deep expertise in AI systems, Felix leads with a passion for innovation and a collaborative approach, ensuring every team member thrives. 


You will work closely with AI, engineering, and legal domain experts to adapt and extend existing data workflows for new customer datasets and source formats. Your work will focus on understanding source structures, defining robust mappings, standardizing and enriching content, and ensuring that data is integrated in a way that is reliable, scalable, and easy to use in downstream systems. Our Tech Team of around 32 people, including UI Engineers, UI Designers, AI Engineers, Data Engineers, as well as Fullstack, Backend, and DevOps Engineers. Within that team, the Data Expansion Team provides the data foundation, structure, and metadata needed for our agent-based systems to retrieve relevant legal information efficiently and reliably across jurisdictions. 

About you

  • Experience: at least 2 years of professional experience in data engineering, and being involved in successfully deployed projects
  • Programming: Strong Python skills with experience in designing robust data pipelines
  • Technical Expertise: Experience in building and maintaining reliable ET and RAG pipelines and a solid understanding of data modeling, quality, filtering, validation, and consistency
  • Infrastructure: Familiarity with containerization (Docker), CI/CD pipelines, and version control (Git)
  • Fundamentals: Strong grasp of data structures, algorithms, system design principles, and software engineering best practices
  • Expertise in working with graph databases and familiarity with developing and deploying NLP models is a bonus
  • Language: English proficiency at the C2 level 

Your responsibilities

  • Design, build, and optimize end-to-end ETL pipelines for legal data from multiple jurisdictions, including cleaning, transformation, chunking, validation, embedding, and ingestion into vector databases
  • Work extensively with XML-based legal data feeds: parse, validate, normalize, and transform XML structures into scalable internal schemas and unified document formats
  • Develop and maintain data models and storage schemas that support continuously updated datasets while ensuring consistency, scalability, and accuracy across diverse datasets and large amounts of data
  • Coordinate data handover and integration from multiple internal and external data providers, including official sources, APIs, and web scraping pipelines, ensuring reliable and timely updates
  • Implement and continuously refine metadata enrichment strategies to maximize searchability, ranking quality, and relevance of legal information in vector databases.
  • Build and maintain a high-performance search and retrieval infrastructure enabling agent-based systems to call search functions and retrieve the most relevant legal information efficiently
  • Collaborate with product, AI, and legal domain experts to deliver high-quality, reliable data solutions
  • Own the data integration of one jurisdiction end-to-end

Our offer to you

Building Europe's sovereign Legal AI is ambitious, meaningful work — and we want the people doing it to be properly looked after. Our benefits are built around flexibility, real-time off to recharge, and the setup to do your best work from wherever you are.
  • Remote: 100% remote work possible (given a German residence), other countries upon request
  • Working hours: Flexible working hours
  • Vacation: 26 days + December 24th & 31st off, + 1 additional vacation day per year of employment (up to 30 days)
  • Discounts: e.g., Urban Sports Club Membership, depending on location
  • Equipment: Laptop (Lenovo or Mac), plus €1,000 net home office setup budget (paid with your first salary)


Mid-Level Salary Ranges gross per year:

  • Germany: €58k – €72k EUR
  • France: €48k – €59k EUR
  • Switzerland: 77k – 96k CHF (≈ €75k – €93k EUR)
  • Sweden: 667k – 828k SEK (≈ €58k – €72k EUR)
  • Austria: €57k – €71k EUR
  • Italy: €46k – €58k EUR
  • Poland: 168k – 209k PLN (≈ €39k – €48k EUR)
  • Croatia: €35k – €43k EUR
  • Slovakia: €34k – €42k EUR

The actual offer within the range depends on your relevant experience, demonstrated impact, and interview outcome — not on negotiation skill or gender. In line with the EU Pay Transparency Directive, we are happy to discuss the criteria for placement within the range during the interview process.

For other countries you will get the information within your invitation to a potential interview.


Über uns

Noxtua ist Europas souveräne Rechts-KI. Die juristisch kompetente KI deckt die Bandbreite juristischer Textarbeit ab – von der Informationsbeschaffung (“Research) über die Analyse komplexer Sachverhalte (“Understanding”) bis zur Dokumentenerstellung (“Drafting). Dabei erfüllt die rechtskonforme KI die deutschen berufs-, straf- und datenschutzrechtlichen Anforderungen für Anwält*innen (§ 203 Strafgesetzbuch, § 43e Bundesrechtsanwaltsordnung) und ist zertifiziert nach BSI C5, TISAX, ISO 27001, 9001, 27018, 27017 und 42001. In exklusiven Partnerschaften mit führenden europäischen Rechtsverlagen aus Deutschland, Österreich, Schweiz, Polen, Tschechien und der Slowakei entwickelt das Tech-Unternehmen Noxtua die Legal AI Workspaces MANZ-Noxtua, Swiss-Noxtua, Beck-Noxtua Polen, Beck-Noxtua Tschechien und Beck-Noxtua Slowakei.


Im Jahr 2017 aus einem Forschungsprojekt von Dr. Leif-Nissen Lundbæk und Professor Dr. Michael Huth an der Oxford University und dem Imperial College London in der deutschen Hauptstadt gegründet, hat das Legal-Tech-Unternehmen mit langjähriger Erfahrung in der Entwicklung DSGVO-konformer KI-Lösungen mittlerweile Standorte in Paris, Berlin, Zagreb und München. Als strategische Partner investierten u.a. Deutschlands führender juristischer Fachverlag C.H.BECK sowie die führenden Kanzleien CMS und Dentons in der Series B rund 81 Millionen EURO in das europäische Scaleup.

Wir ermutigen ausdrücklich Frauen zur Bewerbung, da sie derzeit unterrepräsentiert sind. Unser Ziel ist es, ein vielfältiges und inklusives Arbeitsumfeld zu schaffen, das unterschiedliche Perspektiven wertschätzt. Selbstverständlich freuen wir uns über Bewerbungen von allen qualifizierten Personen – unabhängig von Geschlecht, ethnischer Herkunft, Religion, Behinderung, Alter oder sexueller Identität.