Mindaugas Vaitekūnas

Senior Data Scientist

I use predictive analytics, data engineering and software development to do all kinds of funky stuff. I enjoy building things all the way from an abstract idea to something you can touch (inasmuch as you can touch digital products).

Currently working as a Senior Data Scientist at a consulting firm in Nuremberg, Germany.

Profile

I'm a Full-Stack Data Scientist & Engineer with 10+ years of experience building scalable, ML-powered products and services. I have experience in the banking, insurance and consulting industries as well as the public sector.

Currently, I am interested in the following senior roles/positions:

  • Data Scientist
  • Machine Learning Engineer
  • AI Engineer
  • Software Engineer (backend/full stack)

I am available for onsite/hybrid positions in Nuremberg, Germany or remote positions anywhere in Germany.

Core Skills

Programming languages:
Python SQL JavaScript Java Go
Frameworks:
PyTorch transformers pandas FastAPI Flask Spring Boot Vue.js
Databases:
Postgres SQLite SQL Server MongoDB
LLM:
ollama llama.cpp RAG
DevOps:
Git Linux Docker Kubernetes Jenkins GitHub Actions
Languages:
🇬🇧 English 🇩🇪 German 🇱🇹 Lithuanian

Work Experience

since 2021
(4 years)
Senior Data Scientist Atos (consulting / public sector)

A consulting position at a large European IT consulting house. Mostly NLP-related use cases, as well as general software development (web services, APIs) and refactoring of legacy systems.

Main tasks:

  • Implementation of NLP use cases (end-to-end)
  • Development of web services (frontend + backend)
  • Improvement of data engineering practices
  • Integration of LLMs into business processes

Python SQL Java JavaScript Vue.js
2017 - 2021
(4 years)
Data Scientist NÜRNBERGER Versicherung (insurance)

Core member of the newly created analytics competence center. Focus on implementing ML-powered use cases across many domains of the insurance business (claim recognition, product recommendation, fraud detection, churn analysis, etc.).

Main tasks:

  • Implementation of insurance-related Data Science use cases (image and text classification, product recommendation, fraud detection)
  • Mentoring of juniors / interns
  • In-house consulting

Python R SQL SAS Java JavaScript Vue.js
2016 - 2017
(1 year)
Data Scientist HalcyonFlux (SaaS)

Working on an integrated Data Science platform (SaaS) for clients in the banking and insurance sectors.

Main tasks:

  • Development of licence management software for SaaS applications
  • Development of a notebook-based Data Science platform

Go SQL JavaScript Vue.js
2012 - 2016
(4 years)
Senior Consultant Risk Research (consulting / banking)

Credit risk management (estimation of the probability and loss of a credit default). Started as an intern, was then hired as a consultant and later promoted to senior consultant.

Main tasks:

  • Development of simulation-based credit portfolio models
  • Implementation of credit-risk-related use cases (time-series and cross-sectional prediction models)
  • Development of desktop and web-based applications for credit risk simulations and model validation

Python R SQL SAS VBA PHP

Education

2021 - 2025
Computer Science, BSc. Hagen University

Specialization: Natural Language Processing (NLP).

Thesis: "Data analysis and visualization: Investigation of customer reviews with classic Data Science techniques".

2010 - 2013
Economics, MSc. Regensburg University

Specialization: Econometrics (Statistics / Machine Learning applied to economic use cases).

Thesis: "Model Selection for Prediction when the Sample Size is small and the Data-generating Process possibly complex".

2005 - 2009
Economics, BSc. Vilnius University

Specialization: Econometrics and Economic Analysis.

Thesis: "Evaluation of Lithuania’s export capabilities".

Certification

2024
Certified Kubernetes Application Developer (CKAD) CNCF
Credential ID: LF-zbba29fv87
2022
Certified Professional for Software Architecture - Foundation (CPSA-F) iSQI
Credential ID: 2201-CPSAFL-217493-DE

Selected projects

2025
Analytical framework and application for building modular Data Science pipelines

A framework for defining Data Science workflows and combining them into analytical pipelines, with a primary application focus on online customer reviews.

The main goal is to enable users to define reusable workflows that can be applied in various scenarios, such as a text preprocessing workflow or a binary classification workflow.

The application also integrates Large Language Models (LLMs) as a proof of concept for zero-shot sentiment classification.

Python scikit-learn llama.cpp FastAPI SQLAlchemy Vue.js
2024
Medical Named Entity Recognition based Semantic Knowledge Graph Extractor

A web application for extracting Semantic Networks (Knowledge Graphs) from medical journals. It leverages the PubMed API to retrieve relevant articles, which are then analyzed by a BERT-based Named Entity Recognition (NER) model. The model is capable of extracting medical entities such as diseases, treatments, and medical technologies.

To reduce noise, each extracted entity is evaluated using Count Vectorization or TF-IDF, allowing for more accurate filtering.

An interactive network visualization, combined with entity and article filters, enables users to visually explore and refine the extracted information.

Python PyTorch transformers FastAPI SQLAlchemy Plotly.js Vue.js