Krisztián Papp, PhD — Bioinformatics Consultant

Global Reach

Consulting Experience Worldwide

16+ companies and 55+ completed projects across biotech and life science sectors internationally.

World map showing consulting project locations across multiple continents

Work

Portfolio

Reproducible bioinformatics and data science — from raw data to final results, with full transparency at every step.

Reproducible Data Analysis

Dockerized environments, AWS cloud computing, and Quarto notebooks for fully transparent, verifiable workflows.

Learn more →

NGS Data Analysis

End-to-end RNA-seq and high-throughput sequencing — QC, statistical analysis, publication-ready figures.

Learn more →

Automated Workflows

Snakemake pipelines, automated reporting, and client-ready outputs with custom branding.

Learn more →

Interactive Visualization

R Shiny dashboards with PostgreSQL integration for real-time, client-facing data exploration.

Learn more →

Machine Learning

Regression and classification models with robust validation strategies and interpretable outputs.

Learn more →

Statistical Analysis

Hypothesis testing, inferential statistics, and high-quality publication-ready visualizations in R.

Learn more →

01 — Reproducible Analysis

Reproducible Data Analysis

Transforming raw data into reliable results is a journey where the process matters as much as the outcome. My analyses are fully reproducible, with every step clearly documented. I use dockerized environments to ensure consistent execution, cloud computing (e.g. AWS) for scalable and secure computation, and Quarto R notebooks to combine code, results, and documentation into a single HTML report.

02 — NGS

Next-Generation Sequencing

I provide end-to-end analysis of NGS data, transforming raw sequencing outputs into biologically meaningful results. My work covers RNA-seq and other high-throughput sequencing data — data processing, quality control, statistical analysis, and clear, publication-ready visualizations. I emphasize reproducible workflows and transparent methodology throughout.

03 — Automation

Automated Workflow Management

I design automated, reproducible workflows using Snakemake, ensuring scalability, traceability, and reproducible execution across local, server, and HPC environments. I also build fully automated reporting pipelines that generate client-ready reports in HTML or PDF format, customized with company branding and consistent visual identity.

04 — Visualization

Interactive Data Visualization

I develop interactive visualization solutions using R Shiny — custom web applications that allow dynamic interaction through responsive dashboards, plots, and tables. These applications can connect directly to PostgreSQL databases, enabling real-time data access without manual data exports.

05 — ML

Machine Learning

I apply machine learning techniques to model complex relationships, make predictions, and extract actionable insights. My work focuses on regression and classification approaches within reproducible, well-documented workflows. I identify key drivers and patterns in data and assess model performance using robust validation strategies.

06 — Statistics

Statistical Analysis & Visualization

Rigorous statistical analysis using R — exploratory data analysis, hypothesis testing, descriptive and inferential statistics, and high-quality, publication-ready visualizations. I place strong emphasis on methodological soundness, reproducibility, and clarity, ensuring analyses are reliable and accessible to diverse audiences.

Background

About Me

I am an independent bioinformatics and data science consultant with a strong foundation in molecular biology and more than 10 years of wet-lab experience. This dual background allows me to understand biological questions, experimental design, and data limitations from a practitioner's perspective.

I have contributed to 40+ peer-reviewed publications in leading journals (Nature Medicine, Science, Nature Communications, Cancers, and Scientific Reports), and am a co-inventor on two patents, reflecting long-term involvement in scientific research and innovation.

I work closely with biotech and life science teams, offering direct communication, flexibility, and end-to-end involvement — from project scoping and data analysis to visualization, reporting, and knowledge transfer.

For a full list of publications, see my ORCID or Google Scholar profile.

ORCID GitHub LinkedIn Google Scholar

Publications & Patents

40+

Publications

2

Patents

Core Technologies

R / Bioconductor Snakemake Docker AWS Quarto R Shiny PostgreSQL RNA-seq scRNA-seq Machine Learning HPC

Domain Background

Immunology Molecular Biology 10+ yrs wet-lab Biotech consulting

Trusted By

Some Companies I've Worked With

Partnering with biotech and life science teams across Europe and beyond.

Testimonials

What Clients Say

Whenever we need customized solutions, from analysis of measurement data through fine tuning of R scripts to visualizations, we keep turning and returning to Krisztián.

JP

József Prechl

Director of R&D · Diagnosticum Zrt.

Working with Krisztian on our NGS and sequencing data at Treos Bio was a genuinely great experience, he combines exceptional speed and flexibility with a thoughtful, constructive approach. His ability to turn complex data into clear, actionable insights made a real difference for our team and kept projects moving forward with confidence.

LM

Levente Molnar

Head of Bioinformatics · Treos Bio

Krisztián provided excellent support in the analysis of clinical study data, enabling us to go beyond conventional statistical approaches and incorporate machine learning methods into our analytical workflows. His domain expertise and clear, accessible explanations made the collaboration highly efficient and effective.

BH

Balázs Hallgas

CEO & Strategic Lead · Navolab Diagnostics

Get in Touch

Available for
new projects

Reach out to discuss your data challenges.

Send an Email

Email

k.papp@pappbioinf.com

ID

ORCID

0000-0003-0619-8233

GitHub

github.com/pkrisz5

linkedin.com/in/krisztian-papp

Krisztián Papp

Consulting Experience Worldwide

Portfolio

Reproducible Data Analysis

NGS Data Analysis

Automated Workflows

Interactive Visualization

Machine Learning

Statistical Analysis

Reproducible Data Analysis

Next-Generation Sequencing

Automated Workflow Management

Interactive Data Visualization

Machine Learning

Statistical Analysis & Visualization

About Me

Some Companies I've Worked With

What Clients Say

Available fornew projects

Available for
new projects