End-to-End Bioinformatics
by Ahmed Mokhtar
Cancer genomics, single-cell and spatial transcriptomics, and reproducible NGS pipelines — built in R and Python. MSc candidate at Cairo University and a freelance bioinformatics instructor with six-plus years training researchers across the field.
- Experience
- 7+ years
- Mentees
- 1000+
- Followers
- 1.6k+
Translating omics data into decisions
Bioinformatics scientist with a biotechnology background, focused on cancer genomics and transcriptomics. MSc candidate at Cairo University, postgraduate-diploma trained at Suez Canal University. Six-plus years teaching bioinformatics — from NGS foundations to single-cell and spatial transcriptomics — and shipping reproducible analysis pipelines for independent research.
A multi-disciplinary trajectory
From wet-lab biotechnology to genomics, statistics, and business — a deliberate stack built across six programs.
- milestone · 01AAMBFSBusiness Strategy
- milestone · 02in progress
MSc Candidate · Genetics & Bioinformatics
Faculty of Agriculture · Cairo UniversityGenetics Department - milestone · 03in progress
MicroMasters · Statistics & Data Science
MITxComputational Statistics - milestone · 04
Postgraduate Diploma · Bioinformatics
Suez Canal UniversitySystems Biology · NGS - milestone · 05
Diploma · Data Analysis
Arab Academy for Science, Technology & Maritime TransportAnalytics - milestone · 06
Diploma · Data Science
Epsilon InstituteML · Python
Production-grade bioinformatics, end to end
From experimental design to scalable cloud delivery — pick a service or compose a full omics program.
RNA-Seq Analysis
Bulk transcriptomics with DESeq2/edgeR, QC, alignment and pathway enrichment.
Single-Cell Sequencing
10x Chromium, Seurat & Scanpy workflows, integration and trajectory analysis.
Spatial Transcriptomics
Visium / Xenium / MERFISH analysis, niche detection, cell–cell interaction.
Comparative Genomics
Orthology, synteny, evolutionary models across species and strains.
Metagenomics Pipelines
Shotgun & 16S microbiome analysis with taxonomic and functional profiling.
GWAS & Population Genetics
QC, imputation, association testing and polygenic risk scoring.
NeuroBioinformatics
Neuronal vulnerability mapping, multi-omics integration and single-cell brain atlases for neurodegeneration research.
Bioinformatics DevOps
Reproducible Nextflow / Snakemake stacks with CI/CD and containers.
Scientific Consulting
Study design, statistical strategy and reviewer-ready interpretation.
Custom Pipeline Development
End-to-end omics pipelines tailored to your data and infrastructure.
Cloud Deployment for Omics
AWS / GCP / Azure HPC, batch, S3, Terraform and cost optimization.
Training & Workshops
Hands-on workshops on NGS, scRNA-seq, R/Python and reproducibility.
Selected research & pipelines
From problem to result
Three deep-dives across plant genomics, cancer single-cell biology and immuno-oncology — each shown as the actual problem, the method, and the measurable outcome.
Dual-branch autoencoder for Brassica napus trait prediction
- Problem
- Allotetraploid canola hides subgenome-specific signal that flat GBLUP and standard GWAS routinely miss, capping genomic prediction accuracy.
- Method
- Trained a 1.07M-parameter dual-branch deep autoencoder on 1,007 globally diverse accessions, enforcing A/C subgenome separation by architecture. Latent fusion + gradient-based feature importance recover causal loci without phenotype supervision.
- Result
- Recovered BnaA09FLC2 and BnaC05FAE1 unsupervised. Delivered 1.89× QTL enrichment, 4.61× GWAS concordance, and up to +90.8% genomic-prediction gain over GRM-GBLUP across five agronomic traits.
Peer-reviewed research
Original research in comparative genomics and cancer immunogenomics — from population-scale SNP architecture to TCR-β repertoire dynamics under checkpoint blockade.
- paper01AcceptedJournal of Scientific Research in Science
Comparative Genomics Analysis of Some Brassica napus Accessions
Comparative GenomicsSNPBrassica napus - paper02Under reviewJournal of Scientific Research in Science
Multi-Dimensional TCR-β Repertoire Characterization in Anti-PD-1-Treated Non-Small Cell Lung Cancer: Development and Multi-Cohort Validation of the Composite Immune Reprogramming Score (CIRS)
TCR-βNSCLCImmuno-oncologyAnti-PD-1Biomarker
Learn bioinformatics, production-style
Two formats — short Courses (8–9 weeks) for focused topics, and long-term Schools (4–5 months) for end-to-end mastery. All delivered through BioScience.
Schools · long-term, 4–5 months
NGS & Single-Cell RNA-Seq Data Analysis School
Bioinformatics for CRISPR-Cas Systems School
Cancer Informatics School
Metagenomics & Metatranscriptomics Data Analysis School
Bulk TCR Repertoire & Single-Cell TCR Data Analysis School
Machine Learning for Single-Cell RNA-Seq School
Whole Genome Seq Data Analysis School
Whole Exome Seq Data Analysis School
Neuroscience Multi-Omics Data Analysis School
Courses · focused, 8–9 weeks
Biostatistics Using R Language
Data Analysis Using R Language
RNA-Seq Analysis from Scratch
Single-Cell RNA-Seq with Scanpy & Seurat
Cancer Bioinformatics in Practice
Metagenomics: From Reads to Insights
Structural Variants & Long-Reads
Scientific writing
Digital Twins in Biology: Building Virtual Humans with Bioinformatics
Virtual computational replicas of organs, tissues and entire patients — how multi-omics, AI and systems biology are building biological digital twins for precision medicine.
AI-Powered Multi-Omics Integration: The New Era of Precision Medicine
How AI is unifying genomics, transcriptomics, proteomics, spatial and single-cell layers into clinically actionable molecular profiles — and what it means for the future of personalized medicine.
Spatial Transcriptomics: Mapping Biology in 3D
Preserving the spatial coordinates of gene expression is reshaping cancer research, neuroscience and clinical diagnostics — and AI is unlocking 3D molecular atlases at organ scale.
CRISPR Bioinformatics and AI: The Computational Revolution Behind Gene Editing
Guide RNA design, off-target prediction, single-cell CRISPR screens, base/prime editing and generative Cas engineering — how AI and bioinformatics power modern gene editing.
Large Language Models in Bioinformatics: Beyond Chatbots
LLMs are learning the language of biology — DNA, proteins, pathways and literature — powering protein design, omics interpretation and generative biology.
Designing a production scRNA-seq pipeline with Nextflow
How to architect a reproducible single-cell pipeline that scales from a laptop to AWS Batch without rewriting code.
Foundation models in biology: what actually works
A pragmatic look at scGPT, Evo, Geneformer and ESM-3, and when classical methods still win.
Cancer multi-omics integration with MOFA2 in practice
Step-by-step walkthrough integrating expression, methylation and mutations into latent factors with biological interpretation.
What collaborators & mentees say
1000+ mentees and researchers trained across Egypt, Tunisia, Nigeria, the UAE, Italy, Germany and more.
“Ahmed rebuilt our entire RNA-seq stack on AWS in three weeks. Reproducible, traceable, and 40% cheaper than what we had.”
“Working with Ahmed on our spatial transcriptomics atlas elevated the entire paper. Reviewers specifically praised the methods.”
“His scRNA-seq course is the clearest, most production-minded I've taken. My PhD timeline literally shifted because of it.”
“You may not remember me — 3 years ago I reached out about getting into bioinformatics and did a one-to-one mentoring session with you. You built me a clear roadmap, I followed every step, and alhamdulillah I'm now doing my PhD in Germany. I got accepted for the PhD scholarship because of the advice and the path you laid out. Thank you, truly.”
“I've taken similar courses in Germany on R and Bash, and after every one I felt the material was too hard to digest. Honestly, with Ahmed it became much easier — step by step, gradually, and his teaching style is excellent. Mashallah.”
“I'm taking the Immunogenetics & Informatics course with Dr. Ahmed Mokhtar — the immunology content, the data work, and the Bash scripting are all very good. The lectures are advanced, he's honest with the information, and he follows up step-by-step on every project we build.”
“مع حضرتك د. جاكلين، الصراحة كنت عايزة أشكر جداً بشمهندس أحمد مختار، والصراحة هو شاطر جداً وممتاز، وكان من أفضل المواقع اللي اشتغلت مع بيوساينس فيها.”
“The mentoring session was so beneficial. Your clear communication, personalized advice, and industry insights were invaluable. The course recommendations and practical tips have put me on the right path.”
“Thank you Dr. Ahmed for the insightful session. Your detailed explanations significantly clarified my understanding and dispelled any confusion I had. I truly appreciate the time and effort you dedicated.”
“Thank you very much for sparing time to share an insight towards advancing my career in the field of biomedical science.”
“The mentoring session was really good. I learned a lot about the global bioinformatics market and what is most in demand. I'll work with your advice and share the results.”
Frequently asked questions
Everything you might want to know before reaching out.
Cancer genomics, single-cell and spatial transcriptomics, multi-omics integration, and reproducible NGS pipelines. I work with academic labs, biotech startups, and clinical teams.
Yes — I'm a freelance bioinformatics instructor with 6+ years of teaching experience. Workshops, one-on-one mentoring, and structured cohort-based courses.
R and Python primarily, with Bioconductor, Seurat, Scanpy, Nextflow, Snakemake, Docker, and AWS/HPC for orchestration and scale.
Most projects start with a free 30-minute consultation, followed by a scoped proposal. I support both fixed-scope deliverables and ongoing retainers.
Cairo, Egypt — working with collaborators worldwide. Async-first, with overlap for live working sessions.
Let's build something measurable
Bring your dataset, study, or platform idea. Initial scoping calls are free and end with a concrete plan.