Stress-test my research thesis: feasibility from a bioinformatics POV?
Hi r/bioinformatics,
I am exploring a research thesis and would value sharp critique before committing to original data collection. Here is a quick recap of the idea.
The thesis
Oral mycobiome composition - combined with the chemical signals fungi produce - may carry individually-distinct information that correlates with interpersonal recognition, affection, attraction, bonding patterns. Currently unstudied at the fungal layer.
What the literature supports:
- Beghini, Pullman, Christakis et al. (Nature, 2024) - microbiome strain-sharing in 1,787 adults predicts close social relationships better than wealth, religion, or education. Fungi were not measured.
- Cornejo Ulloa, Krom et al. (Frontiers in Endocrinology, 2024)- oral tissue expresses SSH receptors; the authors explicitly name the SSH–oral microbiome interaction as an open research gap. Bennett et al. (MDPI Toxins, 2015) - fungi produce species-specific volatile organic compounds.
- Hadrich et al. (Frontiers in Cellular Neuroscience, 2025) -oral mycobiome dysbiosis linked to serotonin/dopamine pathway disruption.
Where I would love this sub's input:
ITS1 vs ITS2 for oral mycobiome specifically - current state of the art? Resolution trade-offs for typical oral genera (Candida, Cladosporium, Aureobasidium)?
Existing public datasets - HMP fungal subset, oral cohorts - are there any where a within-vs-between-individual variance question on fungal composition could be tested before committing to original collection?
Multi-omic angle - if metabolomics (VOCs) gets layered in later, what's a credible integration strategy with ITS abundance at the individual level? Honest tear-down - what would invalidate this thesis at the data layer before we even talk about behavioral correlates?
I am ready to hear (and cry later)) what you consider unworkable in this thesis. or what could be the cleanest first feasibility test (fail-fast).
Happy to discuss further in DMs.