Study

ROADIES

  •   0%
  •  0     0     0

  • How did ROADIES perform compared to BUSCO-based pipelines?
    It matched or exceeded their accuracy with less manual setup.
  • Which tool performs multiple sequence alignment in ROADIES?
    PASTA
  • Which tool does ROADIES use for pairwise alignments of loci?
    LASTZ
  • Which software is used to infer gene trees for each locus?
    RAxML-NG
  • What criterion determines when ROADIES stops adding loci?
    When tree topology convergence stabilizes (≥95% stability)
  • What type of data does ROADIES not require?
    Annotated genes or reference genomes
  • What is the first step in the ROADIES pipeline?
    Randomly sampling short loci (around 500 bp) across genomes
  • What kind of tree-building model does ROADIES use?
    The coalescent model (via ASTRAL-Pro3)
  • What is “orthology inference”?
    The process of identifying genes derived from a common ancestor across species.
  • Which dataset did ROADIES find most challenging to resolve?
    Birds (Neoaves)
  • What is the advantage of using random loci instead of predefined orthologs?
    Reduces reference bias and works with incomplete or unannotated genomes
  • What does ROADIES stand for?
    Reference-free, Orthology-free, Annotation-free, Discordance-aware Estimation of Species Trees
  • What does it mean that ROADIES is “reference-free”?
    It doesn’t require alignment or mapping to a reference genome.
  • What metric does ROADIES use to assess branch support?
    localPP (local posterior probability)
  • What was ROADIES’ performance compared to the Zoonomia (mammal) tree?
    Comparable accuracy to expert-curated reference trees.
  • Why does ROADIES use multicopy instead of single-copy genes?
    To avoid the need for orthology inference and to better handle gene duplication
  • What tool combines gene trees into a species tree?
    ASTRAL-Pro3
  • Why is ROADIES considered “democratizing” phylogenetics?
    It allows non-experts to infer species trees without specialized bioinformatics knowledge.
  • What makes ROADIES different from earlier alignment-free methods?
    It’s discordance-aware and coalescent-based, not purely distance-based.
  • What is “localPP”?
    A statistical measure of support for internal branches in a tree.
  • What kind of loci does ROADIES select for tree building?
    Randomly sampled, short genomic regions (not predefined genes)
  • What type of data does ROADIES use as input?
    Assembled genomes
  • What scripting or workflow engine automates ROADIES?
    Snakemake
  • What is “incomplete lineage sorting”?
    When gene trees differ from the species tree due to ancestral polymorphism.
  • What aspect of ROADIES helps it handle polyploidy well?
    Using multicopy gene trees and ASTRAL-Pro’s paralogy-aware model.
  • What does the term “discordance-aware” refer to in ROADIES?
    Accounting for gene tree–species tree discordance (ILS or duplication)
  • What biological process likely explains difficulty with bird phylogeny?
    Incomplete lineage sorting (ILS) and rapid radiation
  • How does ROADIES evaluate convergence between runs?
    By comparing tree topologies across iterations
  • What is a good normRF?
    The closer to 0 the better.
  • What is the main goal of the ROADIES pipeline?
    To automate and simplify species tree inference directly from genome assemblies.
  • Which pipeline did ROADIES outperform that is alignment-free?
    MashTree
  • What kind of gene trees does ROADIES build — single-copy or multicopy?
    Multicopy gene trees
  • What are the three runtime modes available in ROADIES?
    Accurate, Balanced, and Fast
  • What are “multicopy genes”?
    Genes with multiple copies (paralogs) within or across species.
  • What does “Balanced Mode” aim to optimize?
    A compromise between runtime and accuracy.
  • What is the primary trade-off between the Fast and Accurate modes in ROADIES?
    Fast = less loci, lower confidence; Accurate = slower, higher support.
  • What’s the difference between concatenation-based and coalescent-based approaches?
    Concatenation merges all loci into one supermatrix; coalescent models gene tree variation explicitly.
  • On which dataset did ROADIES produce a perfect match with the reference tree?
    Polyploid Bamboos
  • What is the final output of the ROADIES pipeline?
    An estimated species tree with branch support scores (localPP)
  • What were some of the datasets used to test ROADIES?
    Mammals, Birds, Flies, Yeast, and Bamboos
  • How does ROADIES make large-scale species tree inference more accessible?
    It minimizes data prep requirements and computational complexity for users.