📝 Input Sequences
Maximum input size: 15 MB
📋 Paste sequences in FASTA format. Sequences can include headers (>name) or be headerless. Minimum length: ~200 nucleotides for reliable analysis.
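The input rules above can be checked before submitting. The snippet below is a minimal sketch, not the tool's own parser: `parse_fasta`, `too_short`, and the default name `sequence_N` are hypothetical names chosen for illustration, and the 200-nucleotide cutoff is taken from the approximate guideline above.

```python
def parse_fasta(text):
    """Parse pasted FASTA text into (name, sequence) pairs.

    Headerless input is accepted and given a placeholder name
    (an assumption for this example, not the tool's behavior).
    """
    records = []
    name, chunks = None, []

    def flush():
        # Store the record collected so far, if there is one.
        if name is not None or chunks:
            label = name or f"sequence_{len(records) + 1}"
            records.append((label, "".join(chunks)))

    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            flush()
            name, chunks = (line[1:].strip() or None), []
        else:
            chunks.append(line)
    flush()
    return records


def too_short(records, min_length=200):
    """Return the names of sequences below the suggested minimum length."""
    return [name for name, seq in records if len(seq) < min_length]
```

For example, `too_short(parse_fasta(pasted_text))` lists the sequences that are likely too short for a reliable result.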
💡 How BBSketch Works
BBSketch builds a compact "sketch" of each sequence from its k-mers, then compares that sketch against reference databases. Results include ANI (Average Nucleotide Identity) scores, completeness estimates, and taxonomic assignments.
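To make the sketching idea concrete, here is a simplified illustration of a generic bottom-k MinHash sketch in Python. It is not BBSketch's actual implementation: the hash function, the default `k` and `size` values, and the function names are all assumptions made for this example, and it ignores reverse complements.

```python
import hashlib


def kmer_hashes(seq, k=31):
    """Hash every k-mer in the sequence to a 64-bit integer."""
    seq = seq.upper()
    hashes = set()
    for i in range(len(seq) - k + 1):
        digest = hashlib.blake2b(seq[i:i + k].encode(), digest_size=8).digest()
        hashes.add(int.from_bytes(digest, "big"))
    return hashes


def sketch(seq, k=31, size=1000):
    """Keep only the `size` smallest k-mer hashes as the sketch."""
    return set(sorted(kmer_hashes(seq, k))[:size])


def estimate_jaccard(sketch_a, sketch_b, size=1000):
    """Estimate k-mer set similarity from two bottom-k sketches."""
    merged = sorted(sketch_a | sketch_b)[:size]
    if not merged:
        return 0.0
    shared = sum(1 for h in merged if h in sketch_a and h in sketch_b)
    return shared / len(merged)
```

Because only the smallest hashes are kept, two sketches can be compared quickly no matter how long the original sequences are. A higher shared fraction indicates more closely related sequences.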
⚙️ Analysis Parameters
🧬 Select the type of sequence you're analyzing. The protein database expects amino acid sequences; nucleotide input is translated first.
🗃️ RefSeq provides the best balance of accuracy and speed for most analyses. Silva is optimal for 16S/18S rRNA studies.
📊 Number of top matches to return. More results give broader taxonomic context but make the analysis slower.
🎯 Interpreting Results
ANI ≥95%: Same species
ANI 80-95%: Related species/genus
ANI <80%: Distant relationship
Completeness: Fraction of reference genome represented
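When processing many results, the ANI guide above can be applied in a few lines of Python. This is a small sketch using the thresholds listed here; `interpret_ani` is a hypothetical helper name, and a value of exactly 95% is treated as "Same species" to match the ≥95% rule.

```python
def interpret_ani(ani_percent):
    """Map an ANI percentage to the categories in the guide above."""
    if ani_percent >= 95:
        return "Same species"
    if ani_percent >= 80:
        return "Related species/genus"
    return "Distant relationship"
```

These cutoffs are rules of thumb, so values close to a boundary are best read alongside the completeness estimate and taxonomic assignment.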