------------------------------------------------------ EnTAP Run Information - Execution ------------------------------------------------------ Current EnTAP Version: 0.10.4 ------------------------------------------------------ Transcriptome Statistics ------------------------------------------------------ Protein sequences found Total sequences: 50841 Total length of transcriptome(bp): 64668462 Average sequence length(bp): 1271.00 n50: 1569 n90: 690 Longest sequence(bp): 16563 (FRAEX38873_v2_000316690.2) Shortest sequence(bp): 165 (FRAEX38873_v2_000276560.1) ------------------------------------------------------ Similarity Search - DIAMOND - complete ------------------------------------------------------ Search results: Total alignments: 523737 Total unselected results: 478526 Total unique transcripts with an alignment: 45211 Total unique transcripts without an alignment: 5630 Total unique informative alignments: 34884 Total unique uninformative alignments: 10327 Total unique contaminants: 0(0.00%): Top 10 alignments by species: 1)olea europaea var. sylvestris: 38099(84.27%) 2)sesamum indicum: 3317(7.34%) 3)camellia sinensis: 336(0.74%) 4)coffea arabica: 285(0.63%) 5)erythranthe guttata: 278(0.61%) 6)nicotiana tomentosiformis: 275(0.61%) 7)nicotiana tabacum: 195(0.43%) 8)coffea eugenioides: 165(0.36%) 9)ipomoea triloba: 129(0.29%) 10)ziziphus jujuba: 96(0.21%) ------------------------------------------------------ Compiled Similarity Search - DIAMOND - Best Overall ------------------------------------------------------ Total unique transcripts with an alignment: 45211 Total unique transcripts without an alignment: 5630 Total unique informative alignments: 34886 Total unique uninformative alignments: 10325 Total unique contaminants: 0(0.00%): Top 10 alignments by species: 1)olea europaea var. sylvestris: 38108(84.29%) 2)sesamum indicum: 3321(7.35%) 3)camellia sinensis: 332(0.73%) 4)coffea arabica: 292(0.65%) 5)erythranthe guttata: 278(0.61%) 6)nicotiana tomentosiformis: 277(0.61%) 7)nicotiana tabacum: 193(0.43%) 8)coffea eugenioides: 158(0.35%) 9)ipomoea triloba: 133(0.29%) 10)ziziphus jujuba: 87(0.19%) ------------------------------------------------------ Gene Family - Gene Ontology and Pathway - EggNOG ------------------------------------------------------ Statistics for overall Eggnog results: Total unique sequences with family assignment: 49178 Total unique sequences without family assignment: 1663 Top 10 Taxonomic Scopes Assigned: 1)Viridiplantae: 47580(96.75%) 2)Eukaryotes: 1368(2.78%) 3)Ancestor: 160(0.33%) 4)Bacteria: 56(0.11%) 5)Fungi: 8(0.02%) 6)Animals: 4(0.01%) 7)Arthropoda: 2(0.00%) Total unique sequences with at least one GO term: 49173 Total unique sequences without GO terms: 5 Total GO terms assigned: 2433269 Total biological_process terms (lvl=1): 136227 Total unique biological_process terms (lvl=1): 151 Top 10 biological_process terms assigned (lvl=1): 1)GO:0008152-metabolic process(L=1): 28499(20.92%) 2)GO:0009987-cellular process(L=1): 27100(19.89%) 3)GO:0044699-single-organism process(L=1): 16363(12.01%) 4)GO:0050896-response to stimulus(L=1): 12928(9.49%) 5)GO:0065007-biological regulation(L=1): 11939(8.76%) 6)GO:0032502-developmental process(L=1): 6699(4.92%) 7)GO:0032501-multicellular organismal process(L=1): 6615(4.86%) 8)GO:0051179-localization(L=1): 6404(4.70%) 9)GO:0071840-cellular component organization or biogenesis(L=1): 5324(3.91%) 10)GO:0000003-reproduction(L=1): 3681(2.70%) Total molecular_function terms (lvl=1): 57404 Total unique molecular_function terms (lvl=1): 30 Top 10 molecular_function terms assigned (lvl=1): 1)GO:0005488-binding(L=1): 24350(42.42%) 2)GO:0003824-catalytic activity(L=1): 21703(37.81%) 3)GO:0001071-nucleic acid binding transcription factor activity(L=1): 3672(6.40%) 4)GO:0005215-transporter activity(L=1): 2963(5.16%) 5)GO:0005198-structural molecule activity(L=1): 1464(2.55%) 6)GO:0009055-electron carrier activity(L=1): 964(1.68%) 7)GO:0004871-signal transducer activity(L=1): 729(1.27%) 8)GO:0060089-molecular transducer activity(L=1): 729(1.27%) 9)GO:0016209-antioxidant activity(L=1): 389(0.68%) 10)GO:0000988-transcription factor activity, protein binding(L=1): 216(0.38%) Total overall terms (lvl=1): 279486 Total unique overall terms (lvl=1): 196 Top 10 overall terms assigned (lvl=1): 1)GO:0005623-cell(L=1): 29645(10.61%) 2)GO:0008152-metabolic process(L=1): 28499(10.20%) 3)GO:0009987-cellular process(L=1): 27100(9.70%) 4)GO:0005488-binding(L=1): 24350(8.71%) 5)GO:0043226-organelle(L=1): 23008(8.23%) 6)GO:0003824-catalytic activity(L=1): 21703(7.77%) 7)GO:0044699-single-organism process(L=1): 16363(5.85%) 8)GO:0016020-membrane(L=1): 15730(5.63%) 9)GO:0050896-response to stimulus(L=1): 12928(4.63%) 10)GO:0065007-biological regulation(L=1): 11939(4.27%) Total cellular_component terms (lvl=1): 85855 Total unique cellular_component terms (lvl=1): 15 Top 10 cellular_component terms assigned (lvl=1): 1)GO:0005623-cell(L=1): 29645(34.53%) 2)GO:0043226-organelle(L=1): 23008(26.80%) 3)GO:0016020-membrane(L=1): 15730(18.32%) 4)GO:0032991-macromolecular complex(L=1): 6257(7.29%) 5)GO:0030054-cell junction(L=1): 2973(3.46%) 6)GO:0055044-symplast(L=1): 2955(3.44%) 7)GO:0031974-membrane-enclosed lumen(L=1): 2725(3.17%) 8)GO:0005576-extracellular region(L=1): 2367(2.76%) 9)GO:0009295-nucleoid(L=1): 93(0.11%) 10)GO:0045202-synapse(L=1): 42(0.05%) Total unique sequences with at least one pathway (KEGG) assignment: 12763 Total unique sequences without pathways (KEGG): 36415 Total pathways (KEGG) assigned: 43085 ------------------------------------------------------ Final Annotation Statistics ------------------------------------------------------ Total Sequences: 50841 Similarity Search Total unique sequences with an alignment: 45211 Total unique sequences without an alignment: 5630 Gene Families Total unique sequences with family assignment: 49178 Total unique sequences without family assignment: 1663 Total unique sequences with at least one GO term: 41627 Total unique sequences with at least one pathway (KEGG) assignment: 12632 Totals Total unique sequences annotated (similarity search alignments only): 267 Total unique sequences annotated (gene family assignment only): 4234 Total unique sequences annotated (gene family and/or similarity search): 49445 Total unique sequences unannotated (gene family and/or similarity search): 1396