------------------------------------------------------ EnTAP Run Information - Execution ------------------------------------------------------ Current EnTAP Version: 0.10.4 ------------------------------------------------------ Transcriptome Statistics ------------------------------------------------------ Protein sequences found Total sequences: 25808 Total length of transcriptome(bp): 31550556 Average sequence length(bp): 1222.00 n50: 1509 n90: 648 Longest sequence(bp): 11823 (Qrob_P0010570.2) Shortest sequence(bp): 150 (Qrob_P0424800.2) ------------------------------------------------------ Similarity Search - DIAMOND - complete ------------------------------------------------------ Search results: Total alignments: 200708 Total unselected results: 178409 Total unique transcripts with an alignment: 22299 Total unique transcripts without an alignment: 3509 Total unique informative alignments: 17443 Total unique uninformative alignments: 4856 Total unique contaminants: 0(0.00%): Top 10 alignments by species: 1)quercus lobata: 14775(66.26%) 2)quercus suber: 4833(21.67%) 3)juglans regia: 381(1.71%) 4)ziziphus jujuba: 174(0.78%) 5)camellia sinensis: 139(0.62%) 6)prunus avium: 106(0.48%) 7)malus domestica: 91(0.41%) 8)hevea brasiliensis: 83(0.37%) 9)nicotiana tomentosiformis: 75(0.34%) 10)prosopis alba: 71(0.32%) ------------------------------------------------------ Compiled Similarity Search - DIAMOND - Best Overall ------------------------------------------------------ Total unique transcripts with an alignment: 22299 Total unique transcripts without an alignment: 3509 Total unique informative alignments: 17445 Total unique uninformative alignments: 4854 Total unique contaminants: 0(0.00%): Top 10 alignments by species: 1)quercus lobata: 14633(65.62%) 2)quercus suber: 4975(22.31%) 3)juglans regia: 381(1.71%) 4)ziziphus jujuba: 167(0.75%) 5)camellia sinensis: 136(0.61%) 6)prunus avium: 104(0.47%) 7)malus domestica: 90(0.40%) 8)hevea brasiliensis: 80(0.36%) 9)prosopis alba: 77(0.35%) 10)nicotiana tomentosiformis: 75(0.34%) ------------------------------------------------------ Gene Family - Gene Ontology and Pathway - EggNOG ------------------------------------------------------ Statistics for overall Eggnog results: Total unique sequences with family assignment: 25031 Total unique sequences without family assignment: 777 Top 10 Taxonomic Scopes Assigned: 1)Viridiplantae: 23471(93.77%) 2)Eukaryotes: 1228(4.91%) 3)Ancestor: 324(1.29%) 4)Arthropoda: 4(0.02%) 5)Animals: 2(0.01%) 6)Fungi: 1(0.00%) 7)Mammals: 1(0.00%) Total unique sequences with at least one GO term: 25031 Total unique sequences without GO terms: 0 Total GO terms assigned: 1223264 Total molecular_function terms (lvl=1): 31279 Total unique molecular_function terms (lvl=1): 27 Top 10 molecular_function terms assigned (lvl=1): 1)GO:0005488-binding(L=1): 12906(41.26%) 2)GO:0003824-catalytic activity(L=1): 12772(40.83%) 3)GO:0005215-transporter activity(L=1): 1626(5.20%) 4)GO:0001071-nucleic acid binding transcription factor activity(L=1): 1305(4.17%) 5)GO:0009055-electron carrier activity(L=1): 661(2.11%) 6)GO:0005198-structural molecule activity(L=1): 608(1.94%) 7)GO:0004871-signal transducer activity(L=1): 438(1.40%) 8)GO:0060089-molecular transducer activity(L=1): 438(1.40%) 9)GO:0016209-antioxidant activity(L=1): 273(0.87%) 10)GO:0045735-nutrient reservoir activity(L=1): 93(0.30%) Total cellular_component terms (lvl=1): 38337 Total unique cellular_component terms (lvl=1): 15 Top 10 cellular_component terms assigned (lvl=1): 1)GO:0005623-cell(L=1): 13188(34.40%) 2)GO:0043226-organelle(L=1): 9683(25.26%) 3)GO:0016020-membrane(L=1): 7662(19.99%) 4)GO:0032991-macromolecular complex(L=1): 2502(6.53%) 5)GO:0005576-extracellular region(L=1): 1462(3.81%) 6)GO:0030054-cell junction(L=1): 1380(3.60%) 7)GO:0055044-symplast(L=1): 1367(3.57%) 8)GO:0031974-membrane-enclosed lumen(L=1): 1004(2.62%) 9)GO:0009295-nucleoid(L=1): 43(0.11%) 10)GO:0019012-virion(L=1): 21(0.05%) Total overall terms (lvl=1): 139847 Total unique overall terms (lvl=1): 146 Top 10 overall terms assigned (lvl=1): 1)GO:0008152-metabolic process(L=1): 15114(10.81%) 2)GO:0009987-cellular process(L=1): 13521(9.67%) 3)GO:0005623-cell(L=1): 13188(9.43%) 4)GO:0005488-binding(L=1): 12906(9.23%) 5)GO:0003824-catalytic activity(L=1): 12772(9.13%) 6)GO:0043226-organelle(L=1): 9683(6.92%) 7)GO:0044699-single-organism process(L=1): 8355(5.97%) 8)GO:0016020-membrane(L=1): 7662(5.48%) 9)GO:0050896-response to stimulus(L=1): 7353(5.26%) 10)GO:0065007-biological regulation(L=1): 5636(4.03%) Total biological_process terms (lvl=1): 70231 Total unique biological_process terms (lvl=1): 104 Top 10 biological_process terms assigned (lvl=1): 1)GO:0008152-metabolic process(L=1): 15114(21.52%) 2)GO:0009987-cellular process(L=1): 13521(19.25%) 3)GO:0044699-single-organism process(L=1): 8355(11.90%) 4)GO:0050896-response to stimulus(L=1): 7353(10.47%) 5)GO:0065007-biological regulation(L=1): 5636(8.02%) 6)GO:0051179-localization(L=1): 3278(4.67%) 7)GO:0032501-multicellular organismal process(L=1): 3156(4.49%) 8)GO:0032502-developmental process(L=1): 2921(4.16%) 9)GO:0071840-cellular component organization or biogenesis(L=1): 2458(3.50%) 10)GO:0051704-multi-organism process(L=1): 2374(3.38%) Total unique sequences with at least one pathway (KEGG) assignment: 5880 Total unique sequences without pathways (KEGG): 19151 Total pathways (KEGG) assigned: 19204 ------------------------------------------------------ Final Annotation Statistics ------------------------------------------------------ Total Sequences: 25808 Similarity Search Total unique sequences with an alignment: 22299 Total unique sequences without an alignment: 3509 Gene Families Total unique sequences with family assignment: 25031 Total unique sequences without family assignment: 777 Total unique sequences with at least one GO term: 20914 Total unique sequences with at least one pathway (KEGG) assignment: 5854 Totals Total unique sequences annotated (similarity search alignments only): 120 Total unique sequences annotated (gene family assignment only): 2852 Total unique sequences annotated (gene family and/or similarity search): 25151 Total unique sequences unannotated (gene family and/or similarity search): 657