------------------------------------------------------ EnTAP Run Information - Execution ------------------------------------------------------ Current EnTAP Version: 0.10.4 ------------------------------------------------------ Transcriptome Statistics ------------------------------------------------------ Protein sequences found Total sequences: 57437 Total length of transcriptome(bp): 62615775 Average sequence length(bp): 1090.00 n50: 1509 n90: 516 Longest sequence(bp): 16920 (Jcr4S00306.10) Shortest sequence(bp): 60 (Jcr4S12350.10) ------------------------------------------------------ Similarity Search - DIAMOND - complete ------------------------------------------------------ Search results: Total alignments: 170724 Total unselected results: 135054 Total unique transcripts with an alignment: 35670 Total unique transcripts without an alignment: 21767 Total unique informative alignments: 18198 Total unique uninformative alignments: 17472 Total unique contaminants: 0(0.00%): Top 10 alignments by species: 1)jatropha curcas: 24820(69.58%) 2)hevea brasiliensis: 879(2.46%) 3)manihot esculenta: 477(1.34%) 4)vitis vinifera: 466(1.31%) 5)camellia sinensis: 429(1.20%) 6)ricinus communis: 402(1.13%) 7)nicotiana tomentosiformis: 359(1.01%) 8)brassica rapa: 297(0.83%) 9)nicotiana tabacum: 288(0.81%) 10)quercus suber: 285(0.80%) ------------------------------------------------------ Compiled Similarity Search - DIAMOND - Best Overall ------------------------------------------------------ Total unique transcripts with an alignment: 35670 Total unique transcripts without an alignment: 21767 Total unique informative alignments: 18199 Total unique uninformative alignments: 17471 Total unique contaminants: 0(0.00%): Top 10 alignments by species: 1)jatropha curcas: 24822(69.59%) 2)hevea brasiliensis: 879(2.46%) 3)manihot esculenta: 476(1.33%) 4)vitis vinifera: 462(1.30%) 5)camellia sinensis: 430(1.21%) 6)ricinus communis: 401(1.12%) 7)nicotiana tomentosiformis: 363(1.02%) 8)brassica rapa: 298(0.84%) 9)nicotiana tabacum: 285(0.80%) 10)quercus suber: 281(0.79%) ------------------------------------------------------ Gene Family - Gene Ontology and Pathway - EggNOG ------------------------------------------------------ Statistics for overall Eggnog results: Total unique sequences with family assignment: 41213 Total unique sequences without family assignment: 16224 Top 10 Taxonomic Scopes Assigned: 1)Viridiplantae: 35013(84.96%) 2)Eukaryotes: 5779(14.02%) 3)Ancestor: 270(0.66%) 4)Bacteria: 76(0.18%) 5)Fungi: 40(0.10%) 6)Arthropoda: 16(0.04%) 7)Animals: 15(0.04%) 8)Nematodes: 2(0.00%) 9)Fishes: 1(0.00%) 10)Archaea: 1(0.00%) Total unique sequences with at least one GO term: 41203 Total unique sequences without GO terms: 10 Total GO terms assigned: 1792656 Total cellular_component terms (lvl=1): 48887 Total unique cellular_component terms (lvl=1): 15 Top 10 cellular_component terms assigned (lvl=1): 1)GO:0005623-cell(L=1): 17176(35.13%) 2)GO:0043226-organelle(L=1): 13359(27.33%) 3)GO:0016020-membrane(L=1): 8463(17.31%) 4)GO:0032991-macromolecular complex(L=1): 3571(7.30%) 5)GO:0031974-membrane-enclosed lumen(L=1): 1562(3.20%) 6)GO:0030054-cell junction(L=1): 1547(3.16%) 7)GO:0005576-extracellular region(L=1): 1541(3.15%) 8)GO:0055044-symplast(L=1): 1526(3.12%) 9)GO:0009295-nucleoid(L=1): 62(0.13%) 10)GO:0045202-synapse(L=1): 39(0.08%) Total molecular_function terms (lvl=1): 48961 Total unique molecular_function terms (lvl=1): 31 Top 10 molecular_function terms assigned (lvl=1): 1)GO:0005488-binding(L=1): 22119(45.18%) 2)GO:0003824-catalytic activity(L=1): 20115(41.08%) 3)GO:0001071-nucleic acid binding transcription factor activity(L=1): 1810(3.70%) 4)GO:0005215-transporter activity(L=1): 1763(3.60%) 5)GO:0005198-structural molecule activity(L=1): 881(1.80%) 6)GO:0009055-electron carrier activity(L=1): 783(1.60%) 7)GO:0060089-molecular transducer activity(L=1): 416(0.85%) 8)GO:0004871-signal transducer activity(L=1): 416(0.85%) 9)GO:0016209-antioxidant activity(L=1): 385(0.79%) 10)GO:0000988-transcription factor activity, protein binding(L=1): 96(0.20%) Total overall terms (lvl=1): 192590 Total unique overall terms (lvl=1): 205 Top 10 overall terms assigned (lvl=1): 1)GO:0008152-metabolic process(L=1): 24149(12.54%) 2)GO:0009987-cellular process(L=1): 22586(11.73%) 3)GO:0005488-binding(L=1): 22119(11.49%) 4)GO:0003824-catalytic activity(L=1): 20115(10.44%) 5)GO:0005623-cell(L=1): 17176(8.92%) 6)GO:0043226-organelle(L=1): 13359(6.94%) 7)GO:0044699-single-organism process(L=1): 9790(5.08%) 8)GO:0016020-membrane(L=1): 8463(4.39%) 9)GO:0050896-response to stimulus(L=1): 7961(4.13%) 10)GO:0065007-biological regulation(L=1): 7049(3.66%) Total biological_process terms (lvl=1): 94742 Total unique biological_process terms (lvl=1): 159 Top 10 biological_process terms assigned (lvl=1): 1)GO:0008152-metabolic process(L=1): 24149(25.49%) 2)GO:0009987-cellular process(L=1): 22586(23.84%) 3)GO:0044699-single-organism process(L=1): 9790(10.33%) 4)GO:0050896-response to stimulus(L=1): 7961(8.40%) 5)GO:0065007-biological regulation(L=1): 7049(7.44%) 6)GO:0051179-localization(L=1): 3714(3.92%) 7)GO:0032501-multicellular organismal process(L=1): 3508(3.70%) 8)GO:0032502-developmental process(L=1): 3488(3.68%) 9)GO:0071840-cellular component organization or biogenesis(L=1): 3059(3.23%) 10)GO:0023052-signaling(L=1): 2606(2.75%) Total unique sequences with at least one pathway (KEGG) assignment: 7060 Total unique sequences without pathways (KEGG): 34153 Total pathways (KEGG) assigned: 24138 ------------------------------------------------------ Final Annotation Statistics ------------------------------------------------------ Total Sequences: 57437 Similarity Search Total unique sequences with an alignment: 35670 Total unique sequences without an alignment: 21767 Gene Families Total unique sequences with family assignment: 41213 Total unique sequences without family assignment: 16224 Total unique sequences with at least one GO term: 31615 Total unique sequences with at least one pathway (KEGG) assignment: 7002 Totals Total unique sequences annotated (similarity search alignments only): 1879 Total unique sequences annotated (gene family assignment only): 7422 Total unique sequences annotated (gene family and/or similarity search): 43092 Total unique sequences unannotated (gene family and/or similarity search): 14345