------------------------------------------------------
EnTAP Run Information - Execution
------------------------------------------------------
Current EnTAP Version: 0.10.4

------------------------------------------------------
Transcriptome Statistics
------------------------------------------------------
Protein sequences found
Total sequences: 29127
Total length of transcriptome(bp): 32252580
Average sequence length(bp): 1107.00
n50: 1539
n90: 525
Longest sequence(bp): 17745 (OWM76907.1)
Shortest sequence(bp): 180 (OWM62563.1)

------------------------------------------------------
Similarity Search - DIAMOND - complete
------------------------------------------------------
Search results:
Total alignments: 105199
Total unselected results: 84334
Total unique transcripts with an alignment: 20865
Total unique transcripts without an alignment: 8262
Total unique informative alignments: 16039
Total unique uninformative alignments: 4826
Total unique contaminants: 0(0.00%):
Top 10 alignments by species:
    1)punica granatum: 19566(93.77%)
    2)eucalyptus grandis: 123(0.59%)
    3)syzygium oleosum: 99(0.47%)
    4)rhodamnia argentea: 72(0.35%)
    5)quercus suber: 61(0.29%)
    6)camellia sinensis: 48(0.23%)
    7)durio zibethinus: 46(0.22%)
    8)ziziphus jujuba: 40(0.19%)
    9)juglans regia: 39(0.19%)
    10)carica papaya: 28(0.13%)

------------------------------------------------------
Compiled Similarity Search - DIAMOND - Best Overall
------------------------------------------------------
Total unique transcripts with an alignment: 20865
Total unique transcripts without an alignment: 8262
Total unique informative alignments: 16039
Total unique uninformative alignments: 4826
Total unique contaminants: 0(0.00%):
Top 10 alignments by species:
    1)punica granatum: 19567(93.78%)
    2)eucalyptus grandis: 123(0.59%)
    3)syzygium oleosum: 100(0.48%)
    4)rhodamnia argentea: 71(0.34%)
    5)quercus suber: 65(0.31%)
    6)durio zibethinus: 49(0.23%)
    7)camellia sinensis: 48(0.23%)
    8)ziziphus jujuba: 41(0.20%)
    9)juglans regia: 39(0.19%)
    10)carica papaya: 27(0.13%)

------------------------------------------------------
Gene Family - Gene Ontology and Pathway - EggNOG
------------------------------------------------------
Statistics for overall Eggnog results:
Total unique sequences with family assignment: 21790
Total unique sequences without family assignment: 7337
Top 10 Taxonomic Scopes Assigned:
    1)Viridiplantae: 20966(96.22%)
    2)Eukaryotes: 690(3.17%)
    3)Ancestor: 119(0.55%)
    4)Animals: 5(0.02%)
    5)Bacteria: 4(0.02%)
    6)Mammals: 3(0.01%)
    7)Arthropoda: 1(0.00%)
    8)Aves: 1(0.00%)
    9)Apicomplexa: 1(0.00%)
Total unique sequences with at least one GO term: 21788
Total unique sequences without GO terms: 2
Total GO terms assigned: 994494
Total molecular_function terms (lvl=1): 24925
Total unique molecular_function terms (lvl=1): 25
Top 10 molecular_function terms assigned (lvl=1):
    1)GO:0005488-binding(L=1): 10201(40.93%)
    2)GO:0003824-catalytic activity(L=1): 9959(39.96%)
    3)GO:0005215-transporter activity(L=1): 1341(5.38%)
    4)GO:0001071-nucleic acid binding transcription factor activity(L=1): 1303(5.23%)
    5)GO:0009055-electron carrier activity(L=1): 560(2.25%)
    6)GO:0005198-structural molecule activity(L=1): 515(2.07%)
    7)GO:0060089-molecular transducer activity(L=1): 319(1.28%)
    8)GO:0004871-signal transducer activity(L=1): 319(1.28%)
    9)GO:0016209-antioxidant activity(L=1): 214(0.86%)
    10)GO:0045735-nutrient reservoir activity(L=1): 79(0.32%)
Total cellular_component terms (lvl=1): 34301
Total unique cellular_component terms (lvl=1): 14
Top 10 cellular_component terms assigned (lvl=1):
    1)GO:0005623-cell(L=1): 11851(34.55%)
    2)GO:0043226-organelle(L=1): 9016(26.28%)
    3)GO:0016020-membrane(L=1): 6509(18.98%)
    4)GO:0032991-macromolecular complex(L=1): 2181(6.36%)
    5)GO:0005576-extracellular region(L=1): 1267(3.69%)
    6)GO:0030054-cell junction(L=1): 1214(3.54%)
    7)GO:0055044-symplast(L=1): 1211(3.53%)
    8)GO:0031974-membrane-enclosed lumen(L=1): 976(2.85%)
    9)GO:0009295-nucleoid(L=1): 39(0.11%)
    10)GO:0019012-virion(L=1): 14(0.04%)
Total overall terms (lvl=1): 117200
Total unique overall terms (lvl=1): 138
Top 10 overall terms assigned (lvl=1):
    1)GO:0008152-metabolic process(L=1): 12445(10.62%)
    2)GO:0005623-cell(L=1): 11851(10.11%)
    3)GO:0009987-cellular process(L=1): 11248(9.60%)
    4)GO:0005488-binding(L=1): 10201(8.70%)
    5)GO:0003824-catalytic activity(L=1): 9959(8.50%)
    6)GO:0043226-organelle(L=1): 9016(7.69%)
    7)GO:0044699-single-organism process(L=1): 7012(5.98%)
    8)GO:0016020-membrane(L=1): 6509(5.55%)
    9)GO:0050896-response to stimulus(L=1): 5881(5.02%)
    10)GO:0065007-biological regulation(L=1): 4992(4.26%)
Total biological_process terms (lvl=1): 57974
Total unique biological_process terms (lvl=1): 99
Top 10 biological_process terms assigned (lvl=1):
    1)GO:0008152-metabolic process(L=1): 12445(21.47%)
    2)GO:0009987-cellular process(L=1): 11248(19.40%)
    3)GO:0044699-single-organism process(L=1): 7012(12.10%)
    4)GO:0050896-response to stimulus(L=1): 5881(10.14%)
    5)GO:0065007-biological regulation(L=1): 4992(8.61%)
    6)GO:0032501-multicellular organismal process(L=1): 2693(4.65%)
    7)GO:0051179-localization(L=1): 2679(4.62%)
    8)GO:0032502-developmental process(L=1): 2647(4.57%)
    9)GO:0071840-cellular component organization or biogenesis(L=1): 2123(3.66%)
    10)GO:0051704-multi-organism process(L=1): 1643(2.83%)
Total unique sequences with at least one pathway (KEGG) assignment: 5022
Total unique sequences without pathways (KEGG): 16768
Total pathways (KEGG) assigned: 16566

------------------------------------------------------
Final Annotation Statistics
------------------------------------------------------
Total Sequences: 29127
Similarity Search
    Total unique sequences with an alignment: 20865
    Total unique sequences without an alignment: 8262
Gene Families
    Total unique sequences with family assignment: 21790
    Total unique sequences without family assignment: 7337
    Total unique sequences with at least one GO term: 18025
    Total unique sequences with at least one pathway (KEGG) assignment: 4984
Totals
    Total unique sequences annotated (similarity search alignments only): 531
    Total unique sequences annotated (gene family assignment only): 1456
    Total unique sequences annotated (gene family and/or similarity search): 22321
    Total unique sequences unannotated (gene family and/or similarity search): 6806
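
Note on the Totals block: the counts in Final Annotation Statistics can be cross-checked with simple arithmetic. The sketch below is not part of EnTAP's output. It copies the reported counts into hypothetical variable names and derives the number of sequences annotated by both methods, which the log does not report directly. It assumes the two "only" categories and the "both" category are disjoint and together make up the "and/or" total.

```python
# Counts copied from the "Final Annotation Statistics" block of the log.
# Variable names are illustrative; they are not EnTAP identifiers.
total_sequences = 29127
with_alignment = 20865
without_alignment = 8262
with_family = 21790
without_family = 7337
similarity_only = 531
family_only = 1456
annotated_any = 22321
unannotated = 6806

# Derived figure (assumption: the categories partition the annotated set).
annotated_both = annotated_any - similarity_only - family_only

# Each check should hold if the reported totals are internally consistent.
checks = {
    "alignment split sums to total": with_alignment + without_alignment == total_sequences,
    "family split sums to total": with_family + without_family == total_sequences,
    "annotated + unannotated sums to total": annotated_any + unannotated == total_sequences,
    "both + similarity-only equals aligned": annotated_both + similarity_only == with_alignment,
    "both + family-only equals with family": annotated_both + family_only == with_family,
}

print(f"annotated by both methods: {annotated_both}")
for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```

Under that assumption, every check passes for this run, and the derived overlap is 20334 sequences.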