------------------------------------------------------ EnTAP Run Information - Execution ------------------------------------------------------ Current EnTAP Version: 0.10.4 ------------------------------------------------------ Transcriptome Statistics ------------------------------------------------------ Protein sequences found Total sequences: 35131 Total length of transcriptome(bp): 44539683 Average sequence length(bp): 1267.00 n50: 1710 n90: 636 Longest sequence(bp): 15984 (Ppr_1145.961) Shortest sequence(bp): 189 (Ppr_58.25178) ------------------------------------------------------ Similarity Search - DIAMOND - complete ------------------------------------------------------ Search results: Total alignments: 219870 Total unselected results: 193032 Total unique transcripts with an alignment: 26838 Total unique transcripts without an alignment: 8293 Total unique informative alignments: 14627 Total unique uninformative alignments: 12211 Total unique contaminants: 0(0.00%): Top 10 alignments by species: 1)populus trichocarpa: 14855(55.35%) 2)populus euphratica: 10630(39.61%) 3)hevea brasiliensis: 142(0.53%) 4)manihot esculenta: 76(0.28%) 5)jatropha curcas: 65(0.24%) 6)ricinus communis: 65(0.24%) 7)quercus suber: 56(0.21%) 8)camellia sinensis: 52(0.19%) 9)quercus lobata: 48(0.18%) 10)pistacia vera: 48(0.18%) ------------------------------------------------------ Compiled Similarity Search - DIAMOND - Best Overall ------------------------------------------------------ Total unique transcripts with an alignment: 26838 Total unique transcripts without an alignment: 8293 Total unique informative alignments: 14742 Total unique uninformative alignments: 12096 Total unique contaminants: 0(0.00%): Top 10 alignments by species: 1)populus trichocarpa: 15021(55.97%) 2)populus euphratica: 10463(38.99%) 3)hevea brasiliensis: 136(0.51%) 4)manihot esculenta: 80(0.30%) 5)jatropha curcas: 68(0.25%) 6)ricinus communis: 65(0.24%) 7)quercus suber: 55(0.20%) 8)camellia sinensis: 52(0.19%) 9)quercus lobata: 48(0.18%) 10)pistacia vera: 47(0.18%) ------------------------------------------------------ Gene Family - Gene Ontology and Pathway - EggNOG ------------------------------------------------------ Statistics for overall Eggnog results: Total unique sequences with family assignment: 29340 Total unique sequences without family assignment: 5791 Top 10 Taxonomic Scopes Assigned: 1)Viridiplantae: 26854(91.53%) 2)Eukaryotes: 2162(7.37%) 3)Ancestor: 314(1.07%) 4)Fungi: 6(0.02%) 5)Animals: 2(0.01%) 6)Bacteria: 1(0.00%) 7)Arthropoda: 1(0.00%) Total unique sequences with at least one GO term: 29340 Total unique sequences without GO terms: 0 Total GO terms assigned: 1495175 Total molecular_function terms (lvl=1): 34202 Total unique molecular_function terms (lvl=1): 30 Top 10 molecular_function terms assigned (lvl=1): 1)GO:0005488-binding(L=1): 13929(40.73%) 2)GO:0003824-catalytic activity(L=1): 13226(38.67%) 3)GO:0001071-nucleic acid binding transcription factor activity(L=1): 1972(5.77%) 4)GO:0005215-transporter activity(L=1): 1857(5.43%) 5)GO:0005198-structural molecule activity(L=1): 808(2.36%) 6)GO:0009055-electron carrier activity(L=1): 762(2.23%) 7)GO:0004871-signal transducer activity(L=1): 520(1.52%) 8)GO:0060089-molecular transducer activity(L=1): 520(1.52%) 9)GO:0016209-antioxidant activity(L=1): 298(0.87%) 10)GO:0000988-transcription factor activity, protein binding(L=1): 131(0.38%) Total cellular_component terms (lvl=1): 48013 Total unique cellular_component terms (lvl=1): 15 Top 10 cellular_component terms assigned (lvl=1): 1)GO:0005623-cell(L=1): 16253(33.85%) 2)GO:0043226-organelle(L=1): 12576(26.19%) 3)GO:0016020-membrane(L=1): 8931(18.60%) 4)GO:0032991-macromolecular complex(L=1): 3463(7.21%) 5)GO:0030054-cell junction(L=1): 1728(3.60%) 6)GO:0055044-symplast(L=1): 1687(3.51%) 7)GO:0005576-extracellular region(L=1): 1681(3.50%) 8)GO:0031974-membrane-enclosed lumen(L=1): 1540(3.21%) 9)GO:0009295-nucleoid(L=1): 64(0.13%) 10)GO:0045202-synapse(L=1): 48(0.10%) Total overall terms (lvl=1): 163829 Total unique overall terms (lvl=1): 211 Top 10 overall terms assigned (lvl=1): 1)GO:0008152-metabolic process(L=1): 16650(10.16%) 2)GO:0005623-cell(L=1): 16253(9.92%) 3)GO:0009987-cellular process(L=1): 15353(9.37%) 4)GO:0005488-binding(L=1): 13929(8.50%) 5)GO:0003824-catalytic activity(L=1): 13226(8.07%) 6)GO:0043226-organelle(L=1): 12576(7.68%) 7)GO:0044699-single-organism process(L=1): 9560(5.84%) 8)GO:0016020-membrane(L=1): 8931(5.45%) 9)GO:0050896-response to stimulus(L=1): 7759(4.74%) 10)GO:0065007-biological regulation(L=1): 7039(4.30%) Total biological_process terms (lvl=1): 81614 Total unique biological_process terms (lvl=1): 166 Top 10 biological_process terms assigned (lvl=1): 1)GO:0008152-metabolic process(L=1): 16650(20.40%) 2)GO:0009987-cellular process(L=1): 15353(18.81%) 3)GO:0044699-single-organism process(L=1): 9560(11.71%) 4)GO:0050896-response to stimulus(L=1): 7759(9.51%) 5)GO:0065007-biological regulation(L=1): 7039(8.62%) 6)GO:0032501-multicellular organismal process(L=1): 4088(5.01%) 7)GO:0032502-developmental process(L=1): 4084(5.00%) 8)GO:0051179-localization(L=1): 3893(4.77%) 9)GO:0071840-cellular component organization or biogenesis(L=1): 3400(4.17%) 10)GO:0000003-reproduction(L=1): 2435(2.98%) Total unique sequences with at least one pathway (KEGG) assignment: 6965 Total unique sequences without pathways (KEGG): 22375 Total pathways (KEGG) assigned: 23917 ------------------------------------------------------ Final Annotation Statistics ------------------------------------------------------ Total Sequences: 35131 Similarity Search Total unique sequences with an alignment: 26838 Total unique sequences without an alignment: 8293 Gene Families Total unique sequences with family assignment: 29340 Total unique sequences without family assignment: 5791 Total unique sequences with at least one GO term: 23993 Total unique sequences with at least one pathway (KEGG) assignment: 6936 Totals Total unique sequences annotated (similarity search alignments only): 274 Total unique sequences annotated (gene family assignment only): 2776 Total unique sequences annotated (gene family and/or similarity search): 29614 Total unique sequences unannotated (gene family and/or similarity search): 5517