Chromosomal integrons (as named by (4)) when their frequency in the pan-genome was 100 , or when they contained more than 19 attC sites. They were classed as mobile integrons when missing in more than 40 of the species’ genomes, when present on a plasmid, or when the integron-integrase was from classes 1 to 5. The remaining integrons were classed as `other’. Pseudo-genes detection We translated the six reading frames of the region containing the CALIN elements (10 kb on each side) to detect intI pseudo-genes. We then ran hmmsearch with Pyrvinium pamoate site default options from HMMER suite v3.1b1 to search for hits matching the profile intI Cterm and the profile PF00589 among the translated reading frames. We recovered the hits with evalues lower than 10-3 and alignments covering more than 50 of the profiles. IS detection We identified insertion sequences (IS) by searching for sequence similarity between the genes present 4 kb around or within each genetic element and a database of IS from ISFinder (56). Details can be found in (57). Detection of cassettes in INTEGRALL We searched for sequence similarity between all the CDS of CALIN elements and the INTEGRALL database using BLASTN from BLAST 2.2.30+. Cassettes were considered homologous to those of INTEGRALL when the BLASTN alignment showed more than 40 identity. RESULTSPhylogenetic analyses We have made two phylogenetic analyses. One analysis encompasses the set of all tyrosine recombinases and the other focuses on IntI. The phylogenetic tree of tyrosine recombinases (Supplementary Figure S1) was built using 204 proteins, including: 21 integrases adjacent to attC sites and matching the PF00589 profile but lacking the intI Cterm domain, seven proteins identified by both profiles and representative a0023781 of the Isoarnebin 4 site diversity of IntI, and 176 known tyrosine recombinases from phages and from the literature (12). We aligned the protein sequences with Muscle v3.8.31 with default options (49). We curated the alignment with BMGE using default options (50). The tree was then built with IQTREE multicore version 1.2.3 with the model LG+I+G4. This model was the one minimizing the Bayesian Information Criterion (BIC) among all models available (`-m TEST’ option in IQ-TREE). We made 10 000 ultra fast bootstraps to evaluate node support (Supplementary Figure S1, Tree S1). The phylogenetic analysis of IntI was done using the sequences from complete integrons or In0 elements (i.e., integrases identified by both HMM profiles) (Supplementary Figure S2). We added to this dataset some of the known integron-integrases of class 1, 2, 3, 4 and 5 retrieved from INTEGRALL. Given the previous phylogenetic analysis we used known XerC and XerD proteins to root the tree. Alignment and phylogenetic reconstruction were done using the same procedure; except that we built ten trees independently, and picked the one with best log-likelihood for the analysis (as recommended by the IQ-TREE authors (51)). The robustness of the branches was assessed using 1000 bootstraps (Supplementary Figure S2, Tree S2, Table S4).Pan-genomes Pan-genomes are the full complement of genes in the species. They were built by clustering homologous proteins into families for each of the species (as previously described in (52)). Briefly, we determined the journal.pone.0169185 lists of putative homologs between pairs of genomes with BLASTP (53) (default parameters) and used the e-values (<10-4 ) to cluster them using SILIX (54). SILIX parameters were set such that a protein was homologous to ano.Chromosomal integrons (as named by (4)) when their frequency in the pan-genome was 100 , or when they contained more than 19 attC sites. They were classed as mobile integrons when missing in more than 40 of the species' genomes, when present on a plasmid, or when the integron-integrase was from classes 1 to 5. The remaining integrons were classed as `other'. Pseudo-genes detection We translated the six reading frames of the region containing the CALIN elements (10 kb on each side) to detect intI pseudo-genes. We then ran hmmsearch with default options from HMMER suite v3.1b1 to search for hits matching the profile intI Cterm and the profile PF00589 among the translated reading frames. We recovered the hits with evalues lower than 10-3 and alignments covering more than 50 of the profiles. IS detection We identified insertion sequences (IS) by searching for sequence similarity between the genes present 4 kb around or within each genetic element and a database of IS from ISFinder (56). Details can be found in (57). Detection of cassettes in INTEGRALL We searched for sequence similarity between all the CDS of CALIN elements and the INTEGRALL database using BLASTN from BLAST 2.2.30+. Cassettes were considered homologous to those of INTEGRALL when the BLASTN alignment showed more than 40 identity. RESULTSPhylogenetic analyses We have made two phylogenetic analyses. One analysis encompasses the set of all tyrosine recombinases and the other focuses on IntI. The phylogenetic tree of tyrosine recombinases (Supplementary Figure S1) was built using 204 proteins, including: 21 integrases adjacent to attC sites and matching the PF00589 profile but lacking the intI Cterm domain, seven proteins identified by both profiles and representative a0023781 of the diversity of IntI, and 176 known tyrosine recombinases from phages and from the literature (12). We aligned the protein sequences with Muscle v3.8.31 with default options (49). We curated the alignment with BMGE using default options (50). The tree was then built with IQTREE multicore version 1.2.3 with the model LG+I+G4. This model was the one minimizing the Bayesian Information Criterion (BIC) among all models available (`-m TEST’ option in IQ-TREE). We made 10 000 ultra fast bootstraps to evaluate node support (Supplementary Figure S1, Tree S1). The phylogenetic analysis of IntI was done using the sequences from complete integrons or In0 elements (i.e., integrases identified by both HMM profiles) (Supplementary Figure S2). We added to this dataset some of the known integron-integrases of class 1, 2, 3, 4 and 5 retrieved from INTEGRALL. Given the previous phylogenetic analysis we used known XerC and XerD proteins to root the tree. Alignment and phylogenetic reconstruction were done using the same procedure; except that we built ten trees independently, and picked the one with best log-likelihood for the analysis (as recommended by the IQ-TREE authors (51)). The robustness of the branches was assessed using 1000 bootstraps (Supplementary Figure S2, Tree S2, Table S4).Pan-genomes Pan-genomes are the full complement of genes in the species. They were built by clustering homologous proteins into families for each of the species (as previously described in (52)). Briefly, we determined the journal.pone.0169185 lists of putative homologs between pairs of genomes with BLASTP (53) (default parameters) and used the e-values (<10-4 ) to cluster them using SILIX (54). SILIX parameters were set such that a protein was homologous to ano.
Related Posts
Degeneration in PD. Neurotherapeutics against PD shall then be targeted against the misregulated accomplices from
Degeneration in PD. Neurotherapeutics against PD shall then be targeted against the misregulated accomplices from the p38 and PI3KAKT cascades. In this assessment, we’ve outlined a lot of such established mechanisms involving the p38 MAPK and PI3KAKT pathways which can give therapeutic windows for the rectification of aberrant DA neuronal dynamics in PD brains.Key words: […]
S and one-fourth of clear cell carcinoma sufferers had DDR gene mutations. Our DDR gene
S and one-fourth of clear cell carcinoma sufferers had DDR gene mutations. Our DDR gene panel consisted on the genes involved in single-strand break repair, double-strand break repair and cell cycle regulation, such as the genes advisable by National Extensive Cancer Network (NCCN) recommendations as cost-effective tools for assessing the lifetime danger of EOC, like […]
The RPA chase data were fit to double exponential decays
As proven right here, at a protein focus of 100 nM, RPA-eGFP could be changed by wild-variety RPA and vice versa, indicating 3-Methyladeninethat the protein was in a position to exchange amongst a free of charge and sure point out beneath these response situations.Figure four. Concentration-dependent exchange of ssDNA-certain RPA. (A) Schematic illustrating the predicted […]