Cover illustration
The spatial organization of chromatins plays essential role in regulating transcriptional activity. There are mainly three kinds of methods to study 3D genome organization: molecular mapping, imaging and computational modeling. In this review we focus on reviewing (i) the recent developed super-resolution microscopy techniques to image and study chromatin, and (ii) the computational methods to analyze and visualize chromatin interactions derived from chromosome configuration [Detail] ...
Stochastic dynamics pervades gene regulation. Despite being random, the dynamics displays a kind of innate structure. In fact, two stochastic forces combine driving efforts: one force originates from the gradient of the underlying stochastic potential, and the other originates from the mathematical curl of the probability flux. The curl force gives rise to rotation. The gradient force gives rise to drift. Together they give rise to helical behavior. Here, it is shown that around and about the vicinity of attractive fixed points, the gradient force naturally wanes but the curl force is found to remain high. This leads to a locally noticeably different type of stochastic track near and about attractive fixed points, compared to tracks in regions where drift dominates. The consistency of this observation with the experimental fact that, in biology, fate commitment appears to not be a-priory locked-in, but rather necessitating active maintenance, is discussed. Hence attractive fixed-points are not only fuzzy, but may effectively be, locally, “more free”.
The sequencing revolution driven by high-throughput technologies has generated a huge amount of marine microbial sequences which hide the interaction patterns among microbial species and environment factors. Exploring these patterns is helpful for exploiting the marine resources. In this paper, we use the complex network approach to mine and analyze the interaction patterns of marine taxa and environments in spring, summer, fall and winter seasons. With the 16S rRNA pyrosequencing data of 76 time point taken monthly over 6 years, we first use our MtHc clustering algorithm to generate the operational taxonomic units (OTUs). Then, employ the k-means method to divide 76 time point samples into four seasonal groups, and utilize mutual information (MI) to construct the four correlation networks among microbial species and environment factors. Finally, we adopt the symmetrical non-negative matrix factorization method to detect the interaction patterns, and analysis the relationship between marine species and environment factors. The results show that the four seasonal microbial interaction networks have the characters of complex networks, and interaction patterns are related with the seasonal variability; the same environmental factor influences different species in the four seasons; the four environmental factors of day length, photosynthetically active radiation, NO2+NO3 and silicate may have stronger influences on microbes than other environment factors.
Reconstruction of transcriptome by de novo assembly from next generation sequencing (NGS) short-sequence reads provides an essential mean to catalog expressed genes, identify splicing isoforms, and capture the expression detail of transcripts for organisms with no reference genome available. De novo transcriptome assembly faces many unique challenges, including alternative splicing, variable expression level covering a dynamic range of several orders of magnitude, artifacts introduced by reverse transcription, etc. In the current review, we illustrate the grand strategy in applying De Bruijn Graph (DBG) approach in de novo transcriptome assembly. We further analyze many parameters proven critical in transcriptome assembly using DBG. Among them,
Although the diploid nature has been observed for over 50 years, phasing the diploid is still a laborious task. The speed and throughput of next generation sequencing have largely increased in the past decades. However, the short read-length remains one of the biggest challenges of haplotype analysis. For instance, reads as short as 150 bp span no more than one variant in most cases. Numerous experimental technologies have been developed to overcome this challenge. Distance, complexity and accuracy of the linkages obtained are the main factors to evaluate the efficiency of whole genome haplotyping methods. Here, we review these experimental technologies, evaluating their efficiency in linkages obtaining and system complexity. The technologies are organized into four categories based on its strategy: (i) chromosomes separation, (ii) dilution pools, (iii) crosslinking and proximity ligation, (ix) long-read technologies. Within each category, several subsections are listed to classify each technology. Innovative experimental strategies are expected to have high-quality performance, low cost and be labor-saving, which will be largely desired in the future.
Fundamental improvement was made for genome sequencing since the next-generation sequencing (NGS) came out in the 2000s. The newer technologies make use of the power of massively-parallel short-read DNA sequencing, genome alignment and assembly methods to digitally and rapidly search the genomes on a revolutionary scale, which enable large-scale whole genome sequencing (WGS) accessible and practical for researchers. Nowadays, whole genome sequencing is more and more prevalent in detecting the genetics of diseases, studying causative relations with cancers, making genome-level comparative analysis, reconstruction of human population history, and giving clinical implications and instructions. In this review, we first give a typical pipeline of whole genome sequencing, including the lab template preparation, sequencing, genome assembling and quality control, variants calling and annotations. We compare the difference between whole genome and whole exome sequencing (WES), and explore a wide range of applications of whole genome sequencing for both mendelian diseases and complex diseases in medical genetics. We highlight the impact of whole genome sequencing in cancer studies, regulatory variant analysis, predictive medicine and precision medicine, as well as discuss the challenges of the whole genome sequencing.
The recent advances in chromosome configuration capture (3C)-based series molecular methods and optical super-resolution (SR) techniques offer powerful tools to investigate three dimensional (3D) genomic structure in prokaryotic and eukaryotic cell nucleus. In this review, we focus on the progress during the last decade in this exciting field. Here we at first introduce briefly genome organization at chromosome, domain and sub-domain level, respectively; then we provide a short introduction to various super-resolution microscopy techniques which can be employed to detect genome 3D structure. We also reviewed the progress of quantitative and visualization tools to evaluate and visualize chromatin interactions in 3D genome derived from Hi-C data. We end up with the discussion that imaging methods and 3C-based molecular methods are not mutually exclusive - - - - actually they are complemental to each other and can be combined together to study 3D genome organization.