Hi-C Proximity-Guided Assembly

Regain ultra-long-range genomic contiguity with Phase Genomics’ patent-pending Proximo™ Hi-C and ProxiMeta™ Hi-C technology.

chromosome-scale scaffolds for every genome

Get chromosome-scale scaffolds for virtually any genome

Proximo Hi-C measures the structure of DNA in vivo and uses that information to scaffold contigs into entire chromosomes. This powerful new scaffolding approach requires only a biological sample in its natural, intact state — no need to isolate high-molecular-weight DNA. Chromosome-scale scaffolds generated with Proximo Hi-C empower you to answer biological questions you never could have before and to make important new discoveries.

Inside the cell, chromosomes are typically unwound into a hairball-like chromatin structure. DNA sequences found on the same chromosome tend to be physically close together in the chromatin. Sequences located closer together on a chromosome are also more likely to be close together than sequences farther apart on the chromosome. Hi-C measures these structural chromatin features, providing architectural genome information to complement more commonly available sequence information.

In Proximo Hi-C, chromatin is cross-linked with formaldehyde to fix DNA sequences in close physical proximity. The cross-linked chromatin is fragmented, and the resulting junctions are extracted and proximity ligated. These junctions capture chromatin structure information. Illumina paired-end sequencing is then used to read both ends of the ligated fragments, identifying the sequences that were originally in close proximity in vivo.

The Proximo Hi-C reads are mapped to a set of contigs from a draft assembly. The number of Hi-C read pairs mapped to a given ligation site reflects the original chromatin structural information captured via the cross-linking process.

The structural information is used first to place contigs into chromosome groups...

...and then used to order and orient the contigs into whole chromosome-scale scaffolds.

chromosome-scale scaffolds for every genome

Explore the depths of the microbial world

The ProxiMeta Hi-C metagenomic deconvolution method reveals which DNA sequences originated in the same cell, enabling assembly of whole genomes from mixed samples in their natural state — no culturing or high-molecular-weight DNA extraction required. Like Proximo Hi-C, ProxiMeta Hi-C is based on capturing physical DNA proximity with in vivo Hi-C. This powerful new source of information allows sequences from the same species and/or strain to be grouped, yielding dozens or even hundreds of genomes for rare, unculturable, and novel microbes easily and affordably.

ProxiMeta Hi-C uses in vivo cross-linking to fix DNA sequences which were present inside the same intact cell. Cross-linking generates linkage information between all DNA sequences inside the same cell, so it also generates data about which chromosomes were present inside the same cell (e.g., for eukaryotic metagenomics) and which plasmids were associated with which host genomes (e.g., for identification of anti-microbial resistance plasmids in strains).

The cross-linked chromatin is fragmented and proximity ligated, creating chimeric junctions between sequences originating from the same cell. Illumina paired-end sequencing these junctions measures which pairs of sequences were in close physical proximity in vivo, which can then be used to group sequences by cellular origin.

The chromatin structure information generated by ProxiMeta Hi-C is used deconvolute metagenomes by identifying which groups of contigs were present inside the same cell, creating individual genome assemblies for the species and strains which were originally present in the sample, including estimating cellular abundance. Because no culturing is required, novel genomes can be detected and characterized as they were in nature, and mutations, plasmids, and multiple chromosomes can be placed into a specific species or strain. Available reference data is then used to add context to the genomes extracted from a sample. Because reference data is not used as an input to the deconvolution process, the biases and limitations of known reference data do not affect the results for a sample: the pure chromatin structural data drives deconvolution, with reference data applied to layer in known scientific information after the fact.

Complete chromosome-scale scaffolds and deconvoluted metagenomes, along with a continuing commitment to personalized service without hidden costs.