Wilson H Lee

CRIS

CRIS A database of crosslinking-based RNA interactomes and structurome, containing sequencing data generated by various methods (e.g., PARIS/SHARC) and analyzed using the CRSSANT pipeline.

About the Project

SEE PROJECT HERE: https://whl-usc.github.io/

Advancements in the field of RNA structure and interaction studies have enabled high throughput analysis of RNAs, revealing base pairing and higher order interactions. Despite the progress on the experimental methods for studying RNA secondary and intermolecular complexes, computational analysis of such data generated has remained a challenge. One subset of such experimental methods, the crosslinking- and proximity-ligation based approaches generates non-continuous reads that indicate specific RNA structures and interactions. Here, we provide a resource that details the results of the computational analysis from a variety of methods that employ the crosslinking-ligation principle (see PARIS, PARIS2, SHARC).

Briefly, the pipeline functions as follows: Raw data were processed to remove sequencing adapters and PCR duplicates (and if applicable. separated by barcode). All reads were permissively mapped to a custom reference genome (Hg38 with masked repeat regions) using STAR. Primary alignments were extracted, classified, and assembled to duplex groups (DGs), non-overlapping (NGs), or tri-segment groups (TGs).

Here, DGs can be visualized using IGV, converted to stemloop groups, and have endpoints (read arm stopping points) determined. Furthermore, the nucleotide frequency or SHAPE reactivity of DG arms can be determined.

Contact

Zhipeng Lu

(back to top)