Cohesin is positioned in mammalian genomes by transcription, CTCF and Wapl
Mammalian genomes are spatially organized by CCCTC-binding factor (CTCF) and cohesin into chromatin loops1,2and topologically associated domains3,4,5,6, which have important roles in gene regulation1,2,4,5,7and recombination7,8,9. By binding to specific sequences10, CTCF defines contact points for cohesin-mediated long-range chromosomalcis-interactions1,2,4,5,6,7,11. Cohesin is also present at these sites12,13, but has been proposed to be loaded onto DNA elsewhere14,15and to extrude chromatin loops until it encounters CTCF bound to DNA16,17,18,19. How cohesin is recruited to CTCF sites, according to this or other models, is unknown. Here we show that the distribution of cohesin in the mouse genome depends on transcription, CTCF and the cohesin release factor Wings apart-like (Wapl). In CTCF-depleted fibroblasts, cohesin cannot be properly recruited to CTCF sites but instead accumulates at transcription start sites of active genes, where the cohesin-loading complex is located14,15. In the absence of both CTCF and Wapl, cohesin accumulates in up to 70 kilobase-long regions at 3′-ends of active genes, in particular if these converge on each other. Changing gene expression modulates the position of these ‘cohesin islands’. These findings indicate that transcription can relocate mammalian cohesin over long distances on DNA, as previously reported for yeast cohesin20,21,22,23, that this translocation contributes to positioning cohesin at CTCF sites, and that active genes can be freed from cohesin either by transcription-mediated translocation or by Wapl-mediated release.