This paper presents detailed analysis of the entire sequence of a cosmid clone, 26H7, containing 35 kb of human DNA. This cosmid resides on the q27.1 region of the human X chromosome between, DS1232 and DXS119 loci. Novel potential small exons were detected for which conventional gene identification strategies (Northern blot analysis and extensive cDNA library screening) proved to be inefficient. Of the standard repetitive elements we found: 8 Alu's making up 6.2% of the sequence; 10 MIR segments (4.1%); 5 LINE1 elements (4.8%), 3 MIR2 (1.0%); 2 MLT (2.9%), and 1 MSTA (0.7%) representing about 20% of the total sequence. The overall GC content was rather low, only 42% and no CpG island was detected using rare restriction enzymes. However, a CpG-rich region was identified. Computer aided analysis of the sequence inferred the presence of three possible genes: one of them was found to be homologous to the U7 RNA family elements; a second is reported in this paper, however af the moment no significant homology has been found in the data bank. The third predicted gene has not as yet been found to be detectable by RT-PCR. We also report in this paper the identification of X-chromosome specific repeated sequences.
- computer analysis
- gene identification