Large numbers of nucleotide sequences are produced in biology, genetics and medicine, for example to sequence genomic regions, to identify novel mutations, or to identify genes selected in gene expression and protein interaction studies. In such applications, a system that not only collects and manages the sequences but also analyzes the results with modern bioinformatic techniques can be very useful. For this reason we developed a web-based sequence database that assists the researcher in all phases of the sequencing process, from sample submission, to sample processing, to retrieval and analysis of results. The system is built around the CEINGE sequencing service and also serves as an interface between researchers and service operators.

The sequences, each assigned to a specific project, can be tracked at every stage of their processing and are collected in the database. Each sample carries additional information such as primer sequence, template type and expected amplimer size. The system includes advanced analysis routines that can be applied to sets of sequences of variable size: 1) filtering of repeated regions and/or vector sequences; 2) sequence identification by homology search against different datasets; 3) contig assembly using the PHRED/PHRAP/CONSED package. More sophisticated analyses are provided through integrated support for CAPRI, our solution for editing and analyzing nucleic acid and protein sequences.
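
As an illustration only, the sample record and the vector-filtering step described above might be sketched as follows. This is a minimal sketch under stated assumptions, not the system's actual schema or code: the names `Sample`, `Status` and `mask_vector`, the field names, and the three processing states are all hypothetical, and exact string matching is a toy stand-in for the alignment-based screening that real vector filtering relies on.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    """Hypothetical processing stages; the real system's stages are not specified."""
    SUBMITTED = "submitted"
    PROCESSING = "processing"
    COMPLETED = "completed"


@dataclass
class Sample:
    """Hypothetical record for one submitted sample (field names are assumptions)."""
    name: str
    project: str                  # project the sample is assigned to
    primer_sequence: str
    template_type: str
    expected_amplimer_size: int   # in base pairs
    status: Status = Status.SUBMITTED
    sequence: str = ""            # filled in once the read is available


def mask_vector(sequence: str, vector_fragment: str, mask_char: str = "X") -> str:
    """Mask exact, non-overlapping occurrences of vector_fragment (case-insensitive).

    The sequence is returned in upper case. Exact matching is a simplification
    used here for illustration only.
    """
    if not vector_fragment:
        return sequence
    masked = mask_char * len(vector_fragment)
    return sequence.upper().replace(vector_fragment.upper(), masked)


sample = Sample("s1", "projA", "ACGT", "PCR product", 500)
sample.sequence = mask_vector("aaGGATCCtt", "ggatcc")
print(sample.status.value, sample.sequence)  # prints: submitted AAXXXXXXTT
```

Keeping the status on the record itself is one simple way to let both researchers and service operators see where a sample stands at any stage.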

