Raw data is stored as .txt files. In MATLAB, we convert the raw .txt files into .mat files, which contain the DNA sequences and their labels. The sequences in the .mat files are then encoded by onehotencode.py, and the encoded DNA sequences are saved as .npz files. Finally, run iPro-CSAF.py to perform feature extraction and promoter recognition.
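
The encoding step can be illustrated with a minimal sketch. It is not the code in onehotencode.py: the A, C, G, T channel order, the array keys `x` and `y`, the helper name `one_hot_encode`, and the toy sequences are all assumptions made for illustration. In practice the sequences and labels would first be read from the .mat file (for example with `scipy.io.loadmat`), using whatever variable names that file actually contains.

```python
import io
import numpy as np

# Assumed channel order; the repository's onehotencode.py may use a different one.
ALPHABET = "ACGT"

def one_hot_encode(sequences):
    """Encode equal-length DNA strings as an array of shape (n, length, 4)."""
    index = {base: i for i, base in enumerate(ALPHABET)}
    length = len(sequences[0])
    encoded = np.zeros((len(sequences), length, len(ALPHABET)), dtype=np.float32)
    for n, seq in enumerate(sequences):
        if len(seq) != length:
            raise ValueError("all sequences must have the same length")
        for pos, base in enumerate(seq.upper()):
            i = index.get(base)
            if i is not None:  # unknown symbols such as N stay all-zero
                encoded[n, pos, i] = 1.0
    return encoded

# Toy data standing in for the sequences and labels read from a .mat file.
sequences = ["ACGT", "TTGA"]
labels = np.array([1, 0])

x = one_hot_encode(sequences)

# Written to an in-memory buffer here; pass a file path instead to create a .npz file.
buffer = io.BytesIO()
np.savez(buffer, x=x, y=labels)
buffer.seek(0)
restored = np.load(buffer)
print(restored["x"].shape)  # (2, 4, 4)
```

Each sequence becomes a length-by-4 matrix with a single 1 per position, so the saved array can be loaded directly with `np.load` by a downstream script.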