Control Systems and Computers, N5, 2017, Article 3

DOI: https://doi.org/10.15407/usim.2017.05.025

Upr. sist. maš., 2017, Issue 5 (271), pp. 25-42.

UDC 004.048

S.A. Babichev

PhD., associate professor, associate professor of department of informatics, Jan Evangelista Purkyne University in Usti nad Labem, Czech Republic, 8, Ceske mladeze Str., Usti nad Labem, Czech Republic, 400 96.

Technology of Wavelet-Filtration of the Gene Expression Profiles in Order to Remove the Background Noise 

Introduction. The solved task is focused on increasing the gene expression profiles quality, which are used to reconstruct the gene regulatory networks. The filtration process is one of the stages of data preprocessing, implementation of which corresponds to the increasing data quality by removing the background “white” noise component.
The aim of the paper is development of the wavelet filtration technology of gene expression profiles based on the Shannon entropy criterion, which calculated by James-Stein shrinkage estimator using.
Methods. During the research, the methods of the computer simulation, wavelet analysis, and entropy methods to estimate the studied data comprehension are used.
Results. The results of the simulation prove that the choice of the mother wavelet type from orthogonal and biorthogonalwavelets in case of the gene expression profiles filtration is not determinative. In terms of the relative criterion calculated as
the Shannon entropy ratio of the filtered gene expression profiles and the extracted noise component, the best results are obtained using the biorthogonal wavelet bior1.5, however the difference obtained using other types of wavelets is insignificant.
The choice of the type of the wavelet from the family of the mother’s wavelets, the choice of the level of the wavelet decomposition,and the choice of the value of the thresholding coefficient are determining in this case.
Conclusions. The wavelet filtration technology of gene expression profiles based on complex use of the methods to estimate the filtered signal and extracted noise comprehension component is proposed based on the performed simulation. The
implementation of this technology allows us to optimise the wavelet filtration process of complex signals in order to remove the “white” noise component.

Keywords: gene expression profiles, wavelets, tresholding, filtration.

Download full text! (In Ukrainan)

  1. Schermer, M.J., 1999. DNA microarrays: a practical approach. Oxford University Press, pp. 17–42.
  2. Basarsky, T., Verdnik, D., Zhai J.Y. et al., 2000. “Microarray biochip technology”. Eaton Publishing, pp. 265–284.
  3. Daubechies, I., 1990. “The wavelet transform, time-frequency localization and signal analysis”. IEEE Trans. Inform. Theory, 36, pp. 961–1005.
    https://doi.org/10.1109/18.57199
  4. Daubechies, I., 1992. Ten lectures on wavelet. CBMS–NSF conf. series in applied math. SLAM Ed., 343 p.
    https://doi.org/10.1137/1.9781611970104
  5. Coifman, R.R., Meyer, Y., Wickerhauser, M.V., 1992. “Wavelet Analysis and Signal Processing”. Wavelets and Their Applications, Boston Jones and Bartlett, pp. 153–178.
  6. Antoshchuk, S.G., 2004. “Realizaciya vejvletnogo preobrazovaniya pri strukturnom analize izobrazhenij”. Elektromashinobuduvannya ta elektroobladnannya, 62, pp. 153–157. (In Russian).
  7. Benedetto, J.J., 1993. Wavelets: Mathematics and Application. CRC Press, Series: Studies in Advanced Mathematics, 1993, 592 p.
  8. Samsul, A., Karim, A., Mohd, T.I., 2009. “Compression of Chemical Signal Using Wavelet Transform”. European J. of Scientific Research, 36(4), pp. 513–520.
  9. Joshi, A., Aravind, H.S., 2014. “Analysis of Adaptive Wavelet Wiener Filtering for ECG Signals: Review”. Int. J. Of Advanced Research in Electronics and Communication Engineering, 3 (4), pp. 395–398.
  10. Chandu, R., Venkateswarlu, M., 2015. “ECG Signal Filtering using an Improved Wavelet Wiener Filtering”. Int. J. of Advanced Technology and Innovative Research, 7 (7), pp. 1242–1247.
  11. Baldi, P., Hatfield, G.W., 2002. “DNA Microarrays and gene expression: From experiments to data analysis modeling”. Cambridge University Press, pp. 22–23.
    https://doi.org/10.1017/CBO9780511541773
  12. Bolstad, B.M., Irizarry, R.A., Astrand M. et al., 2003. “A Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias”. Bioinformatics, 19, pp. 185–193.
    https://doi.org/10.1093/bioinformatics/19.2.185
  13. Irizarry, R.A., Hobbs, B., Collin, F. et al., 2003. “Exploration, normalization, and summaries of high density oligonucleotide array probe level data”. Biostatistics, 4 (2), pp. 249–264.
    https://doi.org/10.1093/biostatistics/4.2.249
  14. Chen, Z., Mcgee, M., LIU, Q., 2009. “Distribution-Free Convolution Model for background correction of oligonucleotide microarray data”. BMC genomics, 10, pp. 1–19.
    https://doi.org/10.1186/1471-2164-10-S1-S19
  15. Computational analysis of gene expression profiles of lung cancer. S. Babichev, A. Kornelyuk, V. Lytvynenko et al. Biopolymers and Cells, 2016, 32(1), pp. 70–79.
  16. Babichev, S., Taif, M.A., Lytvynenko, V., 2016. “Filtration of DNA nucleotide gene expression profiles in the systems of biological objects clustering”. Int. Frontier Science Letters, 8, pp. 1–8.
    https://doi.org/10.18052/www.scipress.com/IFSL.8.1
  17. Babichev, S.A., Didyk, A.A., Litvinenko, V.I. et al., 2010. “Fil’traciya hromatogramm s pomoshch’yu vejvlet-analiza s ispol’zovaniem kriteriya ehntropii”. System technologies, 6(71), pp. 117–131. (In Russian).
  18. Hausser, J., Strimmer, K., 2009. “Entropy inference and the James-Stein estimator, with application to nonlinear gene association networks”. J. of Machine Learning Research, 10, pp. 1469–1484.
  19. Beer, D.G., Kardia, S.L., Huang C. at al., 2002. “Gene-expression profiles predict survival of patients with lung adenocarcinoma”. Nature Medicine, 8(8), pp. 816–824.
    https://doi.org/10.1038/nm733

Received 19.09.2017