BAHI Halima

Publications internationales

2021

Noura Louzzani, Abdelkrim Boukabou, Halima Bahi, Ali Boussayoud. (2021), A novel chaos based generating function of the Chebyshev polynomials and its applications in image encryption. Chaos, Solitons & Fractals : Elsevier, https://www.sciencedirect.com/science/article/abs/pii/S096007792100669X

Résumé: In this paper, we propose a generating function for Chebyshev polynomials with typical period-doubling to chaos. In this context, the bifurcation diagram and Lyapunov exponent proved that the proposed generating function is a deterministic system that exhibits chaotic behavior for specific values of the control parameters. As an application, this proposed generating function is used as a chaos-based cryptosystem to encrypt different images. Security analysis demonstrated that the proposed generating function of the Chebyshev polynomials presents an excellent performance in image encryption against various attacks.

Bilal Dendani, Halima Bahi, Toufik Sari. (2021), Self-Supervised Speech Enhancement for Arabic Speech Recognition in Real-World Environments. Traitement du Signal : IIETA, https://www.iieta.org/journals/ts/paper/10.18280/ts.380212

Résumé: Mobile speech recognition attracts much attention in the ubiquitous context, however, background noises, speech coding, and transmission errors are prone to corrupt the incoming speech. Therein, building a robust speech recognizer requires the availability of a large number of real-world speech samples. Arabic language, like many other languages, lacks such resources; to overcome this limitation, we propose a speech enhancement step, before the recognition begins. For the speech enhancement purpose, we suggest the use of a deep autoencoder (DAE) algorithm. A two-step procedure is suggested: in the first step, an overcomplete DAE is trained in an unsupervised way, and in the second one, a denoising DAE is trained in a supervised way leveraging the clean speech produced in the previous step. Experimental results performed on a real-life mobile database confirmed the potentials of the proposed approach and show a reduction of the WER (Word Error Rate) of a ubiquitous Arabic speech recognizer. Further experiments show an improvement of the perceptual evaluation of speech quality (PESQ), and the short-time objective intelligibility (STOI) as well.

2020

Halima Bahi, Khaled Necibi. (2020), Fuzzy Logic Applied for Pronunciation Assessment. International Journal of Computer-Assisted Language Learning and Teaching : IGI Global, https://www.igi-global.com/article/fuzzy-logic-applied-for-pronunciation-assessment/243696

Résumé: Pronunciation teaching is an important stage in language learning activities. This article tackles the pronunciation scoring problem where research has demonstrated relatively low human-human and low human-machine agreement rates, which makes teachers skeptical about their relevance. To overcome these limitations, a fuzzy combination of two machines scores is suggested. The experiments were carried in the context of Algerian pupils learning to read Arabic. Although the native language of Algerian pupils is a dialect of Arabic, Modern Standard Arabic remains difficult for them with difficult sounds to master and letters close in their pronunciation. The article presents a fuzzy evaluation system including both oral reading fluency, and intelligibility. The fuzzy system has shown that despite the disparities between human ratings, its scores correspond at least to one of their ratings and most of the time its ratings are in favor of learners. Therefore, fuzzy logic, more favorable than thresholding systems, encourages learners to pursue their training.

2017

Nadia Lachetar, Halima Bahi. (2017), Conceptual search of songs using domain ontology and semantic links. International Journal of Intelligent Systems Technologies and Applications : Inderscience, https://www.inderscienceonline.com/doi/abs/10.1504/IJISTA.2017.084233

Résumé: Songs retrieval process is complex and implies multiple facets. Nowadays, the most common way of searching songs is through a combination of factual and cultural metadata or by lyrics. Owing to the poor indexing techniques, these systems cannot completely satisfy the users' requests. In this paper, we suggest the use of domain ontology and semantic links to index a collection of songs and perform a conceptual search that offers a benefit way to complement metadata-based methods. A set of experiments were carried over a dedicated dataset and show the superiority of our approach when compared with classical one.

Hamza Frihia, Halima Bahi. (2017), HMM/SVM segmentation and labelling of Arabic speech for speech recognition applications. International Journal of Speech Technology : springer, https://link.springer.com/article/10.1007/s10772-017-9427-z

Résumé: Building a large vocabulary continuous speech recognition (LVCSR) system requires a lot of hours of segmented and labelled speech data. Arabic language, as many other low-resourced languages, lacks such data, but the use of automatic segmentation proved to be a good alternative to make these resources available. In this paper, we suggest the combination of hidden Markov models (HMMs) and support vector machines (SVMs) to segment and to label the speech waveform into phoneme units. HMMs generate the sequence of phonemes and their frontiers; the SVM refines the frontiers and corrects the labels. The obtained segmented and labelled units may serve as a training set for speech recognition applications. The HMM/SVM segmentation algorithm is assessed using both the hit rate and the word error rate (WER); the resulting scores were compared to those provided by the manual segmentation and to those provided by the well-known embedded learning algorithm. The results show that the speech recognizer built upon the HMM/SVM segmentation outperforms in terms of WER the one built upon the embedded learning segmentation of about 0.05%, even in noisy background.

2015

Khaled Necibi, Halima Bahi. (2015), A Statistical-based Decision for Arabic Pronunciation Assessment. Int. J. Speech Technology : Springer, DOI 10.1007/s10772-014-9248-2

Résumé: The aim of a computer assisted language learning (CALL) system is to improve the language skills of learners. Such systems often include, grammar and vocabulary components, while the pronunciation learning seems to be the hardest step in language learning process. Little attention has been paid to this aspect among the required ones in CALL systems. In pronunciation learning context, the learnerwould like to know if its pronunciation is good or bad. In the case where the pronunciation is bad, it will be suitable if some advices are given to him. The goal of this work is an early detection of pupils with reading difficulties and in the issue of decision whether their pronunciation is good or not is our particular interest. For this purpose, we consider the answer to this question as a classification problem and we use a statistical approach to make a decision; this approach allows us to pursue the investigation concerning the pronunciation of every phoneme in the word or in the sentence.

Abderrahamane Kefali, Toufik Sari, Halima Bahi. (2015), Structural feature-based evaluation method of binarization techniques for word retrieval in the deg. International journal of Document Analysis and Recognition (IJDAR) : Springer, http://link.springer.com/article/10.1007/s10032-015-0254-y

Résumé: One of the most important and necessary steps in the process of document analysis and recognition is the binarization, which allows extracting the foreground from the background. Several binarization techniques have been proposed in the literature, but none of themwas reliable for all image types. This makes the selection of one method to apply in a given application very difficult. Thus, performance evaluation of binarization algorithms becomes therefore vital. In this paper, we are interested in the evaluation of binarization techniques for the purpose of retrieving words from the images of degraded Arabic documents. A new evaluation methodology is proposed. The proposed evaluation methodology is based on the comparison of the visual features extracted from the binarized document images with ground truth features instead of comparing images between themselves. The most appropriate thresholding method for each image is the one for which the visual features of the identified words in the image are “closer” to the features of the reference words. The proposed technique was used here to assess the performances of eleven algorithms based on different approaches on a collection of real and synthetic images.

2014

Abderrahamane Kefali, Toufik Sari, Halima Bahi. (2014), Foreground-Background Separation by Feed – forward Neural Networks in Old Manuscripts. Informaticahttps://www.informatica.si/index.php/informatica/article/view/715

Résumé: Artificial Neural Networks (ANNs) are widely used techniques in image processing and pattern recognition. Despite of their power in classification tasks, for pattern recognition, they show limited applicability in the earlier stages such as the foreground-background separation (FBS). In this paper a novel FBS technique based on ANN is applied on old documents with a variety of degradations. The idea is to train the ANN on a set of pairs of original images and their respective ideal black and white ones relying on global and local information. We ran several experiments on benchmark and synthetic data and we obtained better results than state-of-the art methods.

2013

(2013), Type-2 Fuzzy Gaussian Mixture Models for Singing Voice Classification in Commercial ‎Music Productio. International Journal of Signal and Imaging Systems engineering, Vol.6, No.2, pp.111 – 118, 2013. http://www.inderscience.com/info/inarticle.php?artid=53418

Résumé: The paper describes a system of singing voice classification in the commercial music productions. A first step in our system is to separate the singer’s voice from the music. Based on the vocal part, two sets of parameters are formed, one for singing voice type and the other for the singing voice quality. Each set of parameters contains a number of MPEG-7 low-level descriptors and other descriptors; at the classification stage the paper suggests an extension of Gaussian Mixture Models (GMMs), by using the Type-2 FGMMs (Type-2 Fuzzy Gaussian Mixture Models). Results show substantial improvements when compared to similar works.

Livres

2016

Halima BAHI. (2016), De la Logique à Prolog : Office des Publications Universitaires (OPU), https://opu.dz/fr/livre/informatique/de-la-logique-%C3%A0-prolog-cours-et-exercices

Chapitres de livres

2020

Bilal Dendani, Halima Bahi, Toufik Sari. (2020), Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition. El Moataz A., Mammass D., Mansouri A., Nouboud F. (eds) Image and Signal Processing : Springer, https://link.springer.com/chapter/10.1007/978-3-030-51935-3_24

Résumé: Remote applications that deal with speech need the speech signal to be compressed. First, speech coding transforms the continuous waveform into a numerical form. Then, the digitized signal is compressed with or without loss of information. This transformation affects the original waveform and degrades performances for further recognition of the speech signal. Meanwhile, the transmission is another source of speech degradation. To restore the original “clean” speech, speech enhancement (SE) is widely used, and deep learning algorithms are state-of-the-art, nowadays. In this paper, the target application is a remote Arabic speech recognition system, and the aim of using SE is to improve the accuracy of the speech recognizer. For that purpose, a Deep Auto Encoder (DAE) is used. The effect of the DAE-based SE is studied through different configurations, and the performances are evaluated through accuracy. The results showed an improvement of about 3.17 between the accuracy prior to the SE and that computed with the enhanced speech.

2012

(2012), Automatic Speech Recognition Technology for Speech Disorders Evaluation. BookChapter In : Speech, Image and Language Processing for Human Computer Interaction : Multi-modal : IGI Global, http://www.igi-global.com/book/speech-image-language-processing-human/60784#table-of-contents

Résumé: Speech disorders are human disabilities widely present in young population but also adults may suffer from such disorders after some physical problems. In this context, the detection and further the correction of such disabilities may be handled by Automatic Speech Recognition (ASR) technology. The first works on the speech disorders detection began early in the 70s and seem to follow the same evolution as those on the ASR. Indeed, these early works were more based on the signal processing techniques. Progressively, systems dealing with speech disorders incorporate more ideas from ASR technology. Particularly, Hidden Markov Models, the state-of-the-art approaches in ASR systems, are used. This chapter reviews systems that use ASR techniques to evaluate pronunciation of people who suffer from speech or voice impairments. The authors investigate the existing systems and present the main innovation and some of the available resources.

Communications internationales

2001

Halima Bahi, Mokhtar Sellami. (2001), Combination of vector quantization and hidden Markov models for Arabic speech recognition. Proceedings ACS/IEEE International conference on computer systems and applications, Beirut, Libanhttps://dl.acm.org/doi/abs/10.5555/872017.872283

باهي حليمة

Publications internationales

Livres

Chapitres de livres

Communications internationales