Search | arXiv e-print repository

arXiv:2405.09830 [pdf]

Unveiling the Direct Piezoelectric Effect on Piezo-phototronic Coupling in Ferroelectrics: First Principle Study Assisted Experimental Approach

Authors: Koyal Suman Samantaray, Sourabh Kumar, P Maneesha, Dilip Sasmal, Suresh Chandra Baral, B. R. Vaishnavi Krupa, Arup Dasgupta, K Harrabi, A Mekki, Somaditya Sen

Abstract: A new study explores the distinct roles of spontaneous polarization and piezoelectric polarization in piezo-phototronic coupling. This investigation focuses on differences in photocatalytic and piezo-photocatalytic performance using sodium bismuth titanate (NBT), a key ferroelectric material. The research aims to identify which type of polarization has a greater influence on piezo-phototronic effe… ▽ More A new study explores the distinct roles of spontaneous polarization and piezoelectric polarization in piezo-phototronic coupling. This investigation focuses on differences in photocatalytic and piezo-photocatalytic performance using sodium bismuth titanate (NBT), a key ferroelectric material. The research aims to identify which type of polarization has a greater influence on piezo-phototronic effects. A theoretical assessment complements the experimental findings, providing additional insights. This study explores the enhanced piezo-phototronic performance of electrospun nanofibers compared to sol-gel particles under different illumination conditions (11W UV, 250W UV, and natural sunlight). Electrospun nanofibers exhibited a rate constant (k) improvement of 2.5 to 3.75 times, whereas sol-gel particles showed only 1.3 to 1.4 times higher performance when ultrasonication was added to photocatalysis. Analysis using first-principle methods revealed that nanofibers had an elastic modulus (C33) about 2.15 times lower than sol-gel particles, indicating greater flexibility. The elongation of lattice along z-axis in the case of nanofibers reduced the covalency in the Bi-O and Ti-O bonds. These structural differences led to reduced spontaneous polarization and piezoelectric stress coefficients (e31 & e33). Despite having lower piezoelectric stress coefficients, higher flexibility in nanofibers led to a higher piezoelectric strain coefficient, 2.66 and 1.97 times greater than sol-gel particles, respectively. This improved the piezo-phototronic coupling for nanofibers. △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2310.18778 [pdf, other]

ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting

Authors: Abdellah El Mekki, Muhammad Abdul-Mageed, ElMoatez Billah Nagoudi, Ismail Berrada, Ahmed Khoumsi

Abstract: Bilingual Lexicon Induction (BLI), where words are translated between two languages, is an important NLP task. While noticeable progress on BLI in rich resource languages using static word embeddings has been achieved. The word translation performance can be further improved by incorporating information from contextualized word embeddings. In this paper, we introduce ProMap, a novel approach for B… ▽ More Bilingual Lexicon Induction (BLI), where words are translated between two languages, is an important NLP task. While noticeable progress on BLI in rich resource languages using static word embeddings has been achieved. The word translation performance can be further improved by incorporating information from contextualized word embeddings. In this paper, we introduce ProMap, a novel approach for BLI that leverages the power of prompting pretrained multilingual and multidialectal language models to address these challenges. To overcome the employment of subword tokens in these models, ProMap relies on an effective padded prompting of language models with a seed dictionary that achieves good performance when used independently. We also demonstrate the effectiveness of ProMap in re-ranking results from other BLI methods such as with aligned static word embeddings. When evaluated on both rich-resource and low-resource languages, ProMap consistently achieves state-of-the-art results. Furthermore, ProMap enables strong performance in few-shot scenarios (even with less than 10 training examples), making it a valuable tool for low-resource language translation. Overall, we believe our method offers both exciting and promising direction for BLI in general and low-resource languages in particular. ProMap code and data are available at \url{https://github.com/4mekki4/promap}. △ Less

Submitted 28 October, 2023; originally announced October 2023.

Comments: To appear in IJCNLP-AACL 2023

arXiv:2206.08415 [pdf, other]

CS-UM6P at SemEval-2022 Task 6: Transformer-based Models for Intended Sarcasm Detection in English and Arabic

Authors: Abdelkader El Mahdaouy, Abdellah El Mekki, Kabil Essefar, Abderrahman Skiredj, Ismail Berrada

Abstract: Sarcasm is a form of figurative language where the intended meaning of a sentence differs from its literal meaning. This poses a serious challenge to several Natural Language Processing (NLP) applications such as Sentiment Analysis, Opinion Mining, and Author Profiling. In this paper, we present our participating system to the intended sarcasm detection task in English and Arabic languages. Our sy… ▽ More Sarcasm is a form of figurative language where the intended meaning of a sentence differs from its literal meaning. This poses a serious challenge to several Natural Language Processing (NLP) applications such as Sentiment Analysis, Opinion Mining, and Author Profiling. In this paper, we present our participating system to the intended sarcasm detection task in English and Arabic languages. Our system\footnote{The source code of our system is available at \url{https://github.com/AbdelkaderMH/iSarcasmEval}} consists of three deep learning-based models leveraging two existing pre-trained language models for Arabic and English. We have participated in all sub-tasks. Our official submissions achieve the best performance on sub-task A for Arabic language and rank second in sub-task B. For sub-task C, our system is ranked 7th and 11th on Arabic and English datasets, respectively. △ Less

Submitted 16 June, 2022; originally announced June 2022.

arXiv:2206.08407 [pdf, other]

Deep Multi-Task Models for Misogyny Identification and Categorization on Arabic Social Media

Authors: Abdelkader El Mahdaouy, Abdellah El Mekki, Ahmed Oumar, Hajar Mousannif, Ismail Berrada

Abstract: The prevalence of toxic content on social media platforms, such as hate speech, offensive language, and misogyny, presents serious challenges to our interconnected society. These challenging issues have attracted widespread attention in Natural Language Processing (NLP) community. In this paper, we present the submitted systems to the first Arabic Misogyny Identification shared task. We investigat… ▽ More The prevalence of toxic content on social media platforms, such as hate speech, offensive language, and misogyny, presents serious challenges to our interconnected society. These challenging issues have attracted widespread attention in Natural Language Processing (NLP) community. In this paper, we present the submitted systems to the first Arabic Misogyny Identification shared task. We investigate three multi-task learning models as well as their single-task counterparts. In order to encode the input text, our models rely on the pre-trained MARBERT language model. The overall obtained results show that all our submitted models have achieved the best performances (top three ranked submissions) in both misogyny identification and categorization tasks. △ Less

Submitted 16 June, 2022; originally announced June 2022.

arXiv:2205.07340 [pdf]

Room Temperature Magneto-dielectric coupling in the CaMnO3 modified NBT lead-free ceramics

Authors: Koyal Suman Samantaray, Ruhul Amin, Saniya Ayaz, A. K. Pathak, Christopher Hanley, A. Mekki, K. Harrabi, Somaditya Sen

Abstract: The sol-gel prepared (1-x) Na0.5Bi0.5TiO3- (x) CaMnO3 (x=0, 0.03, 0.06, 0.12) compositions show a Rhombohedral (R3c) phase for x=0.06 while a mixed Rhombohedral (R3c) and orthorhombic (Pnma) phases for the x=0.12. The lattice volume consistently decreased with an increase in the CaMnO3 content. The phase transition temperature (Tc) decreased with an increase in the CaMnO3 compositions. The room te… ▽ More The sol-gel prepared (1-x) Na0.5Bi0.5TiO3- (x) CaMnO3 (x=0, 0.03, 0.06, 0.12) compositions show a Rhombohedral (R3c) phase for x=0.06 while a mixed Rhombohedral (R3c) and orthorhombic (Pnma) phases for the x=0.12. The lattice volume consistently decreased with an increase in the CaMnO3 content. The phase transition temperature (Tc) decreased with an increase in the CaMnO3 compositions. The room temperature dielectric constant increased, and loss decreased for the x=0.03 composition due to a decrease in the oxygen vacancy and Bi loss confirmed by the valence state study (XPS). All the compositions show a variation of the room temperature dielectric property with an application of magnetic field confirming a magnetodielectric coupling. The x=0.06 composition shows the highest negative magnetodielectric constant (MD%) of 3.69 at 100kHz at an applied field of 5 kG. △ Less

Submitted 15 May, 2022; originally announced May 2022.

arXiv:2204.13515 [pdf, other]

UM6P-CS at SemEval-2022 Task 11: Enhancing Multilingual and Code-Mixed Complex Named Entity Recognition via Pseudo Labels using Multilingual Transformer

Authors: Abdellah El Mekki, Abdelkader El Mahdaouy, Mohammed Akallouch, Ismail Berrada, Ahmed Khoumsi

Abstract: Building real-world complex Named Entity Recognition (NER) systems is a challenging task. This is due to the complexity and ambiguity of named entities that appear in various contexts such as short input sentences, emerging entities, and complex entities. Besides, real-world queries are mostly malformed, as they can be code-mixed or multilingual, among other scenarios. In this paper, we introduce… ▽ More Building real-world complex Named Entity Recognition (NER) systems is a challenging task. This is due to the complexity and ambiguity of named entities that appear in various contexts such as short input sentences, emerging entities, and complex entities. Besides, real-world queries are mostly malformed, as they can be code-mixed or multilingual, among other scenarios. In this paper, we introduce our submitted system to the Multilingual Complex Named Entity Recognition (MultiCoNER) shared task. We approach the complex NER for multilingual and code-mixed queries, by relying on the contextualized representation provided by the multilingual Transformer XLM-RoBERTa. In addition to the CRF-based token classification layer, we incorporate a span classification loss to recognize named entities spans. Furthermore, we use a self-training mechanism to generate weakly-annotated data from a large unlabeled dataset. Our proposed system is ranked 6th and 8th in the multilingual and code-mixed MultiCoNER's tracks respectively. △ Less

Submitted 28 April, 2022; originally announced April 2022.

arXiv:2110.04262 [pdf]

Defect Dipole Induced Improved Electrocaloric Effect in Modified NBT-6BT Lead-Free Ceramics

Authors: Koyal Suman Samantaray, Ruhul Amin, E. G Rini, Indranil Bhaumik, A. Mekki, K. Harrabi, Somaditya Sen

Abstract: The Rietveld refinement of the polycrystalline powders of 1% Fe and Mn-doped (Na0.5Bi0.5)0.94Ba0.06Ti0.98V0.02O3 at the Ti-site confirmed a single rhombohedral (R3c) phase. The bandgap, (Eg) was affected by the anti-phase octahedral tilt angle and the spin-orbit splitting energy of Ti4+2p3/2 and Ti4+2p1/2 states. The decrease in Bi loss and increase in the binding energy of Ba due to Fe/Mn do**… ▽ More The Rietveld refinement of the polycrystalline powders of 1% Fe and Mn-doped (Na0.5Bi0.5)0.94Ba0.06Ti0.98V0.02O3 at the Ti-site confirmed a single rhombohedral (R3c) phase. The bandgap, (Eg) was affected by the anti-phase octahedral tilt angle and the spin-orbit splitting energy of Ti4+2p3/2 and Ti4+2p1/2 states. The decrease in Bi loss and increase in the binding energy of Ba due to Fe/Mn do** has been correlated to the strengthening of Bi-O and Ba-O bonds which was revealed from the XPS studies thereby further related to the average A-O bond length from structural studies. Hence, a reduction of oxygen vacancy (VO) for the doped samples has been justified. A significant improvement of the dielectric constant, relaxation time (τ0), and the decrease in conductivity due to do** was revealed from the frequency-dependent (10Hz-1MHz) dielectric measurement study. The conduction and relaxation process is dominated by the short-range movement of defects. The activation energy (Ea ~1eV) revealed that there is a presence of double-ionized VOs. The ECE study showed a significant enhancement of the changes in entropy, and the adiabatic temperature difference due to do**, with the change in tempearture being highest in the Fe-doped sample. Such improvement of dielectric and ECE properties was confirmed due to the reduction of the mobility of oxygen vacancy because of the formation defect dipoles. △ Less

Submitted 8 October, 2021; originally announced October 2021.

arXiv:2106.12495 [pdf, other]

BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification

Authors: Abdellah El Mekki, Abdelkader El Mahdaouy, Kabil Essefar, Nabil El Mamoun, Ismail Berrada, Ahmed Khoumsi

Abstract: Dialect and standard language identification are crucial tasks for many Arabic natural language processing applications. In this paper, we present our deep learning-based system, submitted to the second NADI shared task for country-level and province-level identification of Modern Standard Arabic (MSA) and Dialectal Arabic (DA). The system is based on an end-to-end deep Multi-Task Learning (MTL) m… ▽ More Dialect and standard language identification are crucial tasks for many Arabic natural language processing applications. In this paper, we present our deep learning-based system, submitted to the second NADI shared task for country-level and province-level identification of Modern Standard Arabic (MSA) and Dialectal Arabic (DA). The system is based on an end-to-end deep Multi-Task Learning (MTL) model to tackle both country-level and province-level MSA/DA identification. The latter MTL model consists of a shared Bidirectional Encoder Representation Transformers (BERT) encoder, two task-specific attention layers, and two classifiers. Our key idea is to leverage both the task-discriminative and the inter-task shared features for country and province MSA/DA identification. The obtained results show that our MTL model outperforms single-task models on most subtasks. △ Less

Submitted 23 June, 2021; originally announced June 2021.

arXiv:2106.12488 [pdf, other]

Deep Multi-Task Model for Sarcasm Detection and Sentiment Analysis in Arabic Language

Authors: Abdelkader El Mahdaouy, Abdellah El Mekki, Kabil Essefar, Nabil El Mamoun, Ismail Berrada, Ahmed Khoumsi

Abstract: The prominence of figurative language devices, such as sarcasm and irony, poses serious challenges for Arabic Sentiment Analysis (SA). While previous research works tackle SA and sarcasm detection separately, this paper introduces an end-to-end deep Multi-Task Learning (MTL) model, allowing knowledge interaction between the two tasks. Our MTL model's architecture consists of a Bidirectional Encode… ▽ More The prominence of figurative language devices, such as sarcasm and irony, poses serious challenges for Arabic Sentiment Analysis (SA). While previous research works tackle SA and sarcasm detection separately, this paper introduces an end-to-end deep Multi-Task Learning (MTL) model, allowing knowledge interaction between the two tasks. Our MTL model's architecture consists of a Bidirectional Encoder Representation from Transformers (BERT) model, a multi-task attention interaction module, and two task classifiers. The overall obtained results show that our proposed model outperforms its single-task counterparts on both SA and sarcasm detection sub-tasks. △ Less

Submitted 23 June, 2021; originally announced June 2021.

arXiv:2102.11000 [pdf, other]

An open access NLP dataset for Arabic dialects : Data collection, labeling, and model construction

Authors: ElMehdi Boujou, Hamza Chataoui, Abdellah El Mekki, Saad Benjelloun, Ikram Chairi, Ismail Berrada

Abstract: Natural Language Processing (NLP) is today a very active field of research and innovation. Many applications need however big sets of data for supervised learning, suitably labelled for the training purpose. This includes applications for the Arabic language and its national dialects. However, such open access labeled data sets in Arabic and its dialects are lacking in the Data Science ecosystem a… ▽ More Natural Language Processing (NLP) is today a very active field of research and innovation. Many applications need however big sets of data for supervised learning, suitably labelled for the training purpose. This includes applications for the Arabic language and its national dialects. However, such open access labeled data sets in Arabic and its dialects are lacking in the Data Science ecosystem and this lack can be a burden to innovation and research in this field. In this work, we present an open data set of social data content in several Arabic dialects. This data was collected from the Twitter social network and consists on +50K twits in five (5) national dialects. Furthermore, this data was labeled for several applications, namely dialect detection, topic detection and sentiment analysis. We publish this data as an open access data to encourage innovation and encourage other works in the field of NLP for Arabic dialects and social media. A selection of models were built using this data set and are presented in this paper along with their performances. △ Less

Submitted 6 February, 2021; originally announced February 2021.

arXiv:2012.13824 [pdf]

Pbnm to R3-c phase transformation in (1-x)LaFeO3.xLaMnO3 solid solution due to modifications in structure, octahedral tilt and valence states of Fe-Mn

Authors: E. G. Rini, Mayanak. K. Gupta, R. Mittal, A. Mekki, Mohammed H. Al Saeed, Somaditya Sen

Abstract: A theoretically supported experimental study of the (1-x)LaFeO3.xLaMnO3 (LFO-LMO) solid solution is being reported for the first time which reveals a phase transformation from the Pbnm and R3-c phase at a chemical composition of x=0.625. Correlation of octahedral distortion and phase transition was extensively investigated using x-ray photoelectron spectroscopy (XPS), Raman and x-ray diffraction (… ▽ More A theoretically supported experimental study of the (1-x)LaFeO3.xLaMnO3 (LFO-LMO) solid solution is being reported for the first time which reveals a phase transformation from the Pbnm and R3-c phase at a chemical composition of x=0.625. Correlation of octahedral distortion and phase transition was extensively investigated using x-ray photoelectron spectroscopy (XPS), Raman and x-ray diffraction (XRD) measurements and density functional theory (DFT) calculation. A detailed study of the structural lattice parameters, bond lengths, bond angles have been done, supported by valence state and electronic properties studies. All the above parameters show a correlated modification to the phase transition. The distortion and tilting of the BO6 octahedra has been studied as a function of different Fe:Mn content and expressed by Glazer representation from the refined Crystallographic Information Files (CIF). The angle of tilting from the central non-tilted position also shows a correlated modification with the phase transformation. The valence state and size of cations influences the octahedral tilting. Octahedral volume is reduced as the entire perovskite structure is relatively flattened with increasing Mn-content implying a flattening of both the BO6 octahedra and the La8O6 cage. The vibrational properties were studied experimentally and supported by DFT phonon calculations, detailing the displacement pattern (eigen vectors) revealing considerable insight into the lattice dynamics of the compounds. The optoelectronic modifications in the band properties were studied experimentally and supported with theory. Hence, this manuscript is a in-depth analysis of the structure correlated phase transition of the LFO-LMO solid solution. △ Less

Submitted 26 December, 2020; originally announced December 2020.

arXiv:1003.3352 [pdf, other]

doi 10.1051/m2an/2014001

Error estimates for Stokes problem with Tresca friction condition

Authors: Ayadi Mekki, Gdoura Mohamed Khaled, Sassi Taoufik

Abstract: In this work we propose and study a three field mixed formulation for solving the Stokes problem with Tresca-type non-linear boundary conditions. Two Lagrange multipliers are used to enforce div(u)=0 constraint and to regularize the energy functional. The resulting problem is discretised using "P1 bubble/P1-P1" finite elements. Error estimates are derived and several numerical studies are achieved… ▽ More In this work we propose and study a three field mixed formulation for solving the Stokes problem with Tresca-type non-linear boundary conditions. Two Lagrange multipliers are used to enforce div(u)=0 constraint and to regularize the energy functional. The resulting problem is discretised using "P1 bubble/P1-P1" finite elements. Error estimates are derived and several numerical studies are achieved. △ Less

Submitted 17 March, 2010; originally announced March 2010.

Report number: Lab. Math. Nicolas Oresme: 2010 - 4. MSC Class: 35; 65; 76

Journal ref: ESAIM: M2AN 48 (2014) 1413-1429

arXiv:cs/0609134 [pdf]

Using NLP to build the hypertextuel network of a back-of-the-book index

Authors: Touria Aït El Mekki, Adeline Nazarenko

Abstract: Relying on the idea that back-of-the-book indexes are traditional devices for navigation through large documents, we have developed a method to build a hypertextual network that helps the navigation in a document. Building such an hypertextual network requires selecting a list of descriptors, identifying the relevant text segments to associate with each descriptor and finally ranking the descrip… ▽ More Relying on the idea that back-of-the-book indexes are traditional devices for navigation through large documents, we have developed a method to build a hypertextual network that helps the navigation in a document. Building such an hypertextual network requires selecting a list of descriptors, identifying the relevant text segments to associate with each descriptor and finally ranking the descriptors and reference segments by relevance order. We propose a specific document segmentation method and a relevance measure for information ranking. The algorithms are tested on 4 corpora (of different types and domains) without human intervention or any semantic knowledge. △ Less

Submitted 24 September, 2006; originally announced September 2006.

ACM Class: H.3.1

Journal ref: Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP) (2005) 316-320

arXiv:cs/0609133 [pdf]

An application-oriented terminology evaluation: the case of back-of-the book indexes

Authors: Touria Aït El Mekki, Adeline Nazarenko

Abstract: This paper addresses the problem of computational terminology evaluation not per se but in a specific application context. This paper describes the evaluation procedure that has been used to assess the validity of our overall indexing approach and the quality of the IndDoc indexing tool. Even if user-oriented extended evaluation is irreplaceable, we argue that early evaluations are possible and… ▽ More This paper addresses the problem of computational terminology evaluation not per se but in a specific application context. This paper describes the evaluation procedure that has been used to assess the validity of our overall indexing approach and the quality of the IndDoc indexing tool. Even if user-oriented extended evaluation is irreplaceable, we argue that early evaluations are possible and they are useful for development guidance. △ Less

Submitted 24 September, 2006; originally announced September 2006.

Comments: 4 pages

ACM Class: H.3.1

Journal ref: Workshop on Terminology design: quality criteria and evaluation methods (TermEval), Italie (2006) 18-21

Showing 1–14 of 14 results for author: Mekki, A