-
On Measuring Context Utilization in Document-Level MT Systems
Authors:
Wafaa Mohammed,
Vlad Niculae
Abstract:
Document-level translation models are usually evaluated using general metrics such as BLEU, which are not informative about the benefits of context. Current work on context-aware evaluation, such as contrastive methods, only measures translation accuracy on words that need context for disambiguation. Such measures cannot reveal whether the translation model uses the correct supporting context. We propose to complement accuracy-based evaluation with measures of context utilization. We find that perturbation-based analysis (comparing models' performance when provided with correct versus random context) is an effective measure of overall context utilization. For a finer-grained phenomenon-specific evaluation, we propose to measure how much the supporting context contributes to handling context-dependent discourse phenomena. We show that automatically-annotated supporting context gives similar conclusions to human-annotated context and can be used as an alternative for cases where human annotations are not available. Finally, we highlight the importance of using discourse-rich datasets when assessing context utilization.
Submitted 2 February, 2024;
originally announced February 2024.
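The perturbation-based analysis mentioned in the abstract above (performance with correct versus random context) could be sketched roughly as follows. This is only an illustration under assumptions, not the authors' implementation: `perturbation_gap`, the `Example` tuple layout, and the `score` callback are hypothetical names, and a real setup would plug in an actual translation model and metric.

```python
import random
from typing import Callable, Sequence, Tuple

# Hypothetical layout: (context, source sentence, reference translation)
Example = Tuple[str, str, str]


def perturbation_gap(
    examples: Sequence[Example],
    score: Callable[[str, str, str], float],
    seed: int = 0,
) -> float:
    """Mean score with the correct context minus mean score with a random one.

    `score(context, source, reference)` is an assumed callback that translates
    `source` given `context` and returns a quality score against `reference`.
    """
    if len(examples) < 2:
        raise ValueError("need at least two examples to swap contexts")
    rng = random.Random(seed)
    contexts = [ctx for ctx, _, _ in examples]
    correct_total = 0.0
    random_total = 0.0
    for i, (ctx, src, ref) in enumerate(examples):
        # Draw the perturbed context from a different example.
        j = rng.choice([k for k in range(len(examples)) if k != i])
        correct_total += score(ctx, src, ref)
        random_total += score(contexts[j], src, ref)
    n = len(examples)
    return correct_total / n - random_total / n
```

A gap near zero would suggest the model's output barely depends on the context it is given, while a clearly positive gap would suggest the correct context is being used.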
-
Buffalo Genome Projects: Current Situation and Future Perspective in Improving Breeding Programs
Authors:
Ahmed M. Mousbah,
Hesham M. Abdullah,
Waleed S. Mohammed,
Ali M. El-Refy,
Mohamed Helmy
Abstract:
Buffaloes are farm animals that contribute to food security by providing high quality meat and milk. They can better tolerate the adverse effects of global climate change on their meat and milk production. Despite their advantages, buffaloes are heavily neglected animals with fewer studies compared to other farm animals; hence, the real potential of buffaloes has never been realized. The complete genome sequencing projects of buffaloes are essential to better understanding buffalo biology and production, since they allow scientists to identify important genes and understand how the gene networks interact to determine the critical features of buffaloes. The genome projects are also valuable for gaining better knowledge of growth, development, maintenance, and determining factors associated with increased meat and milk production. Furthermore, having access to a complete genome of high quality and comprehensive annotations provides a powerful tool in breeding programs. The current review surveyed the publicly available buffalo genome projects and studied the impact of incorporating genomic selection into the buffalo breeding program. Our survey of the publicly available buffalo genome projects showed the promise of genomic selection in developing water buffalo science and technology for food security on a global scale.
Submitted 11 April, 2023;
originally announced April 2023.
-
Visual Grounding of Inter-lingual Word-Embeddings
Authors:
Wafaa Mohammed,
Hassan Shahmohammadi,
Hendrik P. A. Lensch,
R. Harald Baayen
Abstract:
Visual grounding of language aims at enriching textual representations of language with multiple sources of visual knowledge such as images and videos. Although visual grounding is an area of intense research, inter-lingual aspects of visual grounding have not received much attention. The present study investigates the inter-lingual visual grounding of word embeddings. We propose an implicit alignment technique between the two spaces of vision and language in which inter-lingual textual information interacts in order to enrich pre-trained textual word embeddings. We focus on three languages in our experiments, namely, English, Arabic, and German. We obtained visually grounded vector representations for these languages and studied whether visual grounding on one or multiple languages improved the performance of embeddings on word similarity and categorization benchmarks. Our experiments suggest that inter-lingual knowledge improves the performance of grounded embeddings in similar languages such as German and English. However, inter-lingual grounding of German or English with Arabic led to a slight degradation in performance on word similarity benchmarks. On the other hand, we observed an opposite trend on categorization benchmarks where Arabic had the most improvement on English. In the discussion section, several reasons for those findings are laid out. We hope that our experiments provide a baseline for further research on inter-lingual visual grounding.
Submitted 21 November, 2022; v1 submitted 8 September, 2022;
originally announced September 2022.
-
Masader Plus: A New Interface for Exploring +500 Arabic NLP Datasets
Authors:
Yousef Altaher,
Ali Fadel,
Mazen Alotaibi,
Mazen Alyazidi,
Mishari Al-Mutairi,
Mutlaq Aldhbuiub,
Abdulrahman Mosaibah,
Abdelrahman Rezk,
Abdulrazzaq Alhendi,
Mazen Abo Shal,
Emad A. Alghamdi,
Maged S. Alshaibani,
Jezia Zakraoui,
Wafaa Mohammed,
Kamel Gaanoun,
Khalid N. Elmadani,
Mustafa Ghaleb,
Nouamane Tazi,
Raed Alharbi,
Maraim Masoud,
Zaid Alyafeai
Abstract:
Masader (Alyafeai et al., 2021) created a metadata structure to be used for cataloguing Arabic NLP datasets. However, developing an easy way to explore such a catalogue is a challenging task. In order to give the optimal experience for users and researchers exploring the catalogue, several design and user experience challenges must be resolved. Furthermore, user interactions with the website may provide an easy approach to improve the catalogue. In this paper, we introduce Masader Plus, a web interface for users to browse Masader. We demonstrate data exploration, filtration, and a simple API that allows users to examine datasets from the backend. Masader Plus can be explored using this link: https://arbml.github.io/masader. A video recording explaining the interface can be found here: https://www.youtube.com/watch?v=SEtdlSeqchk.
Submitted 1 August, 2022;
originally announced August 2022.
-
Fast Diffusion Limit for Reaction-Diffusion Systems with Stochastic Neumann Boundary Conditions
Authors:
Wael W. Mohammed,
Dirk Blömker
Abstract:
We consider a class of reaction-diffusion equations with a stochastic perturbation on the boundary. We show that in the limit of fast diffusion, one can rigorously approximate solutions of the system of PDEs with stochastic Neumann boundary conditions by the solution of a suitable stochastic/deterministic differential equation for the average concentration that involves reactions only. An interesting effect occurs if the noise on the boundary does not change the averaging concentration but is sufficiently large: surprising additional effective reaction terms then appear.
We focus on systems with polynomial nonlinearities only and give applications to the two dimensional nonlinear heat equation and the cubic auto-catalytic reaction between two chemicals.
Submitted 11 August, 2014;
originally announced August 2014.
-
Implementation of Tic-Tac-Toe Game in LabVIEW
Authors:
Lalitha Saroja Thota,
Manal Elsayeed,
Naseema Shaik,
Tayf Abdullah Ghawa,
Ahlam Rashed,
Mona Refdan,
Wejdan Mohammed,
Rawan Ali,
Suresh Babu Changalasetty
Abstract:
The Tic-Tac-Toe game is played by two players, where each cell of the square block (3 x 3) can be filled with a cross (X) or a circle (O). The game toggles between the players, giving each player the chance to mark their move. When one of the players makes a combination of three identical markers in a horizontal, vertical or diagonal line, the program displays which player has won, whether X or O. In this paper, we implement a 3x3 tic-tac-toe game in LabVIEW. The game is designed so that two players can play tic-tac-toe using LabVIEW software. The program contains a display function and a select function to place the symbol as well as toggle between the symbols, allowing each player a turn to play the game. The program updates after each player makes their move and checks the game's conditions as it goes on. Overall, the program works without any bugs and is able to use
Submitted 19 June, 2014;
originally announced June 2014.
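The win condition and turn toggling described in the abstract above (three identical markers in a horizontal, vertical or diagonal line) can be illustrated with a minimal sketch. The paper's implementation is in LabVIEW; the Python below is only an assumed textual equivalent, and the names `winner` and `next_player` as well as the board representation are hypothetical.

```python
from typing import List, Optional

# Assumed representation: 3x3 grid holding "X", "O", or "" for an empty cell.
Board = List[List[str]]


def winner(board: Board) -> Optional[str]:
    """Return "X" or "O" if that marker fills a row, column, or diagonal."""
    lines = []
    lines.extend(board)  # three rows
    lines.extend([[board[r][c] for r in range(3)] for c in range(3)])  # columns
    lines.append([board[i][i] for i in range(3)])  # main diagonal
    lines.append([board[i][2 - i] for i in range(3)])  # anti-diagonal
    for line in lines:
        # A line wins only if it is non-empty and all three cells match.
        if line[0] and line[0] == line[1] == line[2]:
            return line[0]
    return None


def next_player(current: str) -> str:
    """Toggle the turn between the two markers."""
    return "O" if current == "X" else "X"
```

Calling `winner` after every move mirrors the abstract's description of the program updating and checking the game conditions after each turn.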