Search | arXiv e-print repository

Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction

Authors: Manuel Mager, Rajat Bhatnagar, Graham Neubig, Ngoc Thang Vu, Katharina Kann

Abstract: Neural models have drastically advanced state of the art for machine translation (MT) between high-resource languages. Traditionally, these models rely on large amounts of training data, but many language pairs lack these resources. However, an important part of the languages in the world do not have this amount of data. Most languages from the Americas are among them, having a limited amount of p… ▽ More Neural models have drastically advanced state of the art for machine translation (MT) between high-resource languages. Traditionally, these models rely on large amounts of training data, but many language pairs lack these resources. However, an important part of the languages in the world do not have this amount of data. Most languages from the Americas are among them, having a limited amount of parallel and monolingual data, if any. Here, we present an introduction to the interested reader to the basic challenges, concepts, and techniques that involve the creation of MT systems for these languages. Finally, we discuss the recent advances and findings and open questions, product of an increased interest of the NLP community in these languages. △ Less

Submitted 11 June, 2023; originally announced June 2023.

Comments: Accepted to AmericasNLP 2023

arXiv:2204.11788 [pdf, other]

Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation

Authors: Vivian Lai, Samuel Carton, Rajat Bhatnagar, Q. Vera Liao, Yunfeng Zhang, Chenhao Tan

Abstract: Despite impressive performance in many benchmark datasets, AI models can still make mistakes, especially among out-of-distribution examples. It remains an open question how such imperfect models can be used effectively in collaboration with humans. Prior work has focused on AI assistance that helps people make individual high-stakes decisions, which is not scalable for a large amount of relatively… ▽ More Despite impressive performance in many benchmark datasets, AI models can still make mistakes, especially among out-of-distribution examples. It remains an open question how such imperfect models can be used effectively in collaboration with humans. Prior work has focused on AI assistance that helps people make individual high-stakes decisions, which is not scalable for a large amount of relatively low-stakes decisions, e.g., moderating social media comments. Instead, we propose conditional delegation as an alternative paradigm for human-AI collaboration where humans create rules to indicate trustworthy regions of a model. Using content moderation as a testbed, we develop novel interfaces to assist humans in creating conditional delegation rules and conduct a randomized experiment with two datasets to simulate in-distribution and out-of-distribution scenarios. Our study demonstrates the promise of conditional delegation in improving model performance and provides insights into design for this novel paradigm, including the effect of AI explanations. △ Less

Submitted 25 April, 2022; originally announced April 2022.

Comments: 18 pages, 44 figures

arXiv:2106.06875 [pdf, other]

Don't Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data

Authors: Rajat Bhatnagar, Ananya Ganesh, Katharina Kann

Abstract: High-performing machine translation (MT) systems can help overcome language barriers while making it possible for everyone to communicate and use language technologies in the language of their choice. However, such systems require large amounts of parallel sentences for training, and translators can be difficult to find and expensive. Here, we present a data collection strategy for MT which, in co… ▽ More High-performing machine translation (MT) systems can help overcome language barriers while making it possible for everyone to communicate and use language technologies in the language of their choice. However, such systems require large amounts of parallel sentences for training, and translators can be difficult to find and expensive. Here, we present a data collection strategy for MT which, in contrast, is cheap and simple, as it does not require bilingual speakers. Based on the insight that humans pay specific attention to movements, we use graphics interchange formats (GIFs) as a pivot to collect parallel sentences from monolingual annotators. We use our strategy to collect data in Hindi, Tamil and English. As a baseline, we also collect data using images as a pivot. We perform an intrinsic evaluation by manually evaluating a subset of the sentence pairs and an extrinsic evaluation by finetuning mBART on the collected data. We find that sentences collected via GIFs are indeed of higher quality. △ Less

Submitted 12 June, 2021; originally announced June 2021.

Comments: 5 pages, 1 figure, ACL-IJCNLP 2021 submission, Natural Language Processing, Data Collection, Monolingual Speakers, Machine Translation, GIFs, Images

ACM Class: I.2.7

arXiv:1409.1980 [pdf, ps, other]

On the Sum of Correlated Squared $κ-μ$ Shadowed Random Variables and its Application to Performance Analysis of MRC

Authors: Manav R. Bhatnagar

Abstract: In this paper, we study the statistical characterization of the sum of the squared $κ-μ$ shadowed random variables with correlated shadowing components. The probability density function (PDF) of this sum is obtained in the form of a power series. The derived PDF is utilized for obtaining the performance results of the maximal ratio combining (MRC) scheme over correlated $κ-μ$ shadowed fading chann… ▽ More In this paper, we study the statistical characterization of the sum of the squared $κ-μ$ shadowed random variables with correlated shadowing components. The probability density function (PDF) of this sum is obtained in the form of a power series. The derived PDF is utilized for obtaining the performance results of the maximal ratio combining (MRC) scheme over correlated $κ-μ$ shadowed fading channels. First, we derive the moment generating function (MGF) of the received signal-to-noise ratio of the MRC receiver. By using the derived MGF expression, the analytical diversity order is obtained; it is deduced on the basis of this analysis that the diversity of the MRC receiver over correlated $κ-μ$ shadowed channels depends upon the number of diversity branches and $μ$ parameter. Further, the analytical average bit error rate of the MRC scheme is also derived, which is applicable for $M$-PSK and $M$-QAM constellations. The Shannon capacity of the correlated $κ-μ$ shadowed channels is also derived in the form of the Meijer-G function. △ Less

Submitted 5 September, 2014; originally announced September 2014.

Comments: IEEE Transactions on Vehicular Technology

arXiv:1407.1166 [pdf, ps, other]

On the Capacity of CSI Based Transmission Link Selection in Decode-and-Forward Cooperative System

Authors: Manav R. Bhatnagar

Abstract: In this paper, we study the problem of best transmission link selection in a decode-and-forward (DF) cooperative system from capacity point of view. The transmission link can be a cooperative (via a relay) or direct link between the source and destination nodes. In a two-hop DF system with multiple relays and a direct link in between the source and destination, the transmission link selection can… ▽ More In this paper, we study the problem of best transmission link selection in a decode-and-forward (DF) cooperative system from capacity point of view. The transmission link can be a cooperative (via a relay) or direct link between the source and destination nodes. In a two-hop DF system with multiple relays and a direct link in between the source and destination, the transmission link selection can be performed based on full or partial channel state information (CSI) of all links involved in cooperation. We derive analytical ergodic capacity of full and partial CSI based path selection schemes in the DF cooperative system. Further, the full and partial CSI based link selection schemes are compared with help of these expressions. △ Less

Submitted 4 July, 2014; originally announced July 2014.

arXiv:1407.1112 [pdf, ps, other]

On the Capacity of Decode-and-Forward Relaying over Rician Fading Channels

Authors: Manav R. Bhatnagar

Abstract: In this letter, we derive the probability density function (PDF) and cumulative distribution function (CDF) of the minimum of two non-central Chi-square random variables with two degrees of freedom in terms of power series. With the help of the derived PDF and CDF, we obtain the exact ergodic capacity of the following adaptive protocols in a decode-and-forward (DF) cooperative system over dissimil… ▽ More In this letter, we derive the probability density function (PDF) and cumulative distribution function (CDF) of the minimum of two non-central Chi-square random variables with two degrees of freedom in terms of power series. With the help of the derived PDF and CDF, we obtain the exact ergodic capacity of the following adaptive protocols in a decode-and-forward (DF) cooperative system over dissimilar Rician fading channels: (i) constant power with optimal rate adaptation; (ii) optimal simultaneous power and rate adaptation; (iii) channel inversion with fixed rate. By using the analytical expressions of the capacity, it is observed that the optimal power and rate adaptation provides better capacity than the optimal rate adaptation with constant power from low to moderate signal-to-noise ratio values over dissimilar Rician fading channels. Despite low complexity, the channel inversion based adaptive transmission is shown to suffer from significant loss in capacity as compared to the other adaptive transmission based techniques over DF Rician channels. △ Less

Submitted 3 July, 2014; originally announced July 2014.

arXiv:1407.1106 [pdf, ps, other]

Performance Analysis of Two-Way AF MIMO Relaying of OSTBCs with Imperfect Channel Gains

Authors: Arti M. K., Manav R. Bhatnagar

Abstract: In this paper, we consider the relaying of orthogonal space time block codes (OSTBCs) in a two-way amplify-and-forward (AF) multiple-input multiple-output (MIMO) relay system with estimated channel state information (CSI). A simple four phase protocol is used for training and OSTBC data transmission. Decoding of OSTBC data at a user terminal is performed by replacing the exact CSI by the estimated… ▽ More In this paper, we consider the relaying of orthogonal space time block codes (OSTBCs) in a two-way amplify-and-forward (AF) multiple-input multiple-output (MIMO) relay system with estimated channel state information (CSI). A simple four phase protocol is used for training and OSTBC data transmission. Decoding of OSTBC data at a user terminal is performed by replacing the exact CSI by the estimated CSI, in a maximum likelihood decoder. Tight approximations for the moment generating function (m.g.f.) of the received signal-to-noise ratio at a user is derived under Rayleigh fading by ignoring the higher order noise terms. Analytical average error performance of the considered cooperative scheme is derived by using the m.g.f. expression. Moreover, the analytical diversity order of the considered scheme is also obtained for certain system configurations. It is shown by simulations and analysis that the channel estimation does not affect the diversity order of the OSTBC based two-way AF MIMO relay system. △ Less

Submitted 3 July, 2014; originally announced July 2014.

arXiv:1302.6789 [pdf]

Exploratory Model Building

Authors: Raj Bhatnagar

Abstract: Some instances of creative thinking require an agent to build and test hypothetical theories. Such a reasoner needs to explore the space of not only those situations that have occurred in the past, but also those that are rationally conceivable. In this paper we present a formalism for exploring the space of conceivable situation-models for those domains in which the knowledge is primarily proba… ▽ More Some instances of creative thinking require an agent to build and test hypothetical theories. Such a reasoner needs to explore the space of not only those situations that have occurred in the past, but also those that are rationally conceivable. In this paper we present a formalism for exploring the space of conceivable situation-models for those domains in which the knowledge is primarily probabilistic in nature. The formalism seeks to construct consistent, minimal, and desirable situation-descriptions by selecting suitable domain-attributes and dependency relationships from the available domain knowledge. △ Less

Submitted 27 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

Report number: UAI-P-1994-PG-77-85

arXiv:1205.0326 [pdf, ps, other]

doi 10.1109/LPT.2011.2176330

Performance Analysis of Decode-and-Forward Relaying in Gamma-Gamma Fading Channels

Authors: Manav R. Bhatnagar

Abstract: Decode-and-forward (DF) cooperative communication based on free space optical (FSO) links is studied in this letter. We analyze performance of the DF protocol in the FSO links following the Gamma-Gamma distribution. The cumulative distribution function (CDF) and probability density function (PDF) of a random variable containing mixture of the Gamma- Gamma and Gaussian random variables is derived.… ▽ More Decode-and-forward (DF) cooperative communication based on free space optical (FSO) links is studied in this letter. We analyze performance of the DF protocol in the FSO links following the Gamma-Gamma distribution. The cumulative distribution function (CDF) and probability density function (PDF) of a random variable containing mixture of the Gamma- Gamma and Gaussian random variables is derived. By using the derived CDF and PDF, average bit error rate of the DF relaying is obtained. △ Less

Submitted 2 May, 2012; originally announced May 2012.

Comments: 3 pages, 1 figure, journal

Journal ref: IEEE Photonics Technology Letters, volume 24, number 7, pages 545-547, April 2012

arXiv:1204.6396 [pdf]

Comparing Soft Computing Techniques For Early Stage Software Development Effort Estimations

Authors: Roheet Bhatnagar, Mrinal Kanti Ghose

Abstract: Accurately estimating the software size, cost, effort and schedule is probably the biggest challenge facing software developers today. It has major implications for the management of software development because both the overestimates and underestimates have direct impact for causing damage to software companies. Lot of models have been proposed over the years by various researchers for carrying o… ▽ More Accurately estimating the software size, cost, effort and schedule is probably the biggest challenge facing software developers today. It has major implications for the management of software development because both the overestimates and underestimates have direct impact for causing damage to software companies. Lot of models have been proposed over the years by various researchers for carrying out effort estimations. Also some of the studies for early stage effort estimations suggest the importance of early estimations. New paradigms offer alternatives to estimate the software development effort, in particular the Computational Intelligence (CI) that exploits mechanisms of interaction between humans and processes domain knowledge with the intention of building intelligent systems (IS). Among IS, Artificial Neural Network and Fuzzy Logic are the two most popular soft computing techniques for software development effort estimation. In this paper neural network models and Mamdani FIS model have been used to predict the early stage effort estimations using the student dataset. It has been found that Mamdani FIS was able to predict the early stage efforts more efficiently in comparison to the neural network models based models. △ Less

Submitted 28 April, 2012; originally announced April 2012.

Comments: 09 PAGES

Journal ref: International Journal of Software Engineering & Applications (IJSEA), Vol.3, No.2, March 2012

arXiv:1204.2433 [pdf, ps, other]

Decode-and-Forward Based Differential Modulation for Cooperative Communication System with Unitary and Non-Unitary Constellations

Authors: Manav R. Bhatnagar

Abstract: In this paper, we derive a maximum likelihood (ML) decoder of the differential data in a decode-and-forward (DF) based cooperative communication system utilizing uncoded transmissions. This decoder is applicable to complex-valued unitary and non-unitary constellations suitable for differential modulation. The ML decoder helps in improving the diversity of the DF based differential cooperative syst… ▽ More In this paper, we derive a maximum likelihood (ML) decoder of the differential data in a decode-and-forward (DF) based cooperative communication system utilizing uncoded transmissions. This decoder is applicable to complex-valued unitary and non-unitary constellations suitable for differential modulation. The ML decoder helps in improving the diversity of the DF based differential cooperative system using an erroneous relaying node. We also derive a piecewise linear (PL) decoder of the differential data transmitted in the DF based cooperative system. The proposed PL decoder significantly reduces the decoding complexity as compared to the proposed ML decoder without any significant degradation in the receiver performance. Existing ML and PL decoders of the differentially modulated uncoded data in the DF based cooperative communication system are only applicable to binary modulated signals like binary phase shift keying (BPSK) and binary frequency shift keying (BFSK), whereas, the proposed decoders are applicable to complex-valued unitary and non-unitary constellations suitable for differential modulation under uncoded transmissions. We derive a closedform expression of the uncoded average symbol error rate (SER) of the proposed PL decoder with M-PSK constellation in a cooperative communication system with a single relay and one source-destination pair. An approximate average SER by ignoring higher order noise terms is also derived for this set-up. It is analytically shown on the basis of the derived approximate SER that the proposed PL decoder provides full diversity of second order. In addition, we also derive approximate SER of the differential DF system with multiple relays at asymptotically high signal-to-noise ratio of the source-relay links. △ Less

Submitted 11 April, 2012; originally announced April 2012.

Showing 1–11 of 11 results for author: Bhatnagar, R