Search | arXiv e-print repository

arXiv:2405.20213 [pdf, other]

PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization

Authors: Vijay Jaisankar, Sambaran Bandyopadhyay, Kalp Vyas, Varre Chaitanya, Shwetha Somasundaram

Abstract: A poster from a long input document can be considered as a one-page easy-to-read multimodal (text and images) summary presented on a nice template with good design elements. Automatic transformation of a long document into a poster is a little-studied but challenging task. It involves content summarization of the input document followed by template generation and harmonization. In this work, we propose a novel deep submodular function which can be trained on ground truth summaries to extract multimodal content from the document and which explicitly ensures good coverage, diversity and alignment of text and images. Then, we use an LLM-based paraphraser and propose to generate a template with various design aspects conditioned on the input content. We show the merits of our approach through extensive automated and human evaluations.

Submitted 30 May, 2024; originally announced May 2024.

arXiv:1503.07542 [pdf, ps, other]

Energy-Efficient Adaptive Power Allocation for Incremental MIMO Systems

Authors: Tumula V. K. Chaitanya, Tho Le-Ngoc

Abstract: We consider energy-efficient adaptive power allocation for three incremental multiple-input multiple-output (IMIMO) systems employing ARQ, hybrid ARQ (HARQ) with Chase combining (CC), and HARQ with incremental redundancy (IR), to minimize their rate-outage probability (or equivalently packet drop rate) under a constraint on average energy consumption per data packet. We first provide the rate-outage probability expressions for the three IMIMO systems, and then propose methods to convert them into a tractable form and formulate the corresponding non-convex optimization problems that can be solved by an interior-point algorithm for finding a local optimum. To further reduce the solution complexity, using an asymptotically equivalent approximation of the rate-outage probability expressions, we approximate the non-convex optimization problems as a unified geometric programming problem (GPP), for which we derive the closed-form solution. Illustrative results indicate that the proposed power allocation (PPA) offers significant gains in energy savings as compared to the equal-power allocation (EPA), and the simple closed-form GPP solution can provide closer performance to the exact method at lower values of rate-outage probability, for the three IMIMO systems.

Submitted 25 March, 2015; originally announced March 2015.

Comments: Submitted to IEEE Transactions on Vehicular Technology

arXiv:cs/0308020 [pdf]

LERIL: Collaborative Effort for Creating Lexical Resources

Authors: Akshar Bharati, Dipti M Sharma, Vineet Chaitanya, Amba P Kulkarni, Rajeev Sangal, Durgesh D Rao

Abstract: The paper reports on efforts taken to create lexical resources pertaining to Indian languages, using the collaborative model. The lexical resources being developed are: (1) Transfer lexicon and grammar from English to several Indian languages. (2) Dependency tree bank of annotated corpora for several Indian languages. The dependency trees are based on the Paninian model. (3) Bilingual dictionary of 'core meanings'.

Submitted 7 August, 2003; originally announced August 2003.

Comments: Appeared in the Proceedings of Workshop on Language Resources in Asia, along with NLPRS-2001, Tokyo, 27-30 November 2001

Report number: LTRC-TR015; ACM Class: I.2.7

arXiv:cs/0308019 [pdf]

Language Access: An Information Based Approach

Authors: Akshar Bharati, Vineet Chaitanya, Amba P. Kulkarni, Rajeev Sangal

Abstract: The anusaaraka system (a kind of machine translation system) makes text in one Indian language accessible through another Indian language. The machine presents an image of the source text in a language close to the target language. In the image, some constructions of the source language (which do not have equivalents in the target language) spill over to the output. Some special notation is also devised. Anusaarakas have been built from five pairs of languages: Telugu, Kannada, Marathi, Bengali and Punjabi to Hindi. They are available for use through Email servers. Anusaarakas follow the principle of substitutability and reversibility of strings produced. This implies preservation of information while going from a source language to a target language. For narrow subject areas, specialized modules can be built by putting subject domain knowledge into the system, which produce good quality grammatical output. However, it should be remembered that such modules will work only in narrow areas, and will sometimes go wrong. In such a situation, anusaaraka output will still remain useful.

Submitted 7 August, 2003; originally announced August 2003.

Comments: Published in the proceedings of Knowledge Based Computer Systems conference, 2000, Tata McGraw-Hill, New Delhi, Dec. 2000

Report number: LTRC-TR010; ACM Class: I.2.7

Journal ref: Published in the proceedings of Knowledge Based Computer Systems Conference, 2000, Tata McGraw Hill, New Delhi 2000

arXiv:cs/0308018 [pdf]

Anusaaraka: Overcoming the Language Barrier in India

Authors: Akshar Bharati, Vineet Chaitanya, Amba P. Kulkarni, Rajeev Sangal, G Umamaheshwara Rao

Abstract: The anusaaraka system makes text in one Indian language accessible in another Indian language. In the anusaaraka approach, the load is so divided between man and computer that the language load is taken by the machine, and the interpretation of the text is left to the man. The machine presents an image of the source text in a language close to the target language. In the image, some constructions of the source language (which do not have equivalents) spill over to the output. Some special notation is also devised. The user after some training learns to read and understand the output. Because the Indian languages are close, the learning time of the output language is short, and is expected to be around 2 weeks. The output can also be post-edited by a trained user to make it grammatically correct in the target language. Style can also be changed, if necessary. Thus, in this scenario, it can function as a human assisted translation system. Currently, anusaarakas are being built from Telugu, Kannada, Marathi, Bengali and Punjabi to Hindi. They can be built for all Indian languages in the near future. Everybody must pitch in to build such systems connecting all Indian languages, using the free software model.

Submitted 7 August, 2003; originally announced August 2003.

Comments: Published in "Anuvad: Approaches to Translation", Rukmini Bhaya Nair, (editor), Sage, New Delhi, 2001

Report number: LTRC-TR009; ACM Class: I.2.7

Journal ref: Published in "Anuvad: Approaches to Translation", Rukmini Bhaya Nair, (editor), Sage, New Delhi, 2001

arXiv:cs/0308017 [pdf]

Information Revolution

Authors: Akshar Bharati, Vineet Chaitanya, Rajeev Sangal

Abstract: The world is passing through a major revolution called the information revolution, in which information and knowledge are becoming available to people in unprecedented amounts wherever and whenever they need it. Those societies which fail to take advantage of the new technology will be left behind, just like in the industrial revolution. The information revolution is based on two major technologies: computers and communication. These technologies have to be delivered in a COST EFFECTIVE manner, and in LANGUAGES accessible to people. One way to deliver them in a cost-effective manner is to make suitable technology choices (discussed later), and to allow people to access them through shared resources. This could be done through street corner shops (for computer usage, e-mail etc.), schools, community centers and local library centres.

Submitted 7 August, 2003; originally announced August 2003.

Comments: Published as a keynote lecture in IRIL-99: Information Revolution and Indian Languages, 12-14 Nov 1999

Report number: LTRC-TR007; ACM Class: I.2.7

Journal ref: Published as a keynote lecture in IRIL-99: Information Revolution and Indian Languages, 12-14 Nov 1999

arXiv:cs/0306130 [pdf]

Anusaaraka: Machine Translation in Stages

Authors: Akshar Bharati, Vineet Chaitanya, Amba P. Kulkarni, Rajeev Sangal

Abstract: Fully-automatic general-purpose high-quality machine translation systems (FGH-MT) are extremely difficult to build. In fact, there is no system in the world for any pair of languages which qualifies to be called FGH-MT. The reasons are not far to seek. Translation is a creative process which involves interpretation of the given text by the translator. Translation would also vary depending on the audience and the purpose for which it is meant. This would explain the difficulty of building a machine translation system. Since the machine is not capable of interpreting a general text with sufficient accuracy automatically at present, let alone re-expressing it for a given audience, it fails to perform as FGH-MT. (Footnote: The major difficulty that the machine faces in interpreting a given text is the lack of general world knowledge or common sense knowledge.)

Submitted 25 June, 2003; originally announced June 2003.

Comments: 5 pages, Published in Vivek, A Quarterly in Artificial Intelligence, 10, 3, July 1997, pp. 22-25

ACM Class: I.2.7

Journal ref: Vivek, A Quarterly in Artificial Intelligence, 10, 3, July 1997, pp. 22-25

Showing 1–7 of 7 results for author: Chaitanya, V