Skip to main content

Showing 1–10 of 10 results for author: Şenel, L K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05255  [pdf, other

    cs.CL cs.AI

    Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers

    Authors: Lütfi Kerem Senel, Besnik Fetahu, Davis Yoshida, Zhiyu Chen, Giuseppe Castellucci, Nikhita Vedula, Jason Choi, Shervin Malmasi

    Abstract: Recommender systems are widely used to suggest engaging content, and Large Language Models (LLMs) have given rise to generative recommenders. Such systems can directly generate items, including for open-set tasks like question suggestion. While the world knowledge of LLMs enable good recommendations, improving the generated content through user feedback is challenging as continuously fine-tuning L… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 Main Proceedings

  2. arXiv:2310.12020  [pdf, other

    cs.RO cs.CL cs.CV

    LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation

    Authors: Shengqiang Zhang, Philipp Wicke, Lütfi Kerem Şenel, Luis Figueredo, Abdeldjallil Naceri, Sami Haddadin, Barbara Plank, Hinrich Schütze

    Abstract: The convergence of embodied agents and large language models (LLMs) has brought significant advancements to embodied instruction following. Particularly, the strong reasoning capabilities of LLMs make it possible for robots to perform long-horizon tasks without expensive annotated demonstrations. However, public benchmarks for testing the long-horizon reasoning capabilities of language-conditioned… ▽ More

    Submitted 23 October, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 6 pages, 4 figures. The video and code of LoHoRavens are available at https://cisnlp.github.io/lohoravens-webpage/

  3. arXiv:2306.14830  [pdf, other

    cs.RO

    Towards Language-Based Modulation of Assistive Robots through Multimodal Models

    Authors: Philipp Wicke, Lüfti Kerem Şenel, Shengqiang Zhang, Luis Figueredo, Abdeldjallil Naceri, Sami Haddadin, Hinrich Schütze

    Abstract: In the field of Geriatronics, enabling effective and transparent communication between humans and robots is crucial for enhancing the acceptance and performance of assistive robots. Our early-stage research project investigates the potential of language-based modulation as a means to improve human-robot interaction. We propose to explore real-time modulation during task execution, leveraging langu… ▽ More

    Submitted 27 June, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: GERIATRONICS SUMMIT 2023

  4. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  5. arXiv:2203.08654  [pdf, other

    cs.CL

    Graph Neural Networks for Multiparallel Word Alignment

    Authors: Ayyoob Imani, Lütfi Kerem Şenel, Masoud Jalili Sabet, François Yvon, Hinrich Schütze

    Abstract: After a period of decrease, interest in word alignments is increasing again for their usefulness in domains such as typological research, cross-lingual annotation projection, and machine translation. Generally, alignment algorithms only use bitext and do not make use of the fact that many parallel corpora are multiparallel. Here, we compute high-quality word alignments between multiple language pa… ▽ More

    Submitted 10 August, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Report number: ACL 2022 Findings

  6. arXiv:2203.06228  [pdf, other

    cs.CL cs.AI

    CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment

    Authors: Lütfi Kerem Senel, Timo Schick, Hinrich Schütze

    Abstract: Pretrained language models (PLMs) have achieved superhuman performance on many benchmarks, creating a need for harder tasks. We introduce CoDA21 (Context Definition Alignment), a challenging benchmark that measures natural language understanding (NLU) capabilities of PLMs: Given a definition and a context each for k words, but not the words themselves, the task is to align the k definitions with t… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: To appear in ACL 2022, 5 pages, 2 figures

  7. arXiv:2109.06283  [pdf, other

    cs.CL

    Graph Algorithms for Multiparallel Word Alignment

    Authors: Ayyoob Imani, Masoud Jalili Sabet, Lütfi Kerem Şenel, Philipp Dufter, François Yvon, Hinrich Schütze

    Abstract: With the advent of end-to-end deep learning approaches in machine translation, interest in word alignments initially decreased; however, they have again become a focus of research more recently. Alignments are useful for typological research, transferring formatting like markup to translated texts, and can be used in the decoding of machine translation systems. At the same time, massively multilin… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  8. arXiv:2102.03596  [pdf, other

    cs.CL

    Does He Wink or Does He Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models

    Authors: Lutfi Kerem Senel, Hinrich Schütze

    Abstract: Recent progress in pretraining language models on large corpora has resulted in large performance gains on many NLP tasks. These large models acquire linguistic knowledge during pretraining, which helps to improve performance on downstream tasks via fine-tuning. To assess what kind of knowledge is acquired, language models are commonly probed by querying them with `fill in the blank' style cloze q… ▽ More

    Submitted 6 February, 2021; originally announced February 2021.

    Comments: 5 pages, to appear in EACL 2021

  9. Imparting Interpretability to Word Embeddings while Preserving Semantic Structure

    Authors: Lutfi Kerem Senel, Ihsan Utlu, Furkan Şahinuç, Haldun M. Ozaktas, Aykut Koç

    Abstract: As an ubiquitous method in natural language processing, word embeddings are extensively employed to map semantic properties of words into a dense vector representation. They capture semantic and syntactic relations among words but the vectors corresponding to the words are only meaningful relative to each other. Neither the vector nor its dimensions have any absolute, interpretable meaning. We int… ▽ More

    Submitted 2 July, 2020; v1 submitted 19 July, 2018; originally announced July 2018.

    Comments: 14 pages, 5 figures

    Journal ref: Natural Language Engineering, 1-26, 2020

  10. Semantic Structure and Interpretability of Word Embeddings

    Authors: Lutfi Kerem Senel, Ihsan Utlu, Veysel Yucesoy, Aykut Koc, Tolga Cukur

    Abstract: Dense word embeddings, which encode semantic meanings of words to low dimensional vector spaces have become very popular in natural language processing (NLP) research due to their state-of-the-art performances in many NLP tasks. Word embeddings are substantially successful in capturing semantic relations among words, so a meaningful semantic structure must be present in the respective vector space… ▽ More

    Submitted 16 May, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: 11 Pages, 8 Figures, accepted by IEEE/ACM Transactions on Audio, Speech, and Language Processing

    Journal ref: L. K. Şenel, İ. Utlu, V. Yücesoy, A. Koç and T. Çukur, "Semantic Structure and Interpretability of Word Embeddings," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 10, pp. 1769-1779, Oct. 2018