Skip to main content

Showing 1–26 of 26 results for author: Köppel, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03118  [pdf, other

    cs.LG

    Can machine learning solve the challenge of adaptive learning and the individualization of learning paths? A field experiment in an online learning platform

    Authors: Tim Klausmann, Marius Köppel, Daniel Schunk, Isabell Zipperle

    Abstract: The individualization of learning contents based on digital technologies promises large individual and social benefits. However, it remains an open question how this individualization can be implemented. To tackle this question we conduct a randomized controlled trial on a large digital self-learning platform. We develop an algorithm based on two convolutional neural networks that assigns tasks to… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. Do Pretrained Contextual Language Models Distinguish between Hebrew Homograph Analyses?

    Authors: Avi Shmidman, Cheyn Shmuel Shmidman, Dan Bareket, Moshe Koppel, Reut Tsarfaty

    Abstract: Semitic morphologically-rich languages (MRLs) are characterized by extreme word ambiguity. Because most vowels are omitted in standard texts, many of the words are homographs with multiple possible analyses, each with a different pronunciation and different morphosyntactic properties. This ambiguity goes beyond word-sense disambiguation (WSD), and may include token segmentation into multiple word… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Journal ref: In Proceedings of EACL 2023, 849-864 (2023)

  3. arXiv:2403.06970  [pdf, other

    cs.CL

    MRL Parsing Without Tears: The Case of Hebrew

    Authors: Shaltiel Shmidman, Avi Shmidman, Moshe Koppel, Reut Tsarfaty

    Abstract: Syntactic parsing remains a critical tool for relation extraction and information extraction, especially in resource-scarce languages where LLMs are lacking. Yet in morphologically rich languages (MRLs), where parsers need to identify multiple lexical units in each token, existing systems suffer in latency and setup complexity. Some use a pipeline to peel away the layers: first segmentation, then… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  4. arXiv:2309.14568  [pdf, other

    cs.CL

    Introducing DictaLM -- A Large Generative Language Model for Modern Hebrew

    Authors: Shaltiel Shmidman, Avi Shmidman, Amir David Nissan Cohen, Moshe Koppel

    Abstract: We present DictaLM, a large-scale language model tailored for Modern Hebrew. Boasting 7B parameters, this model is predominantly trained on Hebrew-centric data. As a commitment to promoting research and development in the Hebrew language, we release both the foundation model and the instruct-tuned model under a Creative Commons license. Concurrently, we introduce DictaLM-Rab, another foundation mo… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  5. arXiv:2308.16687  [pdf, other

    cs.CL

    DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew

    Authors: Shaltiel Shmidman, Avi Shmidman, Moshe Koppel

    Abstract: We present DictaBERT, a new state-of-the-art pre-trained BERT model for modern Hebrew, outperforming existing models on most benchmarks. Additionally, we release three fine-tuned versions of the model, designed to perform three specific foundational tasks in the analysis of Hebrew texts: prefix segmentation, morphological tagging and question answering. These fine-tuned models allow any developer… ▽ More

    Submitted 13 October, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Updated second version, with links to two question-answering models

  6. OSDaR23: Open Sensor Data for Rail 2023

    Authors: Rustam Tagiew, Martin Köppel, Karsten Schwalbe, Patrick Denzler, Philipp Neumaier, Tobias Klockau, Martin Boekhoff, Pavel Klasek, Roman Tilly

    Abstract: To achieve a driverless train operation on mainline railways, actual and potential obstacles for the train's driveway must be detected automatically by appropriate sensor systems. Machine learning algorithms have proven to be powerful tools for this task during the last years. However, these algorithms require large amounts of high-quality annotated data containing railway-specific objects as trai… ▽ More

    Submitted 19 March, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: 7 pages, 11 images, 5 tables

    MSC Class: 68T40 ACM Class: I.2.9

    Journal ref: 8th International Conference on Robotics and Automation Engineering (ICRAE), Singapore, Singapore, 2023, pp. 270-276

  7. arXiv:2211.15199  [pdf

    cs.CL

    Large Pre-Trained Models with Extra-Large Vocabularies: A Contrastive Analysis of Hebrew BERT Models and a New One to Outperform Them All

    Authors: Eylon Gueta, Avi Shmidman, Shaltiel Shmidman, Cheyn Shmuel Shmidman, Joshua Guedalia, Moshe Koppel, Dan Bareket, Amit Seker, Reut Tsarfaty

    Abstract: We present a new pre-trained language model (PLM) for modern Hebrew, termed AlephBERTGimmel, which employs a much larger vocabulary (128K items) than standard Hebrew PLMs before. We perform a contrastive analysis of this model against all previous Hebrew PLMs (mBERT, heBERT, AlephBERT) and assess the effects of larger vocabularies on task performance. Our experiments show that larger vocabularies… ▽ More

    Submitted 15 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

  8. arXiv:2208.02656  [pdf, other

    cs.LG cs.AI cs.NE

    Invariant Representations with Stochastically Quantized Neural Networks

    Authors: Mattia Cerrato, Marius Köppel, Roberto Esposito, Stefan Kramer

    Abstract: Representation learning algorithms offer the opportunity to learn invariant representations of the input data with regard to nuisance factors. Many authors have leveraged such strategies to learn fair representations, i.e., vectors where information about sensitive attributes is removed. These methods are attractive as they may be interpreted as minimizing the mutual information between a neural l… ▽ More

    Submitted 2 December, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: To appear in AAAI23

  9. arXiv:2208.01875  [pdf

    cs.CL

    Introducing BEREL: BERT Embeddings for Rabbinic-Encoded Language

    Authors: Avi Shmidman, Joshua Guedalia, Shaltiel Shmidman, Cheyn Shmuel Shmidman, Eli Handel, Moshe Koppel

    Abstract: We present a new pre-trained language model (PLM) for Rabbinic Hebrew, termed Berel (BERT Embeddings for Rabbinic-Encoded Language). Whilst other PLMs exist for processing Hebrew texts (e.g., HeBERT, AlephBert), they are all trained on modern Hebrew texts, which diverges substantially from Rabbinic Hebrew in terms of its lexicographical, morphological, syntactic and orthographic norms. We demonstr… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

  10. arXiv:2202.03078  [pdf, other

    cs.LG cs.CY

    Fair Interpretable Representation Learning with Correction Vectors

    Authors: Mattia Cerrato, Alesia Vallenas Coronel, Marius Köppel, Alexander Segner, Roberto Esposito, Stefan Kramer

    Abstract: Neural network architectures have been extensively employed in the fair representation learning setting, where the objective is to learn a new representation for a given vector which is independent of sensitive information. Various representation debiasing techniques have been proposed in the literature. However, as neural networks are inherently opaque, these methods are hard to comprehend, which… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  11. arXiv:2201.06343  [pdf, other

    cs.LG stat.ML

    Fair Interpretable Learning via Correction Vectors

    Authors: Mattia Cerrato, Marius Köppel, Alexander Segner, Stefan Kramer

    Abstract: Neural network architectures have been extensively employed in the fair representation learning setting, where the objective is to learn a new representation for a given vector which is independent of sensitive information. Various "representation debiasing" techniques have been proposed in the literature. However, as neural networks are inherently opaque, these methods are hard to comprehend, whi… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: ICLR-21 Workshop on Responsible AI

  12. arXiv:2201.06336  [pdf, other

    cs.LG cs.AI

    Fair Group-Shared Representations with Normalizing Flows

    Authors: Mattia Cerrato, Marius Köppel, Alexander Segner, Stefan Kramer

    Abstract: The issue of fairness in machine learning stems from the fact that historical data often displays biases against specific groups of people which have been underprivileged in the recent past, or still are. In this context, one of the possible approaches is to employ fair representation learning algorithms which are able to remove biases from data, making groups statistically indistinguishable. In t… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: ICLR-21 Workshop on Responsible AI

  13. arXiv:2010.12613  [pdf, other

    cs.CL

    Ranking Creative Language Characteristics in Small Data Scenarios

    Authors: Julia Siekiera, Marius Köppel, Edwin Simpson, Kevin Stowe, Iryna Gurevych, Stefan Kramer

    Abstract: The ability to rank creative natural language provides an important general tool for downstream language understanding and generation. However, current deep ranking models require substantial amounts of labeled data that are difficult and expensive to obtain for different domains, languages and creative characteristics. A recent neural approach, the DirectRanker, promises to reduce the amount of t… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 10 pages, 3 figures

  14. arXiv:2010.02864  [pdf

    cs.CL

    A Novel Challenge Set for Hebrew Morphological Disambiguation and Diacritics Restoration

    Authors: Avi Shmidman, Joshua Guedalia, Shaltiel Shmidman, Moshe Koppel, Reut Tsarfaty

    Abstract: One of the primary tasks of morphological parsers is the disambiguation of homographs. Particularly difficult are cases of unbalanced ambiguity, where one of the possible analyses is far more frequent than the others. In such cases, there may not exist sufficient examples of the minority analyses in order to properly evaluate performance, nor to train effective classifiers. In this paper we addres… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Journal ref: Findings of EMNLP, 2020

  15. arXiv:2005.03312  [pdf, other

    cs.CL

    Nakdan: Professional Hebrew Diacritizer

    Authors: Avi Shmidman, Shaltiel Shmidman, Moshe Koppel, Yoav Goldberg

    Abstract: We present a system for automatic diacritization of Hebrew text. The system combines modern neural models with carefully curated declarative linguistic knowledge and comprehensive manually constructed tables and dictionaries. Besides providing state of the art diacritization accuracy, the system also supports an interface for manual editing and correction of the automatic output, and has several f… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL 2020, System Demonstrations

  16. arXiv:1909.02768  [pdf, other

    cs.IR cs.LG stat.ML

    Pairwise Learning to Rank by Neural Networks Revisited: Reconstruction, Theoretical Analysis and Practical Performance

    Authors: Marius Köppel, Alexander Segner, Martin Wagener, Lukas Pensel, Andreas Karwath, Stefan Kramer

    Abstract: We present a pairwise learning to rank approach based on a neural net, called DirectRanker, that generalizes the RankNet architecture. We show mathematically that our model is reflexive, antisymmetric, and transitive allowing for simplified training and improved performance. Experimental results on the LETOR MSLR-WEB10K, MQ2007 and MQ2008 datasets show that our model outperforms numerous state-of-… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

    Comments: 16 pages, 8 figures

  17. Comparison of data-driven uncertainty quantification methods for a carbon dioxide storage benchmark scenario

    Authors: Markus Köppel, Fabian Franzelin, Ilja Kröker, Sergey Oladyshkin, Gabriele Santin, Dominik Wittwar, Andrea Barth, Bernard Haasdonk, Wolfgang Nowak, Dirk Pflüger, Christian Rohde

    Abstract: A variety of methods is available to quantify uncertainties arising with\-in the modeling of flow and transport in carbon dioxide storage, but there is a lack of thorough comparisons. Usually, raw data from such storage sites can hardly be described by theoretical statistical distributions since only very limited data is available. Hence, exact information on distribution shapes for all uncertain… ▽ More

    Submitted 8 February, 2018; originally announced February 2018.

    MSC Class: 65D05; 65D15; 65C20

  18. arXiv:1711.08337  [pdf, ps, other

    cs.NE cs.LG stat.ML

    Genetic Algorithms for Evolving Computer Chess Programs

    Authors: Eli David, H. Jaap van den Herik, Moshe Koppel, Nathan S. Netanyahu

    Abstract: This paper demonstrates the use of genetic algorithms for evolving: 1) a grandmaster-level evaluation function, and 2) a search mechanism for a chess program, the parameter values of which are initialized randomly. The evaluation function of the program is evolved by learning from databases of (human) grandmaster games. At first, the organisms are evolved to mimic the behavior of human grandmaster… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: Winner of Gold Award in 11th Annual "Humies" Awards for Human-Competitive Results. arXiv admin note: substantial text overlap with arXiv:1711.06840, arXiv:1711.06841, arXiv:1711.06839

    Journal ref: IEEE Transactions on Evolutionary Computation, Vol. 18, No. 5, pp. 779-789, September 2014

  19. arXiv:1711.06841  [pdf, ps, other

    cs.NE cs.LG stat.ML

    Expert-Driven Genetic Algorithms for Simulating Evaluation Functions

    Authors: Eli David, Moshe Koppel, Nathan S. Netanyahu

    Abstract: In this paper we demonstrate how genetic algorithms can be used to reverse engineer an evaluation function's parameters for computer chess. Our results show that using an appropriate expert (or mentor), we can evolve a program that is on par with top tournament-playing chess programs, outperforming a two-time World Computer Chess Champion. This performance gain is achieved by evolving a program th… ▽ More

    Submitted 18 November, 2017; originally announced November 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1711.06839, arXiv:1711.06840

    Journal ref: Genetic Programming and Evolvable Machines, Vol. 12, No. 1, pp. 5-22, March 2011

  20. arXiv:1711.06840  [pdf, ps, other

    cs.NE cs.LG stat.ML

    Simulating Human Grandmasters: Evolution and Coevolution of Evaluation Functions

    Authors: Eli David, H. Jaap van den Herik, Moshe Koppel, Nathan S. Netanyahu

    Abstract: This paper demonstrates the use of genetic algorithms for evolving a grandmaster-level evaluation function for a chess program. This is achieved by combining supervised and unsupervised learning. In the supervised learning phase the organisms are evolved to mimic the behavior of human grandmasters, and in the unsupervised learning phase these evolved organisms are further improved upon by means of… ▽ More

    Submitted 18 November, 2017; originally announced November 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1711.06839, arXiv:1711.06841

    Journal ref: ACM Genetic and Evolutionary Computation Conference (GECCO), pages 1483-1489, Montreal, Canada, July 2009

  21. arXiv:1711.06839  [pdf, ps, other

    cs.NE cs.LG stat.ML

    Genetic Algorithms for Mentor-Assisted Evaluation Function Optimization

    Authors: Eli David, Moshe Koppel, Nathan S. Netanyahu

    Abstract: In this paper we demonstrate how genetic algorithms can be used to reverse engineer an evaluation function's parameters for computer chess. Our results show that using an appropriate mentor, we can evolve a program that is on par with top tournament-playing chess programs, outperforming a two-time World Computer Chess Champion. This performance gain is achieved by evolving a program with a smaller… ▽ More

    Submitted 18 November, 2017; originally announced November 2017.

    Comments: Winner of Best Paper Award in GECCO 2008. arXiv admin note: substantial text overlap with arXiv:1711.06840, arXiv:1711.06841

    Journal ref: ACM Genetic and Evolutionary Computation Conference (GECCO), pages 1469-1475, Atlanta, GA, July 2008

  22. arXiv:1612.08989  [pdf, other

    cs.CL

    Shamela: A Large-Scale Historical Arabic Corpus

    Authors: Yonatan Belinkov, Alexander Magidow, Maxim Romanov, Avi Shmidman, Moshe Koppel

    Abstract: Arabic is a widely-spoken language with a rich and long history spanning more than fourteen centuries. Yet existing Arabic corpora largely focus on the modern period or lack sufficient diachronic information. We develop a large-scale, historical corpus of Arabic of about 1 billion words from diverse periods of time. We clean this corpus, process it with a morphological analyzer, and enhance it by… ▽ More

    Submitted 28 December, 2016; originally announced December 2016.

    Comments: Slightly expanded version of Coling LT4DH workshop paper

    ACM Class: I.2.7

  23. Identification of Parallel Passages Across a Large Hebrew/Aramaic Corpus

    Authors: Avi Shmidman, Moshe Koppel, Ely Porat

    Abstract: We propose a method for efficiently finding all parallel passages in a large corpus, even if the passages are not quite identical due to rephrasing and orthographic variation. The key ideas are the representation of each word in the corpus by its two most infrequent letters, finding matched pairs of strings of four or five words that differ by at most one word and then identifying clusters of such… ▽ More

    Submitted 31 December, 2017; v1 submitted 28 February, 2016; originally announced February 2016.

    Comments: Submission to the Journal of Data Mining and Digital Humanities (Special Issue on Computer-Aided Processing of Intertextuality in Ancient Languages)

    Journal ref: Journal of Data Mining & Digital Humanities, Special Issue on Computer-Aided Processing of Intertextuality in Ancient Languages, Towards a Digital Ecosystem: NLP. Corpus infrastructure. Methods for Retrieving Texts and Computing Text Similarities (March 11, 2018) jdmdh:1388

  24. arXiv:1009.0550  [pdf, ps, other

    cs.AI cs.NE

    Optimizing Selective Search in Chess

    Authors: Omid David-Tabibi, Moshe Koppel, Nathan S. Netanyahu

    Abstract: In this paper we introduce a novel method for automatically tuning the search parameters of a chess program using genetic algorithms. Our results show that a large set of parameter values can be learned automatically, such that the resulting performance is comparable with that of manually tuned parameters of top tournament-playing chess programs.

    Submitted 2 September, 2010; originally announced September 2010.

    Journal ref: Proceedings of the International Conference on Machine Learning (ICML) Workshop on Machine Learning and Games, Haifa, Israel, June 2010

  25. arXiv:cs/9803103  [pdf, ps

    cs.AI

    Tractability of Theory Patching

    Authors: S. Argamon-Engelson, M. Koppel

    Abstract: In this paper we consider the problem of `theory patching', in which we are given a domain theory, some of whose components are indicated to be possibly flawed, and a set of labeled training examples for the domain concept. The theory patching problem is to revise only the indicated components of the theory, such that the resulting theory correctly classifies all the training examples. Theory pa… ▽ More

    Submitted 28 February, 1998; originally announced March 1998.

    Comments: See http://www.jair.org/ for any accompanying files

    Journal ref: Journal of Artificial Intelligence Research, Vol 8, (1998), 39-65

  26. arXiv:cs/9402103  [pdf, ps

    cs.AI

    Bias-Driven Revision of Logical Domain Theories

    Authors: M. Koppel, R. Feldman, A. M. Segre

    Abstract: The theory revision problem is the problem of how best to go about revising a deficient domain theory using information contained in examples that expose inaccuracies. In this paper we present our approach to the theory revision problem for propositional domain theories. The approach described here, called PTR, uses probabilities associated with domain theory elements to numerically track the ``… ▽ More

    Submitted 31 January, 1994; originally announced February 1994.

    Comments: See http://www.jair.org/ for any accompanying files

    Journal ref: Journal of Artificial Intelligence Research, Vol 1, (1994), 159-208