Skip to main content

Showing 1–28 of 28 results for author: Kara, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16224  [pdf, ps, other

    cs.DB

    Tractable Conjunctive Queries over Static and Dynamic Relations

    Authors: Ahmet Kara, Zheng Luo, Milos Nikolic, Dan Olteanu, Haozhe Zhang

    Abstract: We investigate the evaluation of conjunctive queries over static and dynamic relations. While static relations are given as input and do not change, dynamic relations are subject to inserts and deletes. We characterise syntactically three classes of queries that admit constant update time and constant enumeration delay. We call such queries tractable. Depending on the class, the preprocessing ti… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    ACM Class: H.2.4

  2. arXiv:2312.09331  [pdf, ps, other

    cs.DB

    Insert-Only versus Insert-Delete in Dynamic Query Evaluation

    Authors: Mahmoud Abo Khamis, Ahmet Kara, Dan Olteanu, Dan Suciu

    Abstract: We study the dynamic query evaluation problem: Given a join query Q and a sequence of updates, we would like to construct a data structure that supports constant-delay enumeration of the query output after each update. We show that a sequence of N insert-only updates (to an initially empty database) can be executed in total time O(N^{w(Q)}), where w(Q) is the fractional hypertree width of Q. Thi… ▽ More

    Submitted 8 June, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

  3. arXiv:2311.00123  [pdf, other

    math.OC cs.AI eess.SY

    Q-Learning for Stochastic Control under General Information Structures and Non-Markovian Environments

    Authors: Ali Devran Kara, Serdar Yuksel

    Abstract: As a primary contribution, we present a convergence theorem for stochastic iterations, and in particular, Q-learning iterates, under a general, possibly non-Markovian, stochastic environment. Our conditions for convergence involve an ergodicity and a positivity criterion. We provide a precise characterization on the limit of the iterates and conditions on the environment and initializations for co… ▽ More

    Submitted 4 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 2 figures

  4. arXiv:2309.11346  [pdf, other

    cs.CL cs.LG

    GECTurk: Grammatical Error Correction and Detection Dataset for Turkish

    Authors: Atakan Kara, Farrin Marouf Sofian, Andrew Bond, Gözde Gül Şahin

    Abstract: Grammatical Error Detection and Correction (GEC) tools have proven useful for native speakers and second language learners. Develo** such tools requires a large amount of parallel, annotated data, which is unavailable for most languages. Synthetic data generation is a common practice to overcome the scarcity of such data. However, it is not straightforward for morphologically rich languages like… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Accepted at Findings of IJCNLP-AACL 2023

  5. arXiv:2308.05588  [pdf, other

    cs.DB

    Banzhaf Values for Facts in Query Answering

    Authors: Omer Abramovich, Daniel Deutch, Nave Frost, Ahmet Kara, Dan Olteanu

    Abstract: Quantifying the contribution of database facts to query answers has been studied as means of explanation. The Banzhaf value, originally developed in Game Theory, is a natural measure of fact contribution, yet its efficient computation for select-project-join-union queries is challenging. In this paper, we introduce three algorithms to compute the Banzhaf value of database facts: an exact algorithm… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  6. arXiv:2307.16540  [pdf, ps, other

    cs.DB

    ADOPT: Adaptively Optimizing Attribute Orders for Worst-Case Optimal Join Algorithms via Reinforcement Learning

    Authors: Junxiong Wang, Immanuel Trummer, Ahmet Kara, Dan Olteanu

    Abstract: The performance of worst-case optimal join algorithms depends on the order in which the join attributes are processed. Selecting good orders before query execution is hard, due to the large space of possible orders and unreliable execution cost estimates in case of data skew or data correlation. We propose ADOPT, a query engine that combines adaptive query processing with a worst-case optimal join… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    ACM Class: H.3

  7. arXiv:2306.14211  [pdf, ps, other

    cs.DB cs.CC cs.LO

    From Shapley Value to Model Counting and Back

    Authors: Ahmet Kara, Dan Olteanu, Dan Suciu

    Abstract: In this paper we investigate the problem of quantifying the contribution of each variable to the satisfying assignments of a Boolean function based on the Shapley value. Our main result is a polynomial-time equivalence between computing Shapley values and model counting for any class of Boolean functions that are closed under substitutions of variables with disjunctions of fresh variables. This… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: 22 pages

    ACM Class: F.4.1; F.2; H.2

  8. F-IVM: Analytics over Relational Databases under Updates

    Authors: Ahmet Kara, Milos Nikolic, Dan Olteanu, Haozhe Zhang

    Abstract: This article describes F-IVM, a unified approach for maintaining analytics over changing relational data. We exemplify its versatility in four disciplines: processing queries with group-by aggregates and joins; learning linear regression models using the covariance matrix of the input features; building Chow-Liu trees using pairwise mutual information of the input features; and matrix chain multip… ▽ More

    Submitted 29 January, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

  9. arXiv:2208.14332  [pdf, other

    cs.IR

    Extracting Relations Between Sectors

    Authors: Atakan Kara, F. Serhan Daniş, Günce Keziban Orman, Sultan Nezihe Turhan

    Abstract: The term "sector" in professional business life is a vague concept since companies tend to identify themselves as operating in multiple sectors simultaneously. This ambiguity poses problems in recommending jobs to job seekers or finding suitable candidates for open positions. The latter holds significant importance when available candidates in a specific sector are also scarce; hence, finding cand… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 13 pages and 3 figures

  10. arXiv:2206.09032  [pdf, other

    cs.DB

    Conjunctive Queries with Free Access Patterns under Updates

    Authors: Ahmet Kara, Milos Nikolic, Dan Olteanu, Haozhe Zhang

    Abstract: We study the problem of answering conjunctive queries with free access patterns (CQAP) under updates. A free access pattern is a partition of the free variables of the query into input and output. The query returns tuples over the output variables given a tuple of values over the input variables. We introduce a fully dynamic evaluation approach for CQAP queries. We also give a syntactic characte… ▽ More

    Submitted 14 February, 2024; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Extended and polished version. Added new Section 11 on the dynamic evaluation of conjunctive queries with free access patterns over probabilistic databases

    ACM Class: H.2.4

  11. arXiv:2112.00195  [pdf, other

    cs.LG

    Efficient Online Bayesian Inference for Neural Bandits

    Authors: Gerardo Duran-Martin, Aleyna Kara, Kevin Murphy

    Abstract: In this paper we present a new algorithm for online (sequential) inference in Bayesian neural networks, and show its suitability for tackling contextual bandit problems. The key idea is to combine the extended Kalman filter (which locally linearizes the likelihood function at each time step) with a (learned or random) low-dimensional affine subspace for the parameters; the use of a subspace enable… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Journal ref: AISTATS 2022

  12. arXiv:2111.06781  [pdf, ps, other

    cs.LG eess.SY

    Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity

    Authors: Ali Devran Kara, Naci Saldi, Serdar Yüksel

    Abstract: Reinforcement learning algorithms often require finiteness of state and action spaces in Markov decision processes (MDPs) (also called controlled Markov chains) and various efforts have been made in the literature towards the applicability of such algorithms for continuous state and action spaces. In this paper, we show that under very mild regularity conditions (in particular, involving only weak… ▽ More

    Submitted 7 September, 2023; v1 submitted 12 November, 2021; originally announced November 2021.

  13. arXiv:2107.13923  [pdf, ps, other

    cs.DB

    Machine Learning over Static and Dynamic Relational Data

    Authors: Ahmet Kara, Milos Nikolic, Dan Olteanu, Haozhe Zhang

    Abstract: This tutorial overviews principles behind recent works on training and maintaining machine learning models over relational data, with an emphasis on the exploitation of the relational data structure to improve the runtime performance of the learning task. The tutorial has the following parts: 1) Database research for data science 2) Three main ideas to achieve performance improvements 2.1)… ▽ More

    Submitted 29 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2008.07864

  14. arXiv:2103.12158  [pdf, other

    cs.LG eess.SY

    Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability

    Authors: Ali Devran Kara, Serdar Yuksel

    Abstract: In this paper, for POMDPs, we provide the convergence of a Q learning algorithm for control policies using a finite history of past observations and control actions, and, consequentially, we establish near optimality of such limit Q functions under explicit filter stability conditions. We present explicit error bounds relating the approximation error to the length of the finite history window. We… ▽ More

    Submitted 25 October, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

  15. arXiv:2010.07452  [pdf, other

    math.OC cs.LG

    Near Optimality of Finite Memory Feedback Policies in Partially Observed Markov Decision Processes

    Authors: Ali Devran Kara, Serdar Yuksel

    Abstract: In the theory of Partially Observed Markov Decision Processes (POMDPs), existence of optimal policies have in general been established via converting the original partially observed stochastic control problem to a fully observed one on the belief space, leading to a belief-MDP. However, computing an optimal policy for this fully observed model, and so for the original POMDP, using classical dynami… ▽ More

    Submitted 8 January, 2022; v1 submitted 14 October, 2020; originally announced October 2020.

  16. arXiv:2006.00694  [pdf, other

    cs.DB

    F-IVM: Learning over Fast-Evolving Relational Data

    Authors: Milos Nikolic, Haozhe Zhang, Ahmet Kara, Dan Olteanu

    Abstract: F-IVM is a system for real-time analytics such as machine learning applications over training datasets defined by queries over fast-evolving relational databases. We will demonstrate F-IVM for three such applications: model selection, Chow-Liu trees, and ridge linear regression.

    Submitted 31 May, 2020; originally announced June 2020.

    Comments: SIGMOD DEMO 2020, 5 pages

  17. arXiv:2004.03716  [pdf, ps, other

    cs.DB

    Maintaining Triangle Queries under Updates

    Authors: Ahmet Kara, Milos Nikolic, Hung Q. Ngo, Dan Olteanu, Haozhe Zhang

    Abstract: We consider the problem of incrementally maintaining the triangle queries with arbitrary free variables under single-tuple updates to the input relations. We introduce an approach called IVM$^ε$ that exhibits a trade-off between the update time, the space, and the delay for the enumeration of the query result, such that the update time ranges from the square root to linear in the database size whi… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: 47 pages, 18 figures

    ACM Class: H.2.4

  18. Trade-offs in Static and Dynamic Evaluation of Hierarchical Queries

    Authors: Ahmet Kara, Milos Nikolic, Dan Olteanu, Haozhe Zhang

    Abstract: We investigate trade-offs in static and dynamic evaluation of hierarchical queries with arbitrary free variables. In the static setting, the trade-off is between the time to partially compute the query result and the delay needed to enumerate its tuples. In the dynamic setting, we additionally consider the time needed to update the query result under single-tuple inserts or deletes to the database… ▽ More

    Submitted 8 August, 2023; v1 submitted 3 July, 2019; originally announced July 2019.

    Journal ref: Logical Methods in Computer Science, Volume 19, Issue 3 (August 9, 2023) lmcs:10035

  19. arXiv:1902.00585  [pdf, ps, other

    cs.DB

    Incremental Techniques for Large-Scale Dynamic Query Processing

    Authors: Iman Elghandour, Ahmet Kara, Dan Olteanu, Stijn Vansummeren

    Abstract: Many applications from various disciplines are now required to analyze fast evolving big data in real time. Various approaches for incremental processing of queries have been proposed over the years. Traditional approaches rely on updating the results of a query when updates are streamed rather than re-computing these queries, and therefore, higher execution performance is expected. However, they… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

  20. arXiv:1804.02780  [pdf, ps, other

    cs.DB

    Counting Triangles under Updates in Worst-Case Optimal Time

    Authors: Ahmet Kara, Hung Q. Ngo, Milos Nikolic, Dan Olteanu, Haozhe Zhang

    Abstract: We consider the problem of incrementally maintaining the triangle count query under single-tuple updates to the input relations. We introduce an approach that exhibits a space-time tradeoff such that the space-time product is quadratic in the size of the input database and the update time can be as low as the square root of this size. This lowest update time is worst-case optimal conditioned on th… ▽ More

    Submitted 25 March, 2019; v1 submitted 8 April, 2018; originally announced April 2018.

    Comments: simplified notation; incremental maintenance of full triangle query, 4-path count query, count queries with three relations added; improved the space complexity of the dynamic algorithm maintaining the triangle count query

    ACM Class: H.2.4

  21. arXiv:1709.01600  [pdf, ps, other

    cs.DB

    Covers of Query Results

    Authors: Ahmet Kara, Dan Olteanu

    Abstract: We introduce succinct lossless representations of query results called covers. They are subsets of the query results that correspond to minimal edge covers in the hypergraphs of these results. We first study covers whose structures are given by fractional hypertree decompositions of join queries. For any decomposition of a query, we give asymptotically tight size bounds for the covers of the que… ▽ More

    Submitted 10 January, 2018; v1 submitted 5 September, 2017; originally announced September 2017.

    Comments: 33 pages. Notation simplified

    MSC Class: 68P15 ACM Class: H.2.1

  22. arXiv:1209.3944  [pdf

    cs.DB

    Cyclic Association Rules Mining under Constraints

    Authors: Wafa Tebourski Wahiba Ben Abdessalem Karaa

    Abstract: Several researchers have explored the temporal aspect of association rules mining. In this paper, we focus on the cyclic association rules, in order to discover correlations among items characterized by regular cyclic variation overtime. The overview of the state of the art has revealed the drawbacks of proposed algorithm literatures, namely the excessive number of generated rules which are not me… ▽ More

    Submitted 18 September, 2012; originally announced September 2012.

    Comments: 8

  23. arXiv:1110.1221  [pdf, ps, other

    cs.FL cs.LO

    Feasible Automata for Two-Variable Logic with Successor on Data Words

    Authors: Ahmet Kara, Thomas Schwentick, Tony Tan

    Abstract: We introduce an automata model for data words, that is words that carry at each position a symbol from a finite alphabet and a value from an unbounded data domain. The model is (semantically) a restriction of data automata, introduced by Bojanczyk, et. al. in 2006, therefore it is called weak data automata. It is strictly less expressive than data automata and the expressive power is incomparable… ▽ More

    Submitted 6 October, 2011; originally announced October 2011.

    Comments: 21 pages

  24. arXiv:1107.3297  [pdf

    cs.SE

    Semantic annotation of requirements for automatic UML class diagram generation

    Authors: Soumaya Amdouni, Wahiba Ben Abdessalem Karaa, Sondes Bouabid

    Abstract: The increasing complexity of software engineering requires effective methods and tools to support requirements analysts' activities. While much of a company's knowledge can be found in text repositories, current content management systems have limited capabilities for structuring and interpreting documents. In this context, we propose a tool for transforming text documents describing users' requir… ▽ More

    Submitted 17 July, 2011; originally announced July 2011.

  25. Named Entity Recognition Using Web Document Corpus

    Authors: Wahiba Ben Abdessalem Karaa

    Abstract: This paper introduces a named entity recognition approach in textual corpus. This Named Entity (NE) can be a named: location, person, organization, date, time, etc., characterized by instances. A NE is found in texts accompanied by contexts: words that are left or right of the NE. The work mainly aims at identifying contexts inducing the NE's nature. As such, The occurrence of the word "President"… ▽ More

    Submitted 28 February, 2011; originally announced February 2011.

    Comments: 11 pages 4 figures, 2 tables

    MSC Class: 68T50; 68T05 ACM Class: H.3.3; I.2.7; I.2.6

    Journal ref: International Journal of Managing Information Technology (IJMIT) Vol.3, No.1, February 2011

  26. arXiv:1012.5439  [pdf, ps, other

    cs.LO

    Extending Büchi Automata with Constraints on Data Values

    Authors: Ahmet Kara, Tony Tan

    Abstract: Recently data trees and data words have received considerable amount of attention in connection with XML reasoning and system verification. These are trees or words that, in addition to labels from a finite alphabet, carry data values from an infinite alphabet (data). In general it is rather hard to obtain logics for data words and trees that are sufficiently expressive, but still have reasonable… ▽ More

    Submitted 23 June, 2012; v1 submitted 24 December, 2010; originally announced December 2010.

  27. arXiv:1010.1139  [pdf, ps, other

    cs.LO

    Temporal Logics on Words with Multiple Data Values

    Authors: Ahmet Kara, Thomas Schwentick, Thomas Zeume

    Abstract: The paper proposes and studies temporal logics for attributed words, that is, data words with a (finite) set of (attribute,value)-pairs at each position. It considers a basic logic which is a semantical fragment of the logic $LTL^\downarrow_1$ of Demri and Lazic with operators for navigation into the future and the past. By reduction to the emptiness problem for data automata it is shown that this… ▽ More

    Submitted 6 October, 2010; originally announced October 2010.

  28. On the Hybrid Extension of CTL and CTL+

    Authors: Ahmet Kara, Martin Lange, Thomas Schwentick, Volker Weber

    Abstract: The paper studies the expressivity, relative succinctness and complexity of satisfiability for hybrid extensions of the branching-time logics CTL and CTL+ by variables. Previous complexity results show that only fragments with one variable do have elementary complexity. It is shown that H1CTL+ and H1CTL, the hybrid extensions with one variable of CTL+ and CTL, respectively, are expressively equi… ▽ More

    Submitted 14 June, 2009; originally announced June 2009.