Skip to main content

Showing 1–11 of 11 results for author: Oikarinen, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:1910.04069  [pdf, other

    stat.ML cs.LG

    Estimating regression errors without ground truth values

    Authors: Henri Tiittanen, Emilia Oikarinen, Andreas Henelius, Kai Puolamäki

    Abstract: Regression analysis is a standard supervised machine learning method used to model an outcome variable in terms of a set of predictor variables. In most real-world applications we do not know the true value of the outcome variable being predicted outside the training data, i.e., the ground truth is unknown. It is hence not straightforward to directly observe when the estimate from a model potentia… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: 33 pages, 9 figures, 2 tables

  2. arXiv:1905.02515  [pdf, other

    stat.ML cs.LG

    Guided Visual Exploration of Relations in Data Sets

    Authors: Kai Puolamäki, Emilia Oikarinen, Andreas Henelius

    Abstract: Efficient explorative data analysis systems must take into account both what a user knows and wants to know. This paper proposes a principled framework for interactive visual exploration of relations in data, through views most informative given the user's current knowledge and objectives. The user can input pre-existing knowledge of relations in the data and also formulate specific exploration in… ▽ More

    Submitted 1 July, 2021; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: 32 pages, 13 figures. This article extends arXiv:1804.03194 and arXiv:1805.07725

    Journal ref: Journal of Machine Learning Research 22(96):1-32, 2021

  3. arXiv:1805.07725  [pdf, other

    stat.ML cs.LG

    Human-guided data exploration using randomisation

    Authors: Kai Puolamäki, Emilia Oikarinen, Buse Atli, Andreas Henelius

    Abstract: An explorative data analysis system should be aware of what the user already knows and what the user wants to know of the data: otherwise the system cannot provide the user with the most informative and useful views of the data. We propose a principled way to do exploratory data analysis, where the user's background knowledge is modeled by a distribution parametrised by subsets of rows and columns… ▽ More

    Submitted 30 December, 2018; v1 submitted 20 May, 2018; originally announced May 2018.

    Comments: 14 pages, 8 figures

  4. arXiv:1804.03194  [pdf, other

    stat.ML cs.HC cs.LG

    Human-Guided Data Exploration

    Authors: Andreas Henelius, Emilia Oikarinen, Kai Puolamäki

    Abstract: The outcome of the explorative data analysis (EDA) phase is vital for successful data analysis. EDA is more effective when the user interacts with the system used to carry out the exploration. In the recently proposed paradigm of iterative data mining the user controls the exploration by inputting knowledge in the form of patterns observed during the process. The system then shows the user views o… ▽ More

    Submitted 9 April, 2018; originally announced April 2018.

  5. arXiv:1710.08167  [pdf, other

    stat.ML cs.IT cs.LG

    Interactive Visual Data Exploration with Subjective Feedback: An Information-Theoretic Approach

    Authors: Kai Puolamäki, Emilia Oikarinen, Bo Kang, Jefrey Lijffijt, Tijl De Bie

    Abstract: Visual exploration of high-dimensional real-valued datasets is a fundamental task in exploratory data analysis (EDA). Existing methods use predefined criteria to choose the representation of data. There is a lack of methods that (i) elicit from the user what she has learned from the data and (ii) show patterns that she does not know yet. We construct a theoretical model where identified patterns c… ▽ More

    Submitted 23 October, 2017; originally announced October 2017.

    Comments: 12 pages, 9 figures, 2 tables, conference submission

    Journal ref: Data Mining and Knowledge Discovery 34 (2020) 21-49

  6. Subjectively Interesting Subgroup Discovery on Real-valued Targets

    Authors: Jefrey Lijffijt, Bo Kang, Wouter Duivesteijn, Kai Puolamäki, Emilia Oikarinen, Tijl De Bie

    Abstract: Deriving insights from high-dimensional data is one of the core problems in data mining. The difficulty mainly stems from the fact that there are exponentially many variable combinations to potentially consider, and there are infinitely many if we consider weighted combinations, even for linear combinations. Hence, an obvious question is whether we can automate the search for interesting patterns… ▽ More

    Submitted 12 October, 2017; originally announced October 2017.

    Comments: 12 pages, 10 figures, 2 tables, conference submission

  7. Optimizing Phylogenetic Supertrees Using Answer Set Programming

    Authors: Laura Koponen, Emilia Oikarinen, Tomi Janhunen, Laura Säilä

    Abstract: The supertree construction problem is about combining several phylogenetic trees with possibly conflicting information into a single tree that has all the leaves of the source trees as its leaves and the relationships between the leaves are as consistent with the source trees as possible. This leads to an optimization problem that is computationally challenging and typically heuristic methods, suc… ▽ More

    Submitted 19 July, 2015; originally announced July 2015.

    Comments: To appear in Theory and Practice of Logic Programming (TPLP), Proceedings of ICLP 2015

    MSC Class: 68T30

    Journal ref: Theory and Practice of Logic Programming 15 (2015) 604-619

  8. arXiv:1401.3484  [pdf

    cs.LO cs.AI

    Modularity Aspects of Disjunctive Stable Models

    Authors: Tomi Janhunen, Emilia Oikarinen, Hans Tompits, Stefan Woltran

    Abstract: Practically all programming languages allow the programmer to split a program into several modules which brings along several advantages in software development. In this paper, we are interested in the area of answer-set programming where fully declarative and nonmonotonic languages are applied. In this context, obtaining a modular structure for programs is by no means straightforward since the ou… ▽ More

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 35, pages 813-857, 2009

  9. arXiv:0809.4582  [pdf, ps, other

    cs.AI

    Achieving compositionality of the stable model semantics for Smodels programs

    Authors: Emilia Oikarinen, Tomi Janhunen

    Abstract: In this paper, a Gaifman-Shapiro-style module architecture is tailored to the case of Smodels programs under the stable model semantics. The composition of Smodels program modules is suitably limited by module conditions which ensure the compatibility of the module system with stable models. Hence the semantics of an entire Smodels program depends directly on stable models assigned to its module… ▽ More

    Submitted 26 September, 2008; originally announced September 2008.

    Comments: 44 pages, 2 tables

  10. Extended ASP tableaux and rule redundancy in normal logic programs

    Authors: Matti Järvisalo, Emilia Oikarinen

    Abstract: We introduce an extended tableau calculus for answer set programming (ASP). The proof system is based on the ASP tableaux defined in [Gebser&Schaub, ICLP 2006], with an added extension rule. We investigate the power of Extended ASP Tableaux both theoretically and empirically. We study the relationship of Extended ASP Tableaux with the Extended Resolution proof system defined by Tseitin for sets… ▽ More

    Submitted 18 September, 2008; originally announced September 2008.

    Comments: 27 pages, 5 figures, 1 table

    Journal ref: Theory and Practice of Logic Programming, 8(5-6):691-716, 2008

  11. arXiv:cs/0608099  [pdf, ps, other

    cs.AI cs.LO

    Automated verification of weak equivalence within the SMODELS system

    Authors: Tomi Janhunen, Emilia Oikarinen

    Abstract: In answer set programming (ASP), a problem at hand is solved by (i) writing a logic program whose answer sets correspond to the solutions of the problem, and by (ii) computing the answer sets of the program using an answer set solver as a search engine. Typically, a programmer creates a series of gradually improving logic programs for a particular problem when optimizing program length and execu… ▽ More

    Submitted 25 August, 2006; originally announced August 2006.

    Comments: 48 pages, 7 figures, 2 tables

    ACM Class: I.2.4; F.4.1; F.2.2