Skip to main content

Showing 1–6 of 6 results for author: Hallac, İ R

.
  1. arXiv:1802.03821  [pdf

    cs.DC cs.CL

    Distributed Readability Analysis Of Turkish Elementary School Textbooks

    Authors: Betul Karakus, Ibrahim Riza Hallac, Galip Aydin

    Abstract: The readability assessment deals with estimating the level of difficulty in reading texts.Many readability tests, which do not indicate execution efficiency, have been applied on specific texts to measure the reading grade level in science textbooks. In this paper, we analyze the content covered in elementary school Turkish textbooks by employing a distributed parallel processing framework based o… ▽ More

    Submitted 11 February, 2018; originally announced February 2018.

    Comments: Proceedings of International Conference on Information Technology and Computer Science July 11-12, 2015, ISBN:9788193137307

  2. arXiv:1802.03606  [pdf

    cs.DC cs.CL

    Distributed NLP

    Authors: Galip Aydin, Ibrahim Riza Hallac

    Abstract: In this paper we present the performance of parallel text processing with Map Reduce on a cloud platform. Scientific papers in Turkish language are processed using Zemberek NLP library. Experiments were run on a Hadoop cluster and compared with the single machines performance.

    Submitted 10 February, 2018; originally announced February 2018.

    Comments: Presented at Third International Symposium on Innovative Technologies in Engineering and Science 3-5 June, 2015, Valencia, Spain

  3. Running genetic algorithms on Hadoop for solving high dimensional optimization problems

    Authors: Güngör Yildirim, İbrahim R Hallac, Galip Aydin, Yetkin Tatar

    Abstract: Hadoop is a popular MapReduce framework for develo** parallel applications in distributed environments. Several advantages of MapReduce such as programming ease and ability to use commodity hardware make the applicability of soft computing methods for parallel and distributed systems easier than before. In this paper, we present the results of an experimental study on running soft computing algo… ▽ More

    Submitted 10 February, 2018; originally announced February 2018.

  4. Document Classification Using Distributed Machine Learning

    Authors: Galip Aydin, Ibrahim Riza Hallac

    Abstract: In this paper, we investigate the performance and success rates of Naïve Bayes Classification Algorithm for automatic classification of Turkish news into predetermined categories like economy, life, health etc. We use Apache Big Data technologies such as Hadoop, HDFS, Spark and Mahout, and apply these distributed technologies to Machine Learning.

    Submitted 10 February, 2018; originally announced February 2018.

  5. Distributed Log Analysis on the Cloud Using MapReduce

    Authors: Galip Aydin, Ibrahim Riza Hallac

    Abstract: In this paper we describe our work on designing a web based, distributed data analysis system based on the popular MapReduce framework deployed on a small cloud; developed specifically for analyzing web server logs. The log analysis system consists of several cluster nodes, it splits the large log files on a distributed file system and quickly processes them using MapReduce programming model. The… ▽ More

    Submitted 10 February, 2018; originally announced February 2018.

  6. Preparation of Improved Turkish DataSet for Sentiment Analysis in Social Media

    Authors: Semiha Makinist, Ibrahim Riza Hallac, Betul Ay Karakus, Galip Aydin

    Abstract: A public dataset, with a variety of properties suitable for sentiment analysis [1], event prediction, trend detection and other text mining applications, is needed in order to be able to successfully perform analysis studies. The vast majority of data on social media is text-based and it is not possible to directly apply machine learning processes into these raw data, since several different proce… ▽ More

    Submitted 31 January, 2018; v1 submitted 30 January, 2018; originally announced January 2018.

    Comments: Presented at CMES2017