Search | arXiv e-print repository

Optimal Nonlinearities Improve Generalization Performance of Random Features

Abstract: Random feature model with a nonlinear activation function has been shown to perform asymptotically equivalent to a Gaussian model in terms of training and generalization errors. Analysis of the equivalent model reveals an important yet not fully understood role played by the activation function. To address this issue, we study the "parameters" of the equivalent model to achieve improved generaliza… ▽ More Random feature model with a nonlinear activation function has been shown to perform asymptotically equivalent to a Gaussian model in terms of training and generalization errors. Analysis of the equivalent model reveals an important yet not fully understood role played by the activation function. To address this issue, we study the "parameters" of the equivalent model to achieve improved generalization performance for a given supervised learning problem. We show that acquired parameters from the Gaussian model enable us to define a set of optimal nonlinearities. We provide two example classes from this set, e.g., second-order polynomial and piecewise linear functions. These functions are optimized to improve generalization performance regardless of the actual form. We experiment with regression and classification problems, including synthetic and real (e.g., CIFAR10) data. Our numerical results validate that the optimized nonlinearities achieve better generalization performance than widely-used nonlinear functions such as ReLU. Furthermore, we illustrate that the proposed nonlinearities also mitigate the so-called double descent phenomenon, which is known as the non-monotonic generalization performance regarding the sample size and the model size. △ Less

Submitted 28 September, 2023; originally announced September 2023.

Comments: ACML 2023

arXiv:2209.04033 [pdf, other]

Banach space valued $H^p$ spaces with $A_p$ weight

Authors: Sakin Demir

Abstract: In this research we introduce the Banach space valued $H^p$ spaces with $A_p$ weight, and prove the following results: Let $\mathbb{A}$ and $\mathbb{B}$ Banach spaces, and $T$ be a convolution operator map** $\mathbb{A}$-valued functions into $\mathbb{B}$-valued functions, i.e., $$Tf(x)=\int_{\mathbb{R}^n}K(x-y)\cdot f(y)\, dy,$$ where $K$ is a strongly measurable function defined on… ▽ More In this research we introduce the Banach space valued $H^p$ spaces with $A_p$ weight, and prove the following results: Let $\mathbb{A}$ and $\mathbb{B}$ Banach spaces, and $T$ be a convolution operator map** $\mathbb{A}$-valued functions into $\mathbb{B}$-valued functions, i.e., $$Tf(x)=\int_{\mathbb{R}^n}K(x-y)\cdot f(y)\, dy,$$ where $K$ is a strongly measurable function defined on $\mathbb{R}^n$ such that $\|K(x)\|_{\mathbb{B}}$ is locally integrable away from the origin. Suppose that $w$ is a positive weight function defined on $\mathbb{R}^n$, and that i) For some $q\in [1, \infty ]$, there exists a positive constant $C_1$ such that $$\int_{\mathbb{R}^n}\|Tf(x)\|^q_{\mathbb{B}}w(x)\, dx\leq C_1\int_{\mathbb{R}^n}\|f(x)\|_{\mathbb{A}}^q w(x)\,dx$$ for all $f\in L^q_{\mathbb{A}}(\mathbb{R}^n)$. ii) There exists a positive constant $C_2$ independent of $y\in\mathbb{R}^n$ such that $$\int_{|x|>2|y|}\|K(x-y)-K(x)\|_{\mathbb{B}}\, dx<C_2.$$ Then there exists a positive constant $C_3$ such that $$\|Tf\|_{L^1_{\mathbb{B}}(w)}\leq C_3\|f\|_{H^1_{\mathbb{A}}(w)}$$ for all $f\in H^1_{\mathbb{A}}(w)$. Let $w\in A_1$. Assume that $K\in L_{\rm{loc}}(\mathbb{R}^n\backslash \{0\})$ satisfies $$\|K\ast f\|_{L^2_{\mathbb{B}}(w)}\leq C_1\|f\|_{L^2_{\mathbb{A}}(w)}$$ and $$\int_{|x|\geq C_2|y|}\|K(x-y)-K(x)\|_{\mathbb{B}}w(x+h)\, dx\leq C_3w(y+h)\;\;\;(\forall y\neq 0, \forall h\in\mathbb{R}^n) $$ for certain absolute constants $C_1$, $C_2$, and $C_3$. Then there exists a positive constant $C$ independent of $f$ such that $$\|K\ast f\|_{L^1_{\mathbb{B}}(w)}\leq C\|f\|_{H^1_{\mathbb{A}}(w)}$$ for all $f\in H^1_{\mathbb{A}}(w)$. △ Less

Submitted 5 January, 2023; v1 submitted 2 August, 2022; originally announced September 2022.

MSC Class: 42B30; 42B20

arXiv:2203.13905 [pdf, other]

Variaiton and $λ$-jump inequalities on $H^p$ spaces

Authors: Sakin Demir

Abstract: Let $φ\in \mathscr{S}$ with $\intφ(x)\, dx=1$, and define $$φ_t(x)=\frac{1}{t^n}φ(\frac{x}{t}),$$ and denote the function family $\{φ_t\ast f(x)\}_{t>0}$ by $Φ\ast f(x)$. Suppose that there exists a constant $C_1$ such that $$\sum_{t>0} |\hatφ_t(x)|^2<C_1$$ for all $x\in \mathbb{R}^n$. Then (i) There exists a constant $C_2>0$ such that… ▽ More Let $φ\in \mathscr{S}$ with $\intφ(x)\, dx=1$, and define $$φ_t(x)=\frac{1}{t^n}φ(\frac{x}{t}),$$ and denote the function family $\{φ_t\ast f(x)\}_{t>0}$ by $Φ\ast f(x)$. Suppose that there exists a constant $C_1$ such that $$\sum_{t>0} |\hatφ_t(x)|^2<C_1$$ for all $x\in \mathbb{R}^n$. Then (i) There exists a constant $C_2>0$ such that $$\|\mathscr{V}_2(Φ\ast f)\|_{L^p}\leq C_2\|f\|_{H^p},\;\;\frac{n}{n+1}<p\leq 1$$ for all $f\in H^p(\mathbb{R}^n)$, $\frac{n}{n+1}<p\leq 1$. (ii) The $λ$-jump operator $N_λ(Φ\ast f)$ satisfies $$\|λ[N_λ(Φ\ast f)]^{1/2}\|_{L^p}\leq C_3\|f\|_{H^p},\;\;\frac{n}{n+1}<p\leq 1,$$ uniformly in $λ>0$ for some constant $C_3>0$. △ Less

Submitted 6 September, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

MSC Class: 42B25; 42B30

arXiv:2203.02154 [pdf, other]

Variational Inequalities For The Differences Of Averages Over Lacunary Sequences

Authors: Sakin Demir

Abstract: Let $f$ be a locally integrable function defined on $\mathbb{R}$, and let $(n_k)$ be a lacunary sequence. Define the operator $A_{n_k}$ by $$A_{n_k}f(x)=\frac{1}{n_k}\int_0^{n_k}f(x-t)\, dt.$$ We prove various types of new inequalities for the variation operator $$\mathcal{V}_sf(x)=\left(\sum_{k=1}^\infty|A_{n_k}f(x)-A_{n_{k-1}}f(x)|^s\right)^{1/s}$$ when $2\leq s<\infty$. Let $f$ be a locally integrable function defined on $\mathbb{R}$, and let $(n_k)$ be a lacunary sequence. Define the operator $A_{n_k}$ by $$A_{n_k}f(x)=\frac{1}{n_k}\int_0^{n_k}f(x-t)\, dt.$$ We prove various types of new inequalities for the variation operator $$\mathcal{V}_sf(x)=\left(\sum_{k=1}^\infty|A_{n_k}f(x)-A_{n_{k-1}}f(x)|^s\right)^{1/s}$$ when $2\leq s<\infty$. △ Less

Submitted 25 May, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

MSC Class: 26D07; 26D15; 42B20

Journal ref: New York Journal of Mathematics, Vol. 28, 2022, pp. 1099-1111

arXiv:2107.14030 [pdf, other]

Variation and oscillation inequalities for operator averages on a complex Hilbert space

Authors: Sakin Demir

Abstract: Let $\mathcal{H}$ be a complex Hilbert space and $T:\mathcal{H}\to \mathcal{H}$ be a contraction. Let $$A_nf=\frac{1}{n}\sum_{j=1}^nT^jf$$ for $f\in \mathcal{H}$. Let $(n_k)$ be a sequence satisfying $β\geq n_{k+1}/n_k\geq α>1$ for all $k\geq 1$, then there exists a constant $C_1>0$ such that $$\sum_{k=1}^\infty\|A_{n_{k+1}}f-A_{n_k}f\|_{\mathcal{H}}\leq C_1\|f\|_{\mathcal{H}}$$ for all… ▽ More Let $\mathcal{H}$ be a complex Hilbert space and $T:\mathcal{H}\to \mathcal{H}$ be a contraction. Let $$A_nf=\frac{1}{n}\sum_{j=1}^nT^jf$$ for $f\in \mathcal{H}$. Let $(n_k)$ be a sequence satisfying $β\geq n_{k+1}/n_k\geq α>1$ for all $k\geq 1$, then there exists a constant $C_1>0$ such that $$\sum_{k=1}^\infty\|A_{n_{k+1}}f-A_{n_k}f\|_{\mathcal{H}}\leq C_1\|f\|_{\mathcal{H}}$$ for all $f\in \mathcal{H}$. Let $(n_k)$ be a sequence satisfying $β\geq n_{k+1}/n_k\geq α>1$ for all $k\geq 1$, and let $M$ be any sequence. Then there exists a constant $C_2>0$ such that $$\sum_{k=1}^\infty\sup_{\substack{n_k\leq m< n_{k+1}\\m\in M}}\|A_m(T)f-A_{n_k}(T)f\|_{H}\leq C_2\|f\|_{\mathcal{H}}$$ for all $f\in \mathcal{H}$. △ Less

Submitted 8 January, 2024; v1 submitted 29 July, 2021; originally announced July 2021.

MSC Class: 47B02; 37A30

arXiv:2101.03910 [pdf, other]

Unconditional convergence of the differences of Fejér kernels on $L^2(\mathbb{R})$

Authors: Sakin Demir

Abstract: Let $K_n(x)$ denote the Fejér kernel given by $$K_n(x)=\sum_{j=-n}^n\left(1-\frac{|j|}{n+1}\right)e^{-ijx}$$ and let $σ_nf(x)=(K_n\ast f)(x)$, where as usual $f\ast g$ denotes the convolution of $f$ and $g$. Let the sequence $\{n_k\}$ be lacunary. Then the series $$\mathcal{G}f(x)=\sum_{k=1}^\infty \left(σ_{n_{k+1}}f(x)-σ_{n_k}f(x)\right)$$ converges unconditionally for all $f\in L^2(\mathbb{R})$.… ▽ More Let $K_n(x)$ denote the Fejér kernel given by $$K_n(x)=\sum_{j=-n}^n\left(1-\frac{|j|}{n+1}\right)e^{-ijx}$$ and let $σ_nf(x)=(K_n\ast f)(x)$, where as usual $f\ast g$ denotes the convolution of $f$ and $g$. Let the sequence $\{n_k\}$ be lacunary. Then the series $$\mathcal{G}f(x)=\sum_{k=1}^\infty \left(σ_{n_{k+1}}f(x)-σ_{n_k}f(x)\right)$$ converges unconditionally for all $f\in L^2(\mathbb{R})$. Let $(n_k)$ be a lacunary sequence, and $\{c_k\}_{k=1}^\infty \in \ell^\infty$. Define $$\mathcal{R}f(x)=\sum_{k=1}^\infty c_k\left(σ_{n_{k+1}}f(x)-σ_{n_k}f(x)\right).$$ Then there exists a constant $C>0$ such that $$\|\mathcal{R}f\|_2\leq C\|f\|_2$$ for all $f\in L^2(\mathbb{R})$, i.e., $\mathcal{R}f$ is of strong type $(2,2)$. As a special case it follows that $\mathcal{G}f$ also is of strong type $(2,2)$. △ Less

Submitted 8 June, 2022; v1 submitted 5 January, 2021; originally announced January 2021.

Comments: This is a revised version of the present paper

MSC Class: 42A24; 26D05

arXiv:2009.05822 [pdf, other]

Complete convergence of the Hilbert transform

Authors: Sakin Demir

Abstract: Suppose that $\{a_j\}\in \ell^1$, and suppose that for any sequence $(t_n)$ of integers there exits a constant $C_1>0$ such that $$\sharp\left\{k\in\mathbb{Z}:\sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n-t_n} \!\!\!\raise{1.9ex}\hbox{$\scriptsize\prime… ▽ More Suppose that $\{a_j\}\in \ell^1$, and suppose that for any sequence $(t_n)$ of integers there exits a constant $C_1>0$ such that $$\sharp\left\{k\in\mathbb{Z}:\sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n-t_n} \!\!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{a_{k+i}}{i}\right|>λ\right\}\\ \leq C_1\sharp\left\{k\in\mathbb{Z}:\sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n} \!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{a_{k+i}}{i}\right|>λ\right\},$$ for all $λ>0$, where $\mathcal{B}_n=\{-n, -(n-1), -(n-2),\dots , n-2, n-1, n\}$. Then there is a constant $C_2>0$ which does not depend on the sequence $\{a_j\}$ such that $$\sum_{n=1}^\infty\sharp\left\{k\in\mathbb{Z}:\left|\sum_{i=-n}^{n} \!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{a_{k+i}}{i}\right|>λ\right\}\leq\frac{C_2}λ\sum_{i=-\infty}^{\infty}|a_i|$$ for all $λ>0$. Let $(X,\mathscr{B},μ)$ be a measure space, $τ:X\to X$ an invertible measure-preserving transformation, and suppose that $f\in L^1(X)$ such that for any sequence $(t_n)$ of integers there exists a constant $C_1>0$ such that $$μ\left\{ x: \sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n-t_n}\!\!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{f(τ^ix)}{i}\right| >λ\right\}\leq C_1μ\left\{x: \sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n}\!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{f(τ^i x)}{i}\right|>λ\right\} $$ for all $λ>0$, where $\mathcal{B}_n=\{-n, -(n-1), -(n-2),\dots , n-2, n-1, n\}$. Then there exists a constant $C_2>0$ which does not depend on $f$ such that $$\sum_{n=1}^\inftyμ\left\{x:\left|\sum_{i=-n}^{n}\!\!\raise{1.9ex}\hbox{$\scriptsize\prime$} \;\frac{f(τ^ix)}{i}\right|>λ\right\}\leq\frac{C_2}λ\|f\|_1$$ for all $λ>0$. △ Less

Submitted 3 August, 2022; v1 submitted 12 September, 2020; originally announced September 2020.

MSC Class: 26D07; 47A35

arXiv:2006.13216 [pdf, other]

Oscillation inequalities on real and ergodic $H^1$ spaces

Authors: Sakin Demir

Abstract: Let $(x_n)$ be a sequence and $ρ\geq 1$. For a fixed sequences $n_1<n_2<n_3<\dots$, and $M$ define the oscillation operators $$\mathcal{O}_ρ(x_n)=\left(\sum_{k=1}^\infty\sup_{\substack{n_k\leq m< n_{k+1}\\m\in M}}\left|x_m-x_{n_k}\right|^ρ\right)^{1/ρ}.$$ Let $(X,\mathscr{B} ,μ, τ)$ be a dynamical system with $(X,\mathscr{B} ,μ)$ a probability space and $τ$ a measurable, invertible, measure preser… ▽ More Let $(x_n)$ be a sequence and $ρ\geq 1$. For a fixed sequences $n_1<n_2<n_3<\dots$, and $M$ define the oscillation operators $$\mathcal{O}_ρ(x_n)=\left(\sum_{k=1}^\infty\sup_{\substack{n_k\leq m< n_{k+1}\\m\in M}}\left|x_m-x_{n_k}\right|^ρ\right)^{1/ρ}.$$ Let $(X,\mathscr{B} ,μ, τ)$ be a dynamical system with $(X,\mathscr{B} ,μ)$ a probability space and $τ$ a measurable, invertible, measure preserving point transformation from $X$ to itself.\\ Suppose that the sequences $(n_k)$ and $M$ are lacunary. Then we prove the following results for $ρ\geq 2$: (i) Define $φ_n(x)=\frac{1}{n}χ_{[0,n]}(x)$ on $\mathbb{R}$. Then there exists a constant $C>0$ such that $\|\mathcal{O}_ρ(φ_n\ast f)\|_{L^1(\mathbb{R})}\leq C\|f\|_{H^1(\mathbb{R})}$ for all $f\in H^1(\mathbb{R})$. (ii) Let $A_nf(x)=\frac{1}{n}\sum_{k=1}^nf(τ^kx)$ be the usual ergodic averages in ergodic theory. Then $\|\mathcal{O}_ρ(A_nf)\|_{L^1(X)}\leq C\|f\|_{H^1(X)}$ for all $f\in H^1(X)$. (iii) If $[f(x)\log (x)]^+$ is integrable, then $\mathcal{O}_ρ(A_nf)$ is integrable. △ Less

Submitted 26 May, 2022; v1 submitted 23 June, 2020; originally announced June 2020.

MSC Class: 42B20; 28D05; 42B30

Journal ref: Russian Mathematics, 2023, Vol. 67, No.3, pp. 42-52

arXiv:2006.08162 [pdf, other]

doi 10.3906/elk-1807-287

Filter design for small target detection on infrared imagery using normalized-cross-correlation layer

Authors: H. Seçkin Demir, Erdem Akagunduz

Abstract: In this paper, we introduce a machine learning approach to the problem of infrared small target detection filter design. For this purpose, similarly to a convolutional layer of a neural network, the normalized-cross-correlational (NCC) layer, which we utilize for designing a target detection/recognition filter bank, is proposed. By employing the NCC layer in a neural network structure, we introduc… ▽ More In this paper, we introduce a machine learning approach to the problem of infrared small target detection filter design. For this purpose, similarly to a convolutional layer of a neural network, the normalized-cross-correlational (NCC) layer, which we utilize for designing a target detection/recognition filter bank, is proposed. By employing the NCC layer in a neural network structure, we introduce a framework, in which supervised training is used to calculate the optimal filter shape and the optimum number of filters required for a specific target detection/recognition task on infrared images. We also propose the mean-absolute-deviation NCC (MAD-NCC) layer, an efficient implementation of the proposed NCC layer, designed especially for FPGA systems, in which square root operations are avoided for real-time computation. As a case study we work on dim-target detection on mid-wave infrared imagery and obtain the filters that can discriminate a dim target from various types of background clutter, specific to our operational concept. △ Less

Submitted 15 June, 2020; originally announced June 2020.

Journal ref: published in Turkish Journal of Electrical Engineering and Computer Sciences, vol.28, 302:317, 2020

arXiv:2005.07692 [pdf, other]

An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition

Authors: Gizem Aras, Didem Makaroglu, Seniz Demir, Altan Cakir

Abstract: Named entity recognition (NER) is an extensively studied task that extracts and classifies named entities in a text. NER is crucial not only in downstream language processing applications such as relation extraction and question answering but also in large scale big data operations such as real-time analysis of online digital media content. Recent research efforts on Turkish, a less studied langua… ▽ More Named entity recognition (NER) is an extensively studied task that extracts and classifies named entities in a text. NER is crucial not only in downstream language processing applications such as relation extraction and question answering but also in large scale big data operations such as real-time analysis of online digital media content. Recent research efforts on Turkish, a less studied language with morphologically rich nature, have demonstrated the effectiveness of neural architectures on well-formed texts and yielded state-of-the art results by formulating the task as a sequence tagging problem. In this work, we empirically investigate the use of recent neural architectures (Bidirectional long short-term memory and Transformer-based networks) proposed for Turkish NER tagging in the same setting. Our results demonstrate that transformer-based networks which can model long-range context overcome the limitations of BiLSTM networks where different input features at the character, subword, and word levels are utilized. We also propose a transformer-based network with a conditional random field (CRF) layer that leads to the state-of-the-art result (95.95\% f-measure) on a common dataset. Our study contributes to the literature that quantifies the impact of transfer learning on processing morphologically rich languages. △ Less

Submitted 18 May, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

Comments: Submitted to Expert Systems with Applications

Report number: ITUAI08

arXiv:2004.00462 [pdf, other]

An extension of Calderon Transfer Principle

Authors: Sakin Demir

Abstract: We first prove that the well known transfer principle of A. P. Calderón can be extended to the vector-valued setting and then we apply this extension to vector-valued inequalities for the Hardy-Littlewood maximal function to prove the vector-valued strong type $L^p$ norm inequalities for $1<p<\infty$ and the vector-valued weak type $(1,1)$ inequality for ergodic maximal function. We first prove that the well known transfer principle of A. P. Calderón can be extended to the vector-valued setting and then we apply this extension to vector-valued inequalities for the Hardy-Littlewood maximal function to prove the vector-valued strong type $L^p$ norm inequalities for $1<p<\infty$ and the vector-valued weak type $(1,1)$ inequality for ergodic maximal function. △ Less

Submitted 31 March, 2020; originally announced April 2020.

MSC Class: 47A35; 28D05; 47A64

Journal ref: Asian Journal of Mathematical Sciences, Vol. 4, Issue 2, 2020, pp. 15-18

arXiv:2002.07589 [pdf, other]

A transfer principle in ergodic theory on weighted spaces

Authors: Sakin Demir

Abstract: The power of Calderón transfer principle is well known when proving strong type and weak type inequalities for certain type of operators in ergodic theory. In this article we show that Calderón's argument can be extended to have a transfer principle to be able to prove weighted inequalities for those operators satisfying the conditions of Calderón transfer principle. We also include some applicati… ▽ More The power of Calderón transfer principle is well known when proving strong type and weak type inequalities for certain type of operators in ergodic theory. In this article we show that Calderón's argument can be extended to have a transfer principle to be able to prove weighted inequalities for those operators satisfying the conditions of Calderón transfer principle. We also include some applications of our results. △ Less

Submitted 24 February, 2024; v1 submitted 6 February, 2020; originally announced February 2020.

MSC Class: 47A35; 28D05; 47A64

arXiv:2002.01430 [pdf, other]

A Sufficient Condition For An Operator To Map $uL^\infty$ to ${\rm{BMO}}_u$

Authors: Sakin Demir

Abstract: Let $T$ be an operator and suppose that there exists a positive constant $C$ such that $$\left(\int_I|Tf(x)|^q\, dx\right)^{1/q}\leq C\left(\int_I|f(x)|^q\, dx\right)^{1/q}$$ for every $q$ which is near enough to $1$ and for every interval $I$ in $\mathbb{R}$ and $f\in L^{\infty}(\mathbb{R})$. Then we show that $T$ maps $uL^{\infty}$ to ${\rm{BMO}}_u$. Let $T$ be an operator and suppose that there exists a positive constant $C$ such that $$\left(\int_I|Tf(x)|^q\, dx\right)^{1/q}\leq C\left(\int_I|f(x)|^q\, dx\right)^{1/q}$$ for every $q$ which is near enough to $1$ and for every interval $I$ in $\mathbb{R}$ and $f\in L^{\infty}(\mathbb{R})$. Then we show that $T$ maps $uL^{\infty}$ to ${\rm{BMO}}_u$. △ Less

Submitted 29 January, 2022; v1 submitted 2 February, 2020; originally announced February 2020.

MSC Class: 47B38

Journal ref: International J. Functional Analysis, Operator Theory and Applications, Vol. 14, 2022, pp. 13-17

arXiv:2001.09316 [pdf, other]

Inequalities For Variation Operator

Authors: Sakin Demir

Abstract: Let $f$ be a measurable function defined on $\mathbb{R}$. For each $n\in\mathbb{Z}$ define the operator $A_n$ by $$A_nf(x)=\frac{1}{2^n}\int_x^{x+2^n}f(y)\, dy.$$ Consider the variation operator $$\mathcal{V}f(x)=\left(\sum_{n=-\infty}^\infty|A_nf(x)-A_{n-1}f(x)|^s\right)^{1/s}$$ for $2\leq s<\infty$. It has been proved in \cite{jkw1} that $\mathcal{V}$ is of strong type $(p,p)$ for $1<p<\infty$ a… ▽ More Let $f$ be a measurable function defined on $\mathbb{R}$. For each $n\in\mathbb{Z}$ define the operator $A_n$ by $$A_nf(x)=\frac{1}{2^n}\int_x^{x+2^n}f(y)\, dy.$$ Consider the variation operator $$\mathcal{V}f(x)=\left(\sum_{n=-\infty}^\infty|A_nf(x)-A_{n-1}f(x)|^s\right)^{1/s}$$ for $2\leq s<\infty$. It has been proved in \cite{jkw1} that $\mathcal{V}$ is of strong type $(p,p)$ for $1<p<\infty$ and is of weak type $(1,1)$, it maps $L^\infty$ to BMO. We first provide a completely different proofs for these known results and in addition we prove that $\mathcal{V}$ maps $H^1$ to $L^1$. Furthermore, we prove that it satisfies vector-valued weighted strong type and weak type inequalities. As a special case it follows that $\mathcal{V}$ satisfies weighted strong type and weak type inequalities. △ Less

Submitted 25 January, 2020; originally announced January 2020.

MSC Class: 26D07; 26D15; 42B20

Journal ref: Bulletin of the Hellenic Mathematical Society, Vol. 64, 2020, pp. 92.-97

arXiv:1912.01982 [pdf, other]

Neural Academic Paper Generation

Authors: Samet Demir, Uras Mutlu, Özgur Özdemir

Abstract: In this work, we tackle the problem of structured text generation, specifically academic paper generation in $\LaTeX{}$, inspired by the surprisingly good results of basic character-level language models. Our motivation is using more recent and advanced methods of language modeling on a more complex dataset of $\LaTeX{}$ source files to generate realistic academic papers. Our first contribution is… ▽ More In this work, we tackle the problem of structured text generation, specifically academic paper generation in $\LaTeX{}$, inspired by the surprisingly good results of basic character-level language models. Our motivation is using more recent and advanced methods of language modeling on a more complex dataset of $\LaTeX{}$ source files to generate realistic academic papers. Our first contribution is preparing a dataset with $\LaTeX{}$ source files on recent open-source computer vision papers. Our second contribution is experimenting with recent methods of language modeling and text generation such as Transformer and Transformer-XL to generate consistent $\LaTeX{}$ code. We report cross-entropy and bits-per-character (BPC) results of the trained models, and we also discuss interesting points on some examples of the generated $\LaTeX{}$ code. △ Less

Submitted 2 December, 2019; originally announced December 2019.

arXiv:1911.10621 [pdf, other]

DeepSmartFuzzer: Reward Guided Test Generation For Deep Learning

Authors: Samet Demir, Hasan Ferit Eniser, Alper Sen

Abstract: Testing Deep Neural Network (DNN) models has become more important than ever with the increasing usage of DNN models in safety-critical domains such as autonomous cars. The traditional approach of testing DNNs is to create a test set, which is a random subset of the dataset about the problem of interest. This kind of approach is not enough for testing most of the real-world scenarios since these t… ▽ More Testing Deep Neural Network (DNN) models has become more important than ever with the increasing usage of DNN models in safety-critical domains such as autonomous cars. The traditional approach of testing DNNs is to create a test set, which is a random subset of the dataset about the problem of interest. This kind of approach is not enough for testing most of the real-world scenarios since these traditional test sets do not include corner cases, while a corner case input is generally considered to introduce erroneous behaviors. Recent works on adversarial input generation, data augmentation, and coverage-guided fuzzing (CGF) have provided new ways to extend traditional test sets. Among those, CGF aims to produce new test inputs by fuzzing existing ones to achieve high coverage on a test adequacy criterion (i.e. coverage criterion). Given that the subject test adequacy criterion is a well-established one, CGF can potentially find error inducing inputs for different underlying reasons. In this paper, we propose a novel CGF solution for structural testing of DNNs. The proposed fuzzer employs Monte Carlo Tree Search to drive the coverage-guided search in the pursuit of achieving high coverage. Our evaluation shows that the inputs generated by our method result in higher coverage than the inputs produced by the previously introduced coverage-guided fuzzing techniques. △ Less

Submitted 24 November, 2019; originally announced November 2019.

arXiv:1909.02889 [pdf, other]

Identification of 2-bridge links

Authors: Ali Sait Demir

Abstract: We find all 2-Bridge links up to 11 crossings and locate them in Thistlethwaite's link table. The splitting numbers of some links are calculated as a consequence of this identification. We find all 2-Bridge links up to 11 crossings and locate them in Thistlethwaite's link table. The splitting numbers of some links are calculated as a consequence of this identification. △ Less

Submitted 23 September, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

MSC Class: 52M25; 52M27

arXiv:1801.08903 [pdf, ps, other]

Covering morphisms of internal groupoids in the models of a semi-abelian theory

Authors: Osman Mucuk, Serap Demir

Abstract: In this paper, for given an algebraic theory $T$ whose category $C$ of models is semi-abelian, we consider the topological models of $T$ called topological $T$-algebras and obtain some results related to the fundamental groups of topological $T$-algebras. We also deal with the internal groupoid structure in the category of models providing that the fundamental groupoid deduces a functor from topol… ▽ More In this paper, for given an algebraic theory $T$ whose category $C$ of models is semi-abelian, we consider the topological models of $T$ called topological $T$-algebras and obtain some results related to the fundamental groups of topological $T$-algebras. We also deal with the internal groupoid structure in the category of models providing that the fundamental groupoid deduces a functor from topological $T$-algebras to the internal groupoids in $C$ and prove a criterion for the lifting of such an internal groupoid structure to the covering groupoids. △ Less

Submitted 26 January, 2018; originally announced January 2018.

Comments: 19 pages, research paper, LaTeX2e, xypic

MSC Class: 18D35; 18B30; 22A05; 57M10

arXiv:1801.08900 [pdf, ps, other]

Topological aspect of monodromy groupoid for a group-groupoid

Authors: Osman Mucuk, Serap Demir

Abstract: In this paper we develop star topological and topological group-groupoid structures of monodromy groupoid and prove that the monodromy groupoid of a topological group-groupoid is also a topological group-groupoid. In this paper we develop star topological and topological group-groupoid structures of monodromy groupoid and prove that the monodromy groupoid of a topological group-groupoid is also a topological group-groupoid. △ Less

Submitted 26 January, 2018; originally announced January 2018.

Comments: 13 pages, research paper, LaTeX2e, xypic

MSC Class: 20L05; 57M10; 22AXX; 22A22

arXiv:1801.08762 [pdf, ps, other]

Normality and quotient in crossed modules over groupoids and double groupoids

Authors: Osman Mucuk, Serap Demir

Abstract: We consider the categorical equivalence between crossed modules over groupoids and double groupoids with thin structures; and by this equivalence, we prove how normality and quotient concepts are related in these two categories and give some examples of these objects. We consider the categorical equivalence between crossed modules over groupoids and double groupoids with thin structures; and by this equivalence, we prove how normality and quotient concepts are related in these two categories and give some examples of these objects. △ Less

Submitted 26 January, 2018; originally announced January 2018.

Comments: 14 pages, research paper, LaTeX2e, xypic

MSC Class: 20L05; 22A22; 18D35

arXiv:1706.05015

Optical Characteristics for Inductively RF Discharge and Post-Discharge of Pure Neon at Low Pressure

Authors: Murat Tanisli, Neslihan Sahin, Suleyman Demir

Abstract: Electron temperature for inductively RF post-discharge of pure neon (Ne) with a newly reactor design was presented in comparison for two different methods. Optical emission spectroscopy (OES) was applied for characterizations of inductively RF Ne plasma at pressures between 0.17mbar and 1.4mbar for newly reactor type. Discharge and post-discharge were generated with an RF power supply at a frequen… ▽ More Electron temperature for inductively RF post-discharge of pure neon (Ne) with a newly reactor design was presented in comparison for two different methods. Optical emission spectroscopy (OES) was applied for characterizations of inductively RF Ne plasma at pressures between 0.17mbar and 1.4mbar for newly reactor type. Discharge and post-discharge were generated with an RF power supply at a frequency of 13.56MHz and output powers of 100, 160 and 200W. Spectra were evaluated in the range 200-1200nm by an optical spectrometer. At low pressure, the main spectral features reported were the wavelengths of the atomic Ne transitions at 585.248nm and 724.516nm. The atomic emission intensities showed a maximum in inductive system when the pressure was about 0.77mbar. △ Less

Submitted 28 May, 2019; v1 submitted 15 June, 2017; originally announced June 2017.

Comments: all data of the study was transferred to another study and the results were canceled

arXiv:1108.5410 [pdf, ps, other]

Monodromy groups of real Enriques surfaces

Authors: Sultan Erdoğan Demir

Abstract: We compute the monodromy groups of real Enriques surfaces of hyperbolic type. The principal tools are the deformation classification of such surfaces and a modified version of Donaldson's trick, relating real Enriques surfaces and real rational surfaces. We compute the monodromy groups of real Enriques surfaces of hyperbolic type. The principal tools are the deformation classification of such surfaces and a modified version of Donaldson's trick, relating real Enriques surfaces and real rational surfaces. △ Less

Submitted 5 March, 2013; v1 submitted 26 August, 2011; originally announced August 2011.

Comments: 17 pages, 2 figures, 1 table. A few typos corrected and a few references added

MSC Class: 14P25; 14J28; 14J15

Journal ref: Topology and its Applications 159 (2012), pp. 2580-2591

arXiv:1008.2306 [pdf, ps, other]

doi 10.1088/0954-3899/38/1/015004

Shear Viscosity in a Perturbative Quark-Gluon-Plasma

Authors: John Fuini III, Nasser S. Demir, Dinesh K. Srivastava, Steffen A. Bass

Abstract: Among the key features of hot and dense QCD matter produced in ultra-relativistic heavy-ion collisions at RHIC is its very low shear viscosity, indicative of the properties of a near-ideal fluid, and a large opacity demonstrated by jet energy loss measurements. In this work, we utilize a microscopic transport model based on the Boltzmann equation with quark and gluon degrees of freedom and cross s… ▽ More Among the key features of hot and dense QCD matter produced in ultra-relativistic heavy-ion collisions at RHIC is its very low shear viscosity, indicative of the properties of a near-ideal fluid, and a large opacity demonstrated by jet energy loss measurements. In this work, we utilize a microscopic transport model based on the Boltzmann equation with quark and gluon degrees of freedom and cross sections calculated from perturbative Quantum Chromodynamics to simulate an ideal Quark-Gluon-Plasma in full thermal and chemical equilibrium. We then use the Kubo formalism to calculate the shear viscosity to entropy density ratio of the medium as a function of temperature and system composition. One of our key results is that the shear viscosity over entropy-density ratio $η/s$ becomes invariant to the chemical composition of the system when plotted as a function of energy-density instead of temperature. △ Less

Submitted 23 August, 2010; v1 submitted 13 August, 2010; originally announced August 2010.

Comments: 11 pages, 8 figures: version #2 contains some revisions and added references to clarify relationship to previously published work

Journal ref: J.Phys.G38:015004,2011

Showing 1–23 of 23 results for author: Demir, S