-
Optimal Nonlinearities Improve Generalization Performance of Random Features
Authors:
Samet Demir,
Zafer Doğan
Abstract:
Random feature model with a nonlinear activation function has been shown to perform asymptotically equivalent to a Gaussian model in terms of training and generalization errors. Analysis of the equivalent model reveals an important yet not fully understood role played by the activation function. To address this issue, we study the "parameters" of the equivalent model to achieve improved generaliza…
▽ More
Random feature model with a nonlinear activation function has been shown to perform asymptotically equivalent to a Gaussian model in terms of training and generalization errors. Analysis of the equivalent model reveals an important yet not fully understood role played by the activation function. To address this issue, we study the "parameters" of the equivalent model to achieve improved generalization performance for a given supervised learning problem. We show that acquired parameters from the Gaussian model enable us to define a set of optimal nonlinearities. We provide two example classes from this set, e.g., second-order polynomial and piecewise linear functions. These functions are optimized to improve generalization performance regardless of the actual form. We experiment with regression and classification problems, including synthetic and real (e.g., CIFAR10) data. Our numerical results validate that the optimized nonlinearities achieve better generalization performance than widely-used nonlinear functions such as ReLU. Furthermore, we illustrate that the proposed nonlinearities also mitigate the so-called double descent phenomenon, which is known as the non-monotonic generalization performance regarding the sample size and the model size.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Banach space valued $H^p$ spaces with $A_p$ weight
Authors:
Sakin Demir
Abstract:
In this research we introduce the Banach space valued $H^p$ spaces with $A_p$ weight, and prove the following results: Let $\mathbb{A}$ and $\mathbb{B}$ Banach spaces, and $T$ be a convolution operator map** $\mathbb{A}$-valued functions into $\mathbb{B}$-valued functions, i.e., $$Tf(x)=\int_{\mathbb{R}^n}K(x-y)\cdot f(y)\, dy,$$ where $K$ is a strongly measurable function defined on…
▽ More
In this research we introduce the Banach space valued $H^p$ spaces with $A_p$ weight, and prove the following results: Let $\mathbb{A}$ and $\mathbb{B}$ Banach spaces, and $T$ be a convolution operator map** $\mathbb{A}$-valued functions into $\mathbb{B}$-valued functions, i.e., $$Tf(x)=\int_{\mathbb{R}^n}K(x-y)\cdot f(y)\, dy,$$ where $K$ is a strongly measurable function defined on $\mathbb{R}^n$ such that $\|K(x)\|_{\mathbb{B}}$ is locally integrable away from the origin. Suppose that $w$ is a positive weight function defined on $\mathbb{R}^n$, and that
i) For some $q\in [1, \infty ]$, there exists a positive constant $C_1$ such that $$\int_{\mathbb{R}^n}\|Tf(x)\|^q_{\mathbb{B}}w(x)\, dx\leq C_1\int_{\mathbb{R}^n}\|f(x)\|_{\mathbb{A}}^q w(x)\,dx$$ for all $f\in L^q_{\mathbb{A}}(\mathbb{R}^n)$.
ii) There exists a positive constant $C_2$ independent of $y\in\mathbb{R}^n$ such that $$\int_{|x|>2|y|}\|K(x-y)-K(x)\|_{\mathbb{B}}\, dx<C_2.$$
Then there exists a positive constant $C_3$ such that $$\|Tf\|_{L^1_{\mathbb{B}}(w)}\leq C_3\|f\|_{H^1_{\mathbb{A}}(w)}$$ for all $f\in H^1_{\mathbb{A}}(w)$. Let $w\in A_1$. Assume that $K\in L_{\rm{loc}}(\mathbb{R}^n\backslash \{0\})$ satisfies $$\|K\ast f\|_{L^2_{\mathbb{B}}(w)}\leq C_1\|f\|_{L^2_{\mathbb{A}}(w)}$$ and $$\int_{|x|\geq C_2|y|}\|K(x-y)-K(x)\|_{\mathbb{B}}w(x+h)\, dx\leq C_3w(y+h)\;\;\;(\forall y\neq 0, \forall h\in\mathbb{R}^n) $$ for certain absolute constants $C_1$, $C_2$, and $C_3$. Then there exists a positive constant $C$ independent of $f$ such that $$\|K\ast f\|_{L^1_{\mathbb{B}}(w)}\leq C\|f\|_{H^1_{\mathbb{A}}(w)}$$ for all $f\in H^1_{\mathbb{A}}(w)$.
△ Less
Submitted 5 January, 2023; v1 submitted 2 August, 2022;
originally announced September 2022.
-
Variaiton and $λ$-jump inequalities on $H^p$ spaces
Authors:
Sakin Demir
Abstract:
Let $φ\in \mathscr{S}$ with $\intφ(x)\, dx=1$, and define $$φ_t(x)=\frac{1}{t^n}φ(\frac{x}{t}),$$ and denote the function family $\{φ_t\ast f(x)\}_{t>0}$ by $Φ\ast f(x)$. Suppose that there exists a constant $C_1$ such that $$\sum_{t>0} |\hatφ_t(x)|^2<C_1$$ for all $x\in \mathbb{R}^n$. Then
(i) There exists a constant $C_2>0$ such that…
▽ More
Let $φ\in \mathscr{S}$ with $\intφ(x)\, dx=1$, and define $$φ_t(x)=\frac{1}{t^n}φ(\frac{x}{t}),$$ and denote the function family $\{φ_t\ast f(x)\}_{t>0}$ by $Φ\ast f(x)$. Suppose that there exists a constant $C_1$ such that $$\sum_{t>0} |\hatφ_t(x)|^2<C_1$$ for all $x\in \mathbb{R}^n$. Then
(i) There exists a constant $C_2>0$ such that $$\|\mathscr{V}_2(Φ\ast f)\|_{L^p}\leq C_2\|f\|_{H^p},\;\;\frac{n}{n+1}<p\leq 1$$ for all $f\in H^p(\mathbb{R}^n)$, $\frac{n}{n+1}<p\leq 1$.
(ii) The $λ$-jump operator $N_λ(Φ\ast f)$ satisfies $$\|λ[N_λ(Φ\ast f)]^{1/2}\|_{L^p}\leq C_3\|f\|_{H^p},\;\;\frac{n}{n+1}<p\leq 1,$$ uniformly in $λ>0$ for some constant $C_3>0$.
△ Less
Submitted 6 September, 2022; v1 submitted 25 March, 2022;
originally announced March 2022.
-
Variational Inequalities For The Differences Of Averages Over Lacunary Sequences
Authors:
Sakin Demir
Abstract:
Let $f$ be a locally integrable function defined on $\mathbb{R}$, and let $(n_k)$ be a lacunary sequence. Define the operator $A_{n_k}$ by $$A_{n_k}f(x)=\frac{1}{n_k}\int_0^{n_k}f(x-t)\, dt.$$ We prove various types of new inequalities for the variation operator $$\mathcal{V}_sf(x)=\left(\sum_{k=1}^\infty|A_{n_k}f(x)-A_{n_{k-1}}f(x)|^s\right)^{1/s}$$ when $2\leq s<\infty$.
Let $f$ be a locally integrable function defined on $\mathbb{R}$, and let $(n_k)$ be a lacunary sequence. Define the operator $A_{n_k}$ by $$A_{n_k}f(x)=\frac{1}{n_k}\int_0^{n_k}f(x-t)\, dt.$$ We prove various types of new inequalities for the variation operator $$\mathcal{V}_sf(x)=\left(\sum_{k=1}^\infty|A_{n_k}f(x)-A_{n_{k-1}}f(x)|^s\right)^{1/s}$$ when $2\leq s<\infty$.
△ Less
Submitted 25 May, 2022; v1 submitted 4 March, 2022;
originally announced March 2022.
-
Variation and oscillation inequalities for operator averages on a complex Hilbert space
Authors:
Sakin Demir
Abstract:
Let $\mathcal{H}$ be a complex Hilbert space and $T:\mathcal{H}\to \mathcal{H}$ be a contraction. Let $$A_nf=\frac{1}{n}\sum_{j=1}^nT^jf$$ for $f\in \mathcal{H}$. Let $(n_k)$ be a sequence satisfying $β\geq n_{k+1}/n_k\geq α>1$ for all $k\geq 1$, then there exists a constant $C_1>0$ such that $$\sum_{k=1}^\infty\|A_{n_{k+1}}f-A_{n_k}f\|_{\mathcal{H}}\leq C_1\|f\|_{\mathcal{H}}$$ for all…
▽ More
Let $\mathcal{H}$ be a complex Hilbert space and $T:\mathcal{H}\to \mathcal{H}$ be a contraction. Let $$A_nf=\frac{1}{n}\sum_{j=1}^nT^jf$$ for $f\in \mathcal{H}$. Let $(n_k)$ be a sequence satisfying $β\geq n_{k+1}/n_k\geq α>1$ for all $k\geq 1$, then there exists a constant $C_1>0$ such that $$\sum_{k=1}^\infty\|A_{n_{k+1}}f-A_{n_k}f\|_{\mathcal{H}}\leq C_1\|f\|_{\mathcal{H}}$$ for all $f\in \mathcal{H}$. Let $(n_k)$ be a sequence satisfying $β\geq n_{k+1}/n_k\geq α>1$ for all $k\geq 1$, and let $M$ be any sequence. Then there exists a constant $C_2>0$ such that $$\sum_{k=1}^\infty\sup_{\substack{n_k\leq m< n_{k+1}\\m\in M}}\|A_m(T)f-A_{n_k}(T)f\|_{H}\leq C_2\|f\|_{\mathcal{H}}$$ for all $f\in \mathcal{H}$.
△ Less
Submitted 8 January, 2024; v1 submitted 29 July, 2021;
originally announced July 2021.
-
Unconditional convergence of the differences of Fejér kernels on $L^2(\mathbb{R})$
Authors:
Sakin Demir
Abstract:
Let $K_n(x)$ denote the Fejér kernel given by $$K_n(x)=\sum_{j=-n}^n\left(1-\frac{|j|}{n+1}\right)e^{-ijx}$$ and let $σ_nf(x)=(K_n\ast f)(x)$, where as usual $f\ast g$ denotes the convolution of $f$ and $g$. Let the sequence $\{n_k\}$ be lacunary. Then the series $$\mathcal{G}f(x)=\sum_{k=1}^\infty \left(σ_{n_{k+1}}f(x)-σ_{n_k}f(x)\right)$$ converges unconditionally for all $f\in L^2(\mathbb{R})$.…
▽ More
Let $K_n(x)$ denote the Fejér kernel given by $$K_n(x)=\sum_{j=-n}^n\left(1-\frac{|j|}{n+1}\right)e^{-ijx}$$ and let $σ_nf(x)=(K_n\ast f)(x)$, where as usual $f\ast g$ denotes the convolution of $f$ and $g$. Let the sequence $\{n_k\}$ be lacunary. Then the series $$\mathcal{G}f(x)=\sum_{k=1}^\infty \left(σ_{n_{k+1}}f(x)-σ_{n_k}f(x)\right)$$ converges unconditionally for all $f\in L^2(\mathbb{R})$. Let $(n_k)$ be a lacunary sequence, and $\{c_k\}_{k=1}^\infty \in \ell^\infty$. Define $$\mathcal{R}f(x)=\sum_{k=1}^\infty c_k\left(σ_{n_{k+1}}f(x)-σ_{n_k}f(x)\right).$$ Then there exists a constant $C>0$ such that $$\|\mathcal{R}f\|_2\leq C\|f\|_2$$ for all $f\in L^2(\mathbb{R})$, i.e., $\mathcal{R}f$ is of strong type $(2,2)$. As a special case it follows that $\mathcal{G}f$ also is of strong type $(2,2)$.
△ Less
Submitted 8 June, 2022; v1 submitted 5 January, 2021;
originally announced January 2021.
-
Complete convergence of the Hilbert transform
Authors:
Sakin Demir
Abstract:
Suppose that $\{a_j\}\in \ell^1$, and suppose that for any sequence $(t_n)$ of integers there exits a constant $C_1>0$ such that $$\sharp\left\{k\in\mathbb{Z}:\sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n-t_n} \!\!\!\raise{1.9ex}\hbox{$\scriptsize\prime…
▽ More
Suppose that $\{a_j\}\in \ell^1$, and suppose that for any sequence $(t_n)$ of integers there exits a constant $C_1>0$ such that $$\sharp\left\{k\in\mathbb{Z}:\sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n-t_n} \!\!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{a_{k+i}}{i}\right|>λ\right\}\\ \leq C_1\sharp\left\{k\in\mathbb{Z}:\sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n} \!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{a_{k+i}}{i}\right|>λ\right\},$$ for all $λ>0$, where $\mathcal{B}_n=\{-n, -(n-1), -(n-2),\dots , n-2, n-1, n\}$. Then there is a constant $C_2>0$ which does not depend on the sequence $\{a_j\}$ such that $$\sum_{n=1}^\infty\sharp\left\{k\in\mathbb{Z}:\left|\sum_{i=-n}^{n} \!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{a_{k+i}}{i}\right|>λ\right\}\leq\frac{C_2}λ\sum_{i=-\infty}^{\infty}|a_i|$$ for all $λ>0$.
Let $(X,\mathscr{B},μ)$ be a measure space, $τ:X\to X$ an invertible measure-preserving transformation, and suppose that $f\in L^1(X)$ such that for any sequence $(t_n)$ of integers there exists a constant $C_1>0$ such that $$μ\left\{ x: \sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n-t_n}\!\!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{f(τ^ix)}{i}\right| >λ\right\}\leq C_1μ\left\{x: \sup_{n\geq 1}\left|\sum_{i\in \mathcal{B}_n}\!\!\raise{1.9ex}\hbox{$\scriptsize\prime$}\; \frac{f(τ^i x)}{i}\right|>λ\right\} $$ for all $λ>0$, where $\mathcal{B}_n=\{-n, -(n-1), -(n-2),\dots , n-2, n-1, n\}$. Then there exists a constant $C_2>0$ which does not depend on $f$ such that $$\sum_{n=1}^\inftyμ\left\{x:\left|\sum_{i=-n}^{n}\!\!\raise{1.9ex}\hbox{$\scriptsize\prime$} \;\frac{f(τ^ix)}{i}\right|>λ\right\}\leq\frac{C_2}λ\|f\|_1$$ for all $λ>0$.
△ Less
Submitted 3 August, 2022; v1 submitted 12 September, 2020;
originally announced September 2020.
-
Oscillation inequalities on real and ergodic $H^1$ spaces
Authors:
Sakin Demir
Abstract:
Let $(x_n)$ be a sequence and $ρ\geq 1$. For a fixed sequences $n_1<n_2<n_3<\dots$, and $M$ define the oscillation operators $$\mathcal{O}_ρ(x_n)=\left(\sum_{k=1}^\infty\sup_{\substack{n_k\leq m< n_{k+1}\\m\in M}}\left|x_m-x_{n_k}\right|^ρ\right)^{1/ρ}.$$ Let $(X,\mathscr{B} ,μ, τ)$ be a dynamical system with $(X,\mathscr{B} ,μ)$ a probability space and $τ$ a measurable, invertible, measure preser…
▽ More
Let $(x_n)$ be a sequence and $ρ\geq 1$. For a fixed sequences $n_1<n_2<n_3<\dots$, and $M$ define the oscillation operators $$\mathcal{O}_ρ(x_n)=\left(\sum_{k=1}^\infty\sup_{\substack{n_k\leq m< n_{k+1}\\m\in M}}\left|x_m-x_{n_k}\right|^ρ\right)^{1/ρ}.$$ Let $(X,\mathscr{B} ,μ, τ)$ be a dynamical system with $(X,\mathscr{B} ,μ)$ a probability space and $τ$ a measurable, invertible, measure preserving point transformation from $X$ to itself.\\ Suppose that the sequences $(n_k)$ and $M$ are lacunary. Then we prove the following results for $ρ\geq 2$:
(i) Define $φ_n(x)=\frac{1}{n}χ_{[0,n]}(x)$ on $\mathbb{R}$. Then there exists a constant $C>0$ such that $\|\mathcal{O}_ρ(φ_n\ast f)\|_{L^1(\mathbb{R})}\leq C\|f\|_{H^1(\mathbb{R})}$ for all $f\in H^1(\mathbb{R})$.
(ii) Let $A_nf(x)=\frac{1}{n}\sum_{k=1}^nf(τ^kx)$ be the usual ergodic averages in ergodic theory. Then $\|\mathcal{O}_ρ(A_nf)\|_{L^1(X)}\leq C\|f\|_{H^1(X)}$ for all $f\in H^1(X)$.
(iii) If $[f(x)\log (x)]^+$ is integrable, then $\mathcal{O}_ρ(A_nf)$ is integrable.
△ Less
Submitted 26 May, 2022; v1 submitted 23 June, 2020;
originally announced June 2020.
-
Filter design for small target detection on infrared imagery using normalized-cross-correlation layer
Authors:
H. Seçkin Demir,
Erdem Akagunduz
Abstract:
In this paper, we introduce a machine learning approach to the problem of infrared small target detection filter design. For this purpose, similarly to a convolutional layer of a neural network, the normalized-cross-correlational (NCC) layer, which we utilize for designing a target detection/recognition filter bank, is proposed. By employing the NCC layer in a neural network structure, we introduc…
▽ More
In this paper, we introduce a machine learning approach to the problem of infrared small target detection filter design. For this purpose, similarly to a convolutional layer of a neural network, the normalized-cross-correlational (NCC) layer, which we utilize for designing a target detection/recognition filter bank, is proposed. By employing the NCC layer in a neural network structure, we introduce a framework, in which supervised training is used to calculate the optimal filter shape and the optimum number of filters required for a specific target detection/recognition task on infrared images. We also propose the mean-absolute-deviation NCC (MAD-NCC) layer, an efficient implementation of the proposed NCC layer, designed especially for FPGA systems, in which square root operations are avoided for real-time computation. As a case study we work on dim-target detection on mid-wave infrared imagery and obtain the filters that can discriminate a dim target from various types of background clutter, specific to our operational concept.
△ Less
Submitted 15 June, 2020;
originally announced June 2020.
-
An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition
Authors:
Gizem Aras,
Didem Makaroglu,
Seniz Demir,
Altan Cakir
Abstract:
Named entity recognition (NER) is an extensively studied task that extracts and classifies named entities in a text. NER is crucial not only in downstream language processing applications such as relation extraction and question answering but also in large scale big data operations such as real-time analysis of online digital media content. Recent research efforts on Turkish, a less studied langua…
▽ More
Named entity recognition (NER) is an extensively studied task that extracts and classifies named entities in a text. NER is crucial not only in downstream language processing applications such as relation extraction and question answering but also in large scale big data operations such as real-time analysis of online digital media content. Recent research efforts on Turkish, a less studied language with morphologically rich nature, have demonstrated the effectiveness of neural architectures on well-formed texts and yielded state-of-the art results by formulating the task as a sequence tagging problem. In this work, we empirically investigate the use of recent neural architectures (Bidirectional long short-term memory and Transformer-based networks) proposed for Turkish NER tagging in the same setting. Our results demonstrate that transformer-based networks which can model long-range context overcome the limitations of BiLSTM networks where different input features at the character, subword, and word levels are utilized. We also propose a transformer-based network with a conditional random field (CRF) layer that leads to the state-of-the-art result (95.95\% f-measure) on a common dataset. Our study contributes to the literature that quantifies the impact of transfer learning on processing morphologically rich languages.
△ Less
Submitted 18 May, 2020; v1 submitted 14 May, 2020;
originally announced May 2020.
-
An extension of Calderon Transfer Principle
Authors:
Sakin Demir
Abstract:
We first prove that the well known transfer principle of A. P. Calderón can be extended to the vector-valued setting and then we apply this extension to vector-valued inequalities for the Hardy-Littlewood maximal function to prove the vector-valued strong type $L^p$ norm inequalities for $1<p<\infty$ and the vector-valued weak type $(1,1)$ inequality for ergodic maximal function.
We first prove that the well known transfer principle of A. P. Calderón can be extended to the vector-valued setting and then we apply this extension to vector-valued inequalities for the Hardy-Littlewood maximal function to prove the vector-valued strong type $L^p$ norm inequalities for $1<p<\infty$ and the vector-valued weak type $(1,1)$ inequality for ergodic maximal function.
△ Less
Submitted 31 March, 2020;
originally announced April 2020.
-
A transfer principle in ergodic theory on weighted spaces
Authors:
Sakin Demir
Abstract:
The power of Calderón transfer principle is well known when proving strong type and weak type inequalities for certain type of operators in ergodic theory. In this article we show that Calderón's argument can be extended to have a transfer principle to be able to prove weighted inequalities for those operators satisfying the conditions of Calderón transfer principle. We also include some applicati…
▽ More
The power of Calderón transfer principle is well known when proving strong type and weak type inequalities for certain type of operators in ergodic theory. In this article we show that Calderón's argument can be extended to have a transfer principle to be able to prove weighted inequalities for those operators satisfying the conditions of Calderón transfer principle. We also include some applications of our results.
△ Less
Submitted 24 February, 2024; v1 submitted 6 February, 2020;
originally announced February 2020.
-
A Sufficient Condition For An Operator To Map $uL^\infty$ to ${\rm{BMO}}_u$
Authors:
Sakin Demir
Abstract:
Let $T$ be an operator and suppose that there exists a positive constant $C$ such that $$\left(\int_I|Tf(x)|^q\, dx\right)^{1/q}\leq C\left(\int_I|f(x)|^q\, dx\right)^{1/q}$$ for every $q$ which is near enough to $1$ and for every interval $I$ in $\mathbb{R}$ and $f\in L^{\infty}(\mathbb{R})$. Then we show that $T$ maps $uL^{\infty}$ to ${\rm{BMO}}_u$.
Let $T$ be an operator and suppose that there exists a positive constant $C$ such that $$\left(\int_I|Tf(x)|^q\, dx\right)^{1/q}\leq C\left(\int_I|f(x)|^q\, dx\right)^{1/q}$$ for every $q$ which is near enough to $1$ and for every interval $I$ in $\mathbb{R}$ and $f\in L^{\infty}(\mathbb{R})$. Then we show that $T$ maps $uL^{\infty}$ to ${\rm{BMO}}_u$.
△ Less
Submitted 29 January, 2022; v1 submitted 2 February, 2020;
originally announced February 2020.
-
Inequalities For Variation Operator
Authors:
Sakin Demir
Abstract:
Let $f$ be a measurable function defined on $\mathbb{R}$. For each $n\in\mathbb{Z}$ define the operator $A_n$ by $$A_nf(x)=\frac{1}{2^n}\int_x^{x+2^n}f(y)\, dy.$$ Consider the variation operator $$\mathcal{V}f(x)=\left(\sum_{n=-\infty}^\infty|A_nf(x)-A_{n-1}f(x)|^s\right)^{1/s}$$ for $2\leq s<\infty$. It has been proved in \cite{jkw1} that $\mathcal{V}$ is of strong type $(p,p)$ for $1<p<\infty$ a…
▽ More
Let $f$ be a measurable function defined on $\mathbb{R}$. For each $n\in\mathbb{Z}$ define the operator $A_n$ by $$A_nf(x)=\frac{1}{2^n}\int_x^{x+2^n}f(y)\, dy.$$ Consider the variation operator $$\mathcal{V}f(x)=\left(\sum_{n=-\infty}^\infty|A_nf(x)-A_{n-1}f(x)|^s\right)^{1/s}$$ for $2\leq s<\infty$. It has been proved in \cite{jkw1} that $\mathcal{V}$ is of strong type $(p,p)$ for $1<p<\infty$ and is of weak type $(1,1)$, it maps $L^\infty$ to BMO. We first provide a completely different proofs for these known results and in addition we prove that $\mathcal{V}$ maps $H^1$ to $L^1$. Furthermore, we prove that it satisfies vector-valued weighted strong type and weak type inequalities. As a special case it follows that $\mathcal{V}$ satisfies weighted strong type and weak type inequalities.
△ Less
Submitted 25 January, 2020;
originally announced January 2020.
-
Neural Academic Paper Generation
Authors:
Samet Demir,
Uras Mutlu,
Özgur Özdemir
Abstract:
In this work, we tackle the problem of structured text generation, specifically academic paper generation in $\LaTeX{}$, inspired by the surprisingly good results of basic character-level language models. Our motivation is using more recent and advanced methods of language modeling on a more complex dataset of $\LaTeX{}$ source files to generate realistic academic papers. Our first contribution is…
▽ More
In this work, we tackle the problem of structured text generation, specifically academic paper generation in $\LaTeX{}$, inspired by the surprisingly good results of basic character-level language models. Our motivation is using more recent and advanced methods of language modeling on a more complex dataset of $\LaTeX{}$ source files to generate realistic academic papers. Our first contribution is preparing a dataset with $\LaTeX{}$ source files on recent open-source computer vision papers. Our second contribution is experimenting with recent methods of language modeling and text generation such as Transformer and Transformer-XL to generate consistent $\LaTeX{}$ code. We report cross-entropy and bits-per-character (BPC) results of the trained models, and we also discuss interesting points on some examples of the generated $\LaTeX{}$ code.
△ Less
Submitted 2 December, 2019;
originally announced December 2019.
-
DeepSmartFuzzer: Reward Guided Test Generation For Deep Learning
Authors:
Samet Demir,
Hasan Ferit Eniser,
Alper Sen
Abstract:
Testing Deep Neural Network (DNN) models has become more important than ever with the increasing usage of DNN models in safety-critical domains such as autonomous cars. The traditional approach of testing DNNs is to create a test set, which is a random subset of the dataset about the problem of interest. This kind of approach is not enough for testing most of the real-world scenarios since these t…
▽ More
Testing Deep Neural Network (DNN) models has become more important than ever with the increasing usage of DNN models in safety-critical domains such as autonomous cars. The traditional approach of testing DNNs is to create a test set, which is a random subset of the dataset about the problem of interest. This kind of approach is not enough for testing most of the real-world scenarios since these traditional test sets do not include corner cases, while a corner case input is generally considered to introduce erroneous behaviors. Recent works on adversarial input generation, data augmentation, and coverage-guided fuzzing (CGF) have provided new ways to extend traditional test sets. Among those, CGF aims to produce new test inputs by fuzzing existing ones to achieve high coverage on a test adequacy criterion (i.e. coverage criterion). Given that the subject test adequacy criterion is a well-established one, CGF can potentially find error inducing inputs for different underlying reasons. In this paper, we propose a novel CGF solution for structural testing of DNNs. The proposed fuzzer employs Monte Carlo Tree Search to drive the coverage-guided search in the pursuit of achieving high coverage. Our evaluation shows that the inputs generated by our method result in higher coverage than the inputs produced by the previously introduced coverage-guided fuzzing techniques.
△ Less
Submitted 24 November, 2019;
originally announced November 2019.
-
Identification of 2-bridge links
Authors:
Ali Sait Demir
Abstract:
We find all 2-Bridge links up to 11 crossings and locate them in Thistlethwaite's link table. The splitting numbers of some links are calculated as a consequence of this identification.
We find all 2-Bridge links up to 11 crossings and locate them in Thistlethwaite's link table. The splitting numbers of some links are calculated as a consequence of this identification.
△ Less
Submitted 23 September, 2019; v1 submitted 6 September, 2019;
originally announced September 2019.
-
Covering morphisms of internal groupoids in the models of a semi-abelian theory
Authors:
Osman Mucuk,
Serap Demir
Abstract:
In this paper, for given an algebraic theory $T$ whose category $C$ of models is semi-abelian, we consider the topological models of $T$ called topological $T$-algebras and obtain some results related to the fundamental groups of topological $T$-algebras. We also deal with the internal groupoid structure in the category of models providing that the fundamental groupoid deduces a functor from topol…
▽ More
In this paper, for given an algebraic theory $T$ whose category $C$ of models is semi-abelian, we consider the topological models of $T$ called topological $T$-algebras and obtain some results related to the fundamental groups of topological $T$-algebras. We also deal with the internal groupoid structure in the category of models providing that the fundamental groupoid deduces a functor from topological $T$-algebras to the internal groupoids in $C$ and prove a criterion for the lifting of such an internal groupoid structure to the covering groupoids.
△ Less
Submitted 26 January, 2018;
originally announced January 2018.
-
Topological aspect of monodromy groupoid for a group-groupoid
Authors:
Osman Mucuk,
Serap Demir
Abstract:
In this paper we develop star topological and topological group-groupoid structures of monodromy groupoid and prove that the monodromy groupoid of a topological group-groupoid is also a topological group-groupoid.
In this paper we develop star topological and topological group-groupoid structures of monodromy groupoid and prove that the monodromy groupoid of a topological group-groupoid is also a topological group-groupoid.
△ Less
Submitted 26 January, 2018;
originally announced January 2018.
-
Normality and quotient in crossed modules over groupoids and double groupoids
Authors:
Osman Mucuk,
Serap Demir
Abstract:
We consider the categorical equivalence between crossed modules over groupoids and double groupoids with thin structures; and by this equivalence, we prove how normality and quotient concepts are related in these two categories and give some examples of these objects.
We consider the categorical equivalence between crossed modules over groupoids and double groupoids with thin structures; and by this equivalence, we prove how normality and quotient concepts are related in these two categories and give some examples of these objects.
△ Less
Submitted 26 January, 2018;
originally announced January 2018.
-
Optical Characteristics for Inductively RF Discharge and Post-Discharge of Pure Neon at Low Pressure
Authors:
Murat Tanisli,
Neslihan Sahin,
Suleyman Demir
Abstract:
Electron temperature for inductively RF post-discharge of pure neon (Ne) with a newly reactor design was presented in comparison for two different methods. Optical emission spectroscopy (OES) was applied for characterizations of inductively RF Ne plasma at pressures between 0.17mbar and 1.4mbar for newly reactor type. Discharge and post-discharge were generated with an RF power supply at a frequen…
▽ More
Electron temperature for inductively RF post-discharge of pure neon (Ne) with a newly reactor design was presented in comparison for two different methods. Optical emission spectroscopy (OES) was applied for characterizations of inductively RF Ne plasma at pressures between 0.17mbar and 1.4mbar for newly reactor type. Discharge and post-discharge were generated with an RF power supply at a frequency of 13.56MHz and output powers of 100, 160 and 200W. Spectra were evaluated in the range 200-1200nm by an optical spectrometer. At low pressure, the main spectral features reported were the wavelengths of the atomic Ne transitions at 585.248nm and 724.516nm. The atomic emission intensities showed a maximum in inductive system when the pressure was about 0.77mbar.
△ Less
Submitted 28 May, 2019; v1 submitted 15 June, 2017;
originally announced June 2017.
-
Monodromy groups of real Enriques surfaces
Authors:
Sultan Erdoğan Demir
Abstract:
We compute the monodromy groups of real Enriques surfaces of hyperbolic type. The principal tools are the deformation classification of such surfaces and a modified version of Donaldson's trick, relating real Enriques surfaces and real rational surfaces.
We compute the monodromy groups of real Enriques surfaces of hyperbolic type. The principal tools are the deformation classification of such surfaces and a modified version of Donaldson's trick, relating real Enriques surfaces and real rational surfaces.
△ Less
Submitted 5 March, 2013; v1 submitted 26 August, 2011;
originally announced August 2011.
-
Shear Viscosity in a Perturbative Quark-Gluon-Plasma
Authors:
John Fuini III,
Nasser S. Demir,
Dinesh K. Srivastava,
Steffen A. Bass
Abstract:
Among the key features of hot and dense QCD matter produced in ultra-relativistic heavy-ion collisions at RHIC is its very low shear viscosity, indicative of the properties of a near-ideal fluid, and a large opacity demonstrated by jet energy loss measurements. In this work, we utilize a microscopic transport model based on the Boltzmann equation with quark and gluon degrees of freedom and cross s…
▽ More
Among the key features of hot and dense QCD matter produced in ultra-relativistic heavy-ion collisions at RHIC is its very low shear viscosity, indicative of the properties of a near-ideal fluid, and a large opacity demonstrated by jet energy loss measurements. In this work, we utilize a microscopic transport model based on the Boltzmann equation with quark and gluon degrees of freedom and cross sections calculated from perturbative Quantum Chromodynamics to simulate an ideal Quark-Gluon-Plasma in full thermal and chemical equilibrium. We then use the Kubo formalism to calculate the shear viscosity to entropy density ratio of the medium as a function of temperature and system composition. One of our key results is that the shear viscosity over entropy-density ratio $η/s$ becomes invariant to the chemical composition of the system when plotted as a function of energy-density instead of temperature.
△ Less
Submitted 23 August, 2010; v1 submitted 13 August, 2010;
originally announced August 2010.