Revisiting sums and products in countable and finite fields

Ioannis Kousek

Abstract

We establish a polynomial ergodic theorem for actions of the affine group of a countable field $K$ . As an application, we deduce–via a variant of Furstenberg’s correspondence principle–that for fields of characteristic zero, any “large” set $E\subset K$ contains “many” patterns of the form $\{p(x)+y,xy\}$ , for every non-constant polynomial $p(x)\in K[x]$ .

Our methods are flexible enough that they allow us to recover analogous density results in the setting of finite fields and, with the aid of a new finitistic variant of Bergelson’s “colouring trick”, show that for $r\in\mathbb{N}$ fixed, any $r-$ colouring of a large enough finite field will contain monochromatic patterns of the form $\{x,p(x)+y,xy\}$ .

In a different direction, we obtain a double ergodic theorem for actions of the affine group of a countable field. An adaptation of the argument for affine actions of finite fields leads to a generalisation of a theorem of Shkredov. Finally, to highlight the utility of the aforementioned finitistic “colouring trick”, we provide a conditional, elementary generalisation of Green and Sanders’ $\{x,y,x+y,xy\}$ theorem.

1 Introduction

1.1 Historic background

A well-known and still open question of Hindman (see, for example, [9]) reads as follows.

Question 1.1.

Given any finite colouring of $\mathbb{N}$ , do there always exists $x,y\in\mathbb{N}$ such that $\{x,y,x+y,xy\}$ is monochromatic, i.e. $x,y,x+y$ and $xy$ all have the same colour?

In [11], Moreira proved the following result marking significant progress towards an answer to Question 1.1.

Theorem 1.2 (Moreira).

For any finite colouring of $\mathbb{N}$ there exist (infinitely many) $x,y\in\mathbb{N}$ such that $\{x,x+y,xy\}$ is monochromatic.

Prior to Moreira’s theorem, Shkredov ([12]) addressed its analogue for finite fields of prime order proving two density results.

Theorem 1.3 (Shkredov).

Let $\mathbb{Z}_{p}$ be a finite field of prime order $p$ . If $A_{1},A_{2}\subset\mathbb{Z}_{p}$ are any sets with $|A_{1}||A_{2}|\geq 20p$ , then there exist $x,y\in\mathbb{Z}^{*}_{p}:=\mathbb{Z}_{p}\setminus\{0\}$ such that $x+y\in A_{1}$ and $xy\in A_{2}$ .

Theorem 1.4 (Shkredov).

Let $\mathbb{Z}_{p}$ be a finite field of prime order $p$ . If $A_{1},A_{2},A_{3}\subset\mathbb{Z}_{p}$ are any sets with $|A_{1}||A_{2}||A_{3}|\geq 40p^{5/2}$ , then there exist $x,y\in\mathbb{Z}^{*}_{p}$ such that $x+y\in A_{1}$ , $xy\in A_{2}$ and $x\in A_{3}$ .

It follows from Theorem 1.4 that if $\mathbb{Z}_{p}$ is $r$ -coloured and $p$ is large enough relative to $r$ , then there exist $x,y\in\mathbb{Z}^{*}_{p}$ such that $\{x,x+y,xy\}$ is monochromatic. Later, the analogue of Question 1.1 for finite fields of prime order was solved by Green and Sanders in [7] via the following quantitative result.

Theorem 1.5 (Green-Sanders).

Let $r\in\mathbb{N}$ be fixed and $\mathbb{Z}_{p}$ be a finite field of prime order $p$ , with $p$ large enough. For any $r$ -colouring of $\mathbb{Z}_{p}$ there are at least $c_{r}p^{2}$ monochromatic quadruples $\{x,y,x+y,xy\}$ , where $c_{r}>0$ does not depend on $p$ .

Observe that Theorems 1.3 and 1.4 are density results, while there is no density version of the partition regularity Theorem 1.5. This was pointed out by Shkredov in [12].

In the context of countable fields, Bowen and Sabok in [4] gave a positive answer to the analogue of Question 1.1. By a compactness principle they also solved the analogue of this question for all finite fields as a corollary of their main theorem.

Before that, Bergelson and Moreira in [3] established the following analogue of Theorem 1.2 using methods from ergodic theory.

Theorem 1.6 (Bergelson-Moreira).

Let $K$ be a countable field and consider a finite colouring $K=\bigcup_{j=1}^{r}C_{j}$ , $r\in\mathbb{N}$ . Then, there exists a colour $C_{i}$ , $1\leq i\leq r$ , and “many” $x,y\in K^{*}$ , such that $\{x,x+y,xy\}\subset C_{i}.$

In this setting, an appropriate notion of largeness, which guarantees patterns involving both addition and multiplication in any large set, turns out to be that of positive upper density with respect to double Følner sequences. We recall the definition given in [3].

Definition 1.7.

Let $K$ be a countable field. A double Følner sequence in $K$ is a sequence of (non-empty) finite subsets $(F_{N})_{N\in\mathbb{N}}\subset K$ which is asymptotically invariant under any fixed affine transformation of $K$ , that is,

\lim_{N\to\infty}\frac{\left|F_{N}\cap\left(x+F_{N}\right)\right|}{|F_{N}|}=% \lim_{N\to\infty}\frac{\left|F_{N}\cap\left(xF_{N}\right)\right|}{|F_{N}|}=1,

for any $x\in K^{*}$ .

This notion of sequence allows us to define asymptotic densities with good properties such as shift invariance. For a countable field $K$ and $(F_{N})_{N\in\mathbb{N}}$ a double Følner sequence in $K$ as above, given a set $E\subset K$ , its upper density with respect to $(F_{N})_{N\in\mathbb{N}}$ is defined as

\overline{\mathop{}\!\mathrm{d}}_{(F_{N})}(E)=\limsup_{N\to\infty}\frac{\left|% E\cap F_{N}\right|}{|F_{N}|}.

Moreover, its lower density with respect to $(F_{N})_{N\in\mathbb{N}}$ is defined as

\underline{\mathop{}\!\mathrm{d}}_{(F_{N})}(E)=\liminf_{N\to\infty}\frac{\left% |E\cap F_{N}\right|}{|F_{N}|}

and whenever the limit exists we say that $E$ has a density with respect to $(F_{N})_{N\in\mathbb{N}}$ given by $\mathop{}\!\mathrm{d}_{(F_{N})}(E)=\overline{\mathop{}\!\mathrm{d}}_{(F_{N})}(% E)=\underline{\mathop{}\!\mathrm{d}}_{(F_{N})}(E).$

Using a “colouring trick” Bergelson and Moreira were able to recover Theorem 1.6 from essentially the following theorem, which we state vaguely.

Theorem 1.8 (Bergelson-Moreira).

Let $K$ be a countable field, $(F_{N})_{N\in\mathbb{N}}$ be a double Følner sequence in $K$ and $E\subset K$ with $\overline{\mathop{}\!\mathrm{d}}_{F_{N}}(E)>0$ . Then, there exist “many” $x,y\in K$ such that $\{x+y,xy\}\subset E$ .

An advantage of the statement of Theorem 1.8, over that of Theorem 1.6, is that it’s form can be handled with ergodic theoretic tools and methods. This is a general principle, discovered by Furstenberg in his seminal proof of Szemerédi’s theorem (see [6]). There he introduced a correspondence principle, which often allows one to translate a problem of finding patterns in large sets (subsets of the integers, of semi-groups, of fields, etc.) to a problem about recurrence in measure preserving systems.

The following ergodic theorem from [3], whose proof utilizes the group of affine transformations of a field $K$ , defined as $\mathscr{A}_{K}:=\{f:x\mapsto ux+v\big{|}\ u,v\in K,u\neq 0\}$ , implies Theorem 1.8. We write $A_{u}$ for the map $x\mapsto x+u$ , if $u\in K$ and $M_{u}$ for $x\mapsto ux$ , if $u\in K^{*}:=K\setminus\{0\}$ .

Theorem 1.9 (Bergelson-Moreira).

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(A_{-u}B\cap M_{1/u}B)% \geq(\mu(B))^{2}.

Remark.

The fact that $(T_{g})_{g\in\mathscr{A}_{K}}$ acts on $(X,\mathscr{X},\mu)$ by m.p.t. means that $(T_{g})_{g\in\mathscr{A}_{K}}$ is a group action on $X$ , so that $T_{g}\circ T_{h}=T_{g\circ h}$ , any $g,h\in\mathscr{A}_{K}$ , and that $\mu(A)=\mu(T_{g}^{-1}A),$ for any $A\in\mathscr{X}$ and $g\in\mathscr{A}_{K}$ . Also, in an abuse of notation, we write $A_{u}$ for $T_{A_{u}}$ and $M_{u}$ for $T_{M_{u}}$ , where $u\in K^{*}$ .

1.2 Main results

A question which occurs naturally is whether we can extend Theorem 1.6, by finding monochromatic patterns of the form $\{x,p(x)+y,xy\}$ , where $p(x)$ is a polynomial over $K$ , other than $p(x)=x$ . This is addressed by our first main result (stated somewhat vaguely for now) which we formulate after an important–throughout this paper–definition.

Definition 1.10.

Given a field $K$ with prime characteristic $\text{char}(K)=q$ , we say that a non-constant polynomial $p(x)\in K[x]$ is admissible for $K$ , if $\deg(p(x))\leq q-1$ . If $K$ is a countable field with $\text{char}(K)=0$ , then any non-constant polynomial $p(x)\in K[x]$ is admissible for $K$ .

Theorem 1.11.

Let $K$ be a countable field and $p(x)\in K[x]\setminus K$ be any admissible polynomial. Then, for any finite colouring $K=C_{1}\cup\dots\cup C_{r},$ there exists a colour $C_{j}$ , $1\leq j\leq r$ , and “many” $x,y\in K^{*}$ , so that $\{x,p(x)+y,xy\}\subset C_{j}.$

The density theorem which we will use to prove Theorem 1.11 is the following.

Theorem 1.12.

Let $K$ be a countable field, $(F_{N})_{N\in\mathbb{N}}$ be a double Følner sequence in $K$ and $E\subset K$ with $\overline{\mathop{}\!\mathrm{d}}_{F_{N}}(E)>0$ . Then, for any admissible polynomial $p(x)\in K[x]\setminus K$ there exist “many” $x,y\in K$ such that $\{p(x)+y,xy\}\subset E$ .

In the same spirit as in the end of Section $1.1$ , Theorem 1.12 is implied by an ergodic theorem.

Theorem 1.13.

Let $K$ , $p(x)\in K[x]\setminus K$ and $(F_{N})_{N\in\mathbb{N}}$ be as in the statement of Theorem 1.12. Let $(X,\mathscr{X},\mu)$ be a probability space on which we assume that $(T_{g})_{g\in\mathscr{A}_{K}}$ acts by measure preserving transformations. Then, given any $f\in L^{2}(X,\mu)$ we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}M_{u}A_{-p(u)}f=Pf,

where the limit is in $L^{2}$ and $P:L^{2}(X,\mu)\to L^{2}(X,\mu)$ denotes the orthogonal projection onto the subspace of $\mathscr{A}_{K}$ -invariant functions.

The proof of this statement is based on that of Bergelson and Moreira’s proof of Theorem 1.9, with additional applications of van der Corput type of lemmas to facilitate an induction argument on the degree of the polynomial. This appears especially in the proof of the polynomial mean ergodic theorem of Proposition 3.2.

We also finitise the arguments used to prove Theorem 1.13 in order to recover the following analogue of our main density result, Theorem 1.12, in the setting of finite fields.

Theorem 1.14.

Let $F$ be a finite field and let $p(x)\in F[x]$ be an admissible polynomial over $F$ of degree $q:=\deg(p(x))$ . Then, if $E,G\subset F$ with $|E||G|>2(q+2)|F|^{2-(1/2^{q-1})}$ , there are $x,y\in F^{*}$ , so that $xy\in E$ and $p(x)+y\in G$ .

In particular, letting $E=G$ , we have the finite field version of the density statement that there exist $x,y\in F^{*}$ such that $\{p(x)+y,xy\}\subset E$ , provided $E\subset F$ is large enough.

We also produce a new finitistic version of the “colouring trick” mentioned earlier and with the aid of Theorem 1.14 recover the next partition regularity result.

Theorem 1.15.

Let $r,q\in\mathbb{N}$ be fixed. Then, there exists $n(r,q)\in\mathbb{N}$ with the following property. If $F$ is any finite field with $|F|\geq n(r,q)$ and $\text{char}(F)>q$ and $p(x)\in F[x]$ is a polynomial of $\deg(p(x))=q$ , then for any finite colouring $F=C_{1}\cup\dots\cup C_{r}$ , there is a colour $C_{j}$ and $x,y\in F^{*}$ , such that $\{x,p(x)+y,xy\}\subset C_{j}.$

Remark.

The assumption $\text{char}(F)>q$ is only to ensure that the polynomial $p(x)\in F[x]$ is admissible according to Definition 1.10.

A special case of this theorem (when $p(x)=x$ ) is the partition regularity corollary of Shkredov’s Theorem 1.4 mentioned after its statement. An advantage of the ergodic theoretic techniques used here is that we can recover more general polynomial patterns and also that the result holds for all finite fields and not only $\mathbb{Z}_{p}$ . A perhaps more interesting feature, however, is the use of the novel– in the finitistic setting–“colouring trick”, which, in a way, allows us to recover this partition regularity statement from a weaker density theorem.

In a different direction we are also interested in the question of section $6.4$ of [3]. Namely, is it true that under the assumptions of Theorem 1.9 above we get triple intersections of the form $\mu(B\cap A_{-u}B\cap M_{1/u}B)>0,$ for some $u\in K^{*}$ ? A generalization of the next non-commutative double ergodic theorem, without the assumption of ergodicity, would answer this question in the affirmative.

Theorem 1.16.

Let $K$ be a countable field and $(F_{N})_{N\in\mathbb{N}}$ be a double Følner sequence in $K$ . Let $(X,\mathscr{X},\mu)$ be a probability space on which we assume that $(T_{g})_{g\in\mathscr{A}_{K}}$ acts by measure preserving transformations and (crucially) we further assume that the action of the additive subgroup $S_{A}=\{A_{u}:u\in K\}$ is ergodic¹¹1The action $(T_{g})_{g\in G}$ of a group $G$ on a probability space $(X,\mathscr{X},\mu)$ is ergodic if for any $A\in\mathscr{X}$ we have that $T_{g}A=A,\ \text{for all}\ g\in G\implies\mu(A)\in\{0,1\}$ . Then, given any $B\in\mathscr{X}$ , we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(B\cap A_{-u}B\cap M_{1/% u}B)\geq(\mu(B))^{3}.

Unfortunately, we were unable to recover the result in its full generality. However, we make a natural conjecture.

Conjecture 1.17.

In the context of Theorem 1.16, if $S_{A}$ does not act ergodically, then given any $B\in\mathscr{X}$ , we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(B\cap A_{-u}B\cap M_{1/% u}B)\geq(\mu(B))^{4}.

In a relevant direction, Theorem 1.3 was generalised to all finite fields, initially by Cilleruelo ([5, Corollary $4.2$ ]) and subsequently by Hanson ([8, Theorem $1$ ]) and Bergelson and Moreira ([3, Theorem $5.3$ ]). However, a generalisation of Theorem 1.4 to any finite field remained open and we address this problem hereby through a “finitisation” of Theorem 1.16.

Theorem 1.18.

Let $F$ be any finite field and let $B_{1},B_{2},B_{3}\subset F$ be any sets satisfying $|B_{1}||B_{2}||B_{3}|\geq 8|F|^{5/2}$ . Then, there exist $x,y\in F^{*}$ such that $x+y\in B_{1}$ , $xy\in B_{2}$ and $x\in B_{3}$ .

The ideas and techniques appearing in the proof of Theorem 1.16 spring from classical ergodic theoretic arguments used in proving multiple ergodic theorems. In this regard, the proof of Theorem 1.18, which is more or less a “finitisation” of the above-mentioned proof, is different from Shkredov’s original combinatorial proof of Theorem 1.4.

Finally, by using the finitistic “colouring trick” and a finitistic version of Conjecture 1.17, we provide an elementary, conditional proof of the following generalisation of Green and Sanders’ Theorem 1.5.

Conjecture 1.19.

Let $r\in\mathbb{N}$ be fixed. Then, there is $n(r)\in\mathbb{N}$ , so that if $F$ is any finite field with $|F|\geq n(r)$ and $F=C_{1}\cup\dots\cup C_{r}$ , there are $c_{r}|F|^{2}$ quadruples monochromatic $\{x,y,x+y,xy\}$ , where $c_{r}>0$ does not depend on $|F|$ .

Acknowledgments. The author expresses gratitude to his advisor, Joel Moreira, for his guidance and beneficial discussions during the preparation of this paper. Thanks also go to Matt Bowen, Nikos Frantzikinakis and Andreas Mountakis for comments on earlier drafts.

2 Preliminaries and some useful results

2.1 The action of the affine group

For a countable field $K$ , we denote by $\mathscr{A}_{K}=\{f:x\mapsto ux+v:\ u,v\in K,\ u\neq 0\}$ the group of affine transformations of $K$ , with the operation of composition. The additive subgroup of $\mathscr{A}_{K}$ is denoted by $S_{A}$ and consists of the transformations $A_{u}:x\mapsto x+u$ , for $u\in K$ . Similarly, the multiplicative subgroup, denoted by $S_{M}$ , consists of transformations of the form $M_{u}:x\mapsto ux,$ for $u\in K^{*}$ . The map $x\mapsto ux+v$ can be represented by the composition $A_{v}M_{u}$ and we have the trivial, but very useful throughout this paper, identity:

M_{u}A_{v}=A_{uv}M_{u}.

(2.1)

The affine group appears naturally in our considerations because in order, for example, to find patterns $\{u+v,uv\}$ in a subset $E\subset K$ we can show that for some $u\in K^{*}$ , the intersection $A_{-u}E\cap M_{1/u}E$ is non-empty.

We have already mentioned the utility of double Følner sequences as averaging schemes in $K$ . The existence of such sequences was proved in Proposition $2.4$ of [3].

Proposition 2.1.

Any countable field $K$ admits a sequence of non-empty finite sets $(F_{N})_{N\in\mathbb{N}}$ which forms a Følner sequence for both the actions of the additive group $(K,+)$ and the multiplicative group $(K^{*},\cdot)$ . In other words, for any $u\in K^{*}$ , we have that

\lim_{N\to\infty}\frac{\left|F_{N}\cap\left(u+F_{N}\right)\right|}{|F_{N}|}=% \lim_{N\to\infty}\frac{\left|F_{N}\cap\left(uF_{N}\right)\right|}{|F_{N}|}=1.

According to Lemma $2.6$ in [3], some transformations of double Følner sequences remain double Følner sequences.

Lemma 2.2.

Let $K$ be a countable field. If $(F_{N})_{N\in\mathbb{N}}$ is a double Følner sequence in $K$ and $b\in K^{*}$ , then $(bF_{N})_{N\in\mathbb{N}}$ is still a double Følner sequence in $K$ .

We will further consider a probability space $(X,\mathscr{X},\mu)$ and a measure preserving action $(T_{g})_{g\in\mathscr{A}_{K}}$ of $\mathscr{A}_{K}$ on $X$ . In this context, we denote $L^{2}(X,\mu)$ by $H$ and let $(U_{g})_{g\in\mathscr{A}_{K}}$ be given by $(U_{g}f)(x)=f(T_{g}^{-1}x)$ , for $x\in X$ and $f\in H$ . This is known as the unitary Koopman representation of $\mathscr{A}_{K}$ . Abusing notation we will usually write $A_{u}f$ instead of $U_{A_{u}}f$ and $M_{u}f$ instead of $U_{M_{u}}f$ . By $P_{A}$ we denote the orthogonal projection from $H$ onto the subspace of vectors which are fixed by the action of the additive subgroup $S_{A}$ . Also, by $P_{M}$ we denote the orthogonal projection from $H$ onto the subspace of vectors fixed under the action of $S_{M}$ .

The useful and unintuitive fact that the projections $P_{A}$ and $P_{M}$ commute was established in Lemma $3.1$ of [3].

Lemma 2.3.

For any $f\in H$ we have that

P_{A}P_{M}f=P_{M}P_{A}f.

By Lemma 2.3 we see that $P_{A}P_{M}f$ is invariant under the actions of both $S_{A}$ and $S_{M}$ and that $P_{A}P_{M}f$ is an orthogonal projection. Since the subgroups $S_{A}$ and $S_{M}$ generate the whole group $\mathscr{A}_{K}$ , it follows that $P=P_{A}P_{M}=P_{M}P_{A}$ is the orthogonal projection from $H$ onto the subspace of vectors fixed under the action of $\mathscr{A}_{K}$ .

2.2 Ergodic theorems and van der Corput lemmas

The mean ergodic theorem for unitary representations of countable abelian groups, which we will extend later for our purposes, has the following form and a proof of this version can be found for example in [1], Theorem $5.4$ .

Theorem 2.4.

Let $G$ be a countable abelian group and $(F_{N})_{N\in\mathbb{N}}$ be a Følner sequence in $G$ . Let also $H$ be a Hilbert space and $(U_{g})_{g\in G}$ be a unitary representation of $G$ on $H$ . Then for any $f\in H$ ,

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{g\in F_{N}}U_{g}f=Pf,

where the limit is in the strong topology of $H$ and $P$ denotes the orthogonal projection onto the subspace of vectors fixed under $G$ .

Remark.

One may consider for example the cases where, provided that $\mathscr{A}_{K}$ acts by m.p.t. on a probability space $(X,\mathscr{X},\mu)$ , we have that $H=L^{2}(X,\mu)$ , $G=S_{A}$ or $G=S_{M}$ and then $P=P_{A}$ or $P=P_{M}$ , respectively.

We will consider an adaptation of the van der Corput lemma for unitary representations of countable abelian groups. A proof–of a stronger version–appears in Theorem $2.12$ of [2].

Lemma 2.5.

Let $(G,\cdot)$ be a countable abelian group and $(a_{u})_{u\in G}$ be a bounded sequence of vectors in a Hilbert space $H$ , indexed by the elements of $G$ . Let $(F_{N})_{N\in\mathbb{N}}$ be a Følner sequence in $G$ . If

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{v\in F_{M}}\limsup_{N\to\infty}\frac{1% }{|F_{N}|}\left|\sum_{u\in F_{N}}\langle a_{u\cdot v},a_{u}\rangle\right|=0,

then also

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}a_{u}=0.

Remark.

This, in particular, holds when $(G,\cdot)=(K,+)$ or when $(G,\cdot)=(K^{*},\cdot)$ for some countable field $K$ and $(F_{N})_{N\in\mathbb{N}}$ is a double Følner sequence in $K$ .

Another version of the van der Corput lemma, which will be used in Section 6, follows as a corollary of the inequality given in Lemma $1$ , Chapter $21$ of Host and Kra’s book [10].

Proposition 2.6.

Let $(G,\cdot)$ be a countable abelian group with identity $1$ and for each $b\in G$ let $(a_{u}(b))_{u\in G}$ be a bounded sequence of vectors in a Hilbert space $H$ with norm $\left\|\cdot\right\|$ , indexed by the elements of $G$ . Let $(F_{N})_{N\in\mathbb{N}}$ be a Følner sequence in $G$ . If for all $d\neq 1$ ,

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}\limsup_{N\to\infty}\frac{1% }{|F_{N}|}\sum_{u\in F_{N}}\langle a_{u\cdot d}(b),a_{u}(b)\rangle=0,

then also

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}\limsup_{N\to\infty}\left\|% \frac{1}{|F_{N}|}\sum_{u\in F_{N}}a_{u}(b)\right\|^{2}=0.

For finite groups, a version of the van der Corput lemma is given by the following simple equality. We will use this to adapt our infinite ergodic theorems to the setting of finite fields.

Proposition 2.7.

Let $(G,\cdot)$ be a finite group and $(f(g))_{g\in G}$ be a sequence taking values in a Hilbert space $H$ . Then,

\left\|\sum_{g\in G}f(g)\right\|^{2}=\sum_{g\in G}\sum_{h\in G}\langle f(g% \cdot h),f(g)\rangle.

Finally, we shall find the next classical result useful.

Lemma 2.8.

Let $(a_{u})_{u\in G}$ be a bounded, non-negative sequence, indexed by elements of a countable (amenable) group $G$ and let $(G_{N})_{N\in\mathbb{N}}$ be a Følner sequence in $G$ . Then

\lim_{N\to\infty}\frac{1}{|G_{N}|}\sum_{u\in G_{N}}a_{u}=0\iff\lim_{N\to\infty% }\frac{1}{|G_{N}|}\sum_{u\in G_{N}}a_{u}^{2}=0.

3 Proofs of Theorems 1.12 and 1.13

Throughout this section we assume that $K$ is a countable field, $(F_{N})_{N\in\mathbb{N}}$ is a double Følner sequence in $K$ and $p(x)\in K[x]$ is a non-constant admissible polynomial over $K$ , according to Definition 1.10. We also let $(X,\mathscr{X},\mu)$ be a probability space on which we assume that $(T_{g})_{g\in\mathscr{A}_{K}}$ acts by measure preserving transformations. In consistency with the notation from Section 2, $H=L^{2}(X,\mu)$ , $P:H\to H$ denotes the orthogonal projection from $H$ onto the subspace of functions fixed under the action of $\mathscr{A}_{K}$ and $P_{A}$ , $P_{M}$ are the orthogonal projections on the subspaces of vectors fixed under the additive action $S_{A}$ and the multiplicative action $S_{M}$ , respectively. Moreover, $(U_{g})_{g\in\mathscr{A}_{K}}$ is the unitary Koopman representation of $\mathscr{A}_{K}$ (for details recall the discussion after Lemma 2.2). Again, for simplicity, we will write $A_{u}$ instead of $U_{A_{u}}$ and $M_{u}$ instead of $U_{M_{u}}$ .

Before embarking on the proof of Theorem 1.13 we show the ensuing, straightforward corollary of it.

Corollary 3.1.

If $K$ , $p(x)\in K[x]\setminus K$ , $(F_{N})_{N\in\mathbb{N}}$ and $(X,\mathscr{X},\mu)$ are as above, then for any $B\in\mathscr{X}$ , we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(A_{-p(u)}B\cap M_{1/u}B% )\geq(\mu(B))^{2}.

Proof.

For $B\in\mathscr{X}$ we see that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(A_{-p(u)}B\cap M_{1/u}B% )=\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\int_{X}A_{-p(u)}\mathbbm% {1}_{B}\cdot M_{1/u}\mathbbm{1}_{B}\ d\mu,

which can be written as (using that $M_{u}$ is preserves $\mu$ , for all $u\in K^{*}$ )

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\int_{X}M_{u}A_{-p(u)}% \mathbbm{1}_{B}\cdot\mathbbm{1}_{B}\ d\mu.

(3.1)

By Theorem 1.13 applied for $f=\mathbbm{1}_{B}$ , (3.1) becomes

\int_{X}(P\mathbbm{1}_{B})\cdot\mathbbm{1}_{B}\ d\mu\geq(\mu(B))^{2}.

For the last inequality observe that $P$ is an orthogonal projection and so

\int_{X}(P\mathbbm{1}_{B})\cdot\mathbbm{1}_{B}\ d\mu=\int_{X}(P\mathbbm{1}_{B}% )^{2}\ d\mu\geq\left(\int_{X}P\mathbbm{1}_{B}\ d\mu\right)^{2},

by the Cauchy-Schwarz inequality. Finally, because $P1=1$ we have that

\int_{X}P\mathbbm{1}_{B}\ d\mu=\int_{X}\mathbbm{1}_{B}\ d\mu=\mu(B)

and thus we conclude. ∎

Remark.

A similar argument shows that if in the context of Theorem 1.13 the action of $\mathscr{A}_{K}$ is also ergodic, then for any $B,C\in\mathscr{X}$ we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(A_{-p(u)}B\cap M_{1/u}C% )\geq\mu(B)\mu(C).

For the special case $p(x)=x$ , the proof of Theorem 1.13 was given in [3]. We only mention that in the proof of the linear case in [3], the authors relied on a version of the mean ergodic Theorem 2.4 for the action of $S_{A}$ . For the polynomial case of Theorem 1.13 we will use the subsequent generalization, which is a polynomial mean ergodic theorem for the action of $S_{A}$ . For that we will need an application of the van der Corput trick utilizing the additive structure of $K$ , which facilitates an induction argument on the polynomial’s degree.

Theorem 3.2.

Let $K$ be a countable field and $p(x)\in K[x]\setminus K$ be admissible. Let also $(F_{N})_{N\in\mathbb{N}}$ be a double Følner sequence in $K$ and $(X,\mathscr{X},\mu)$ a probability space, on which $(T_{A_{u}})_{u\in K}$ acts by measure preserving transformations (see also the beginning of this section). Then, given any $f\in H$ we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}A_{p(u)}f=P_{A}f,

where the limit is in the strong topology of $H$ .

Proof.

We prove the case $\text{char}(K)=q$ , some $q\in\mathbbm{P}$ (see also Remark 3.3). If $p(x)=ax+b$ , where $a,b\in K$ and $a\neq 0$ , then it follows by the mean ergodic theorem that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}A_{au+b}f=\lim_{N\to\infty}% \frac{1}{|F_{N}|}\sum_{u\in aF_{N}+b}A_{u}f=P_{A}f.

Note that here we used the fact that $(aF_{N}+b)_{N\in\mathbb{N}}$ is still a Følner sequence for the additive group $(K,+)$ , in view of Lemma 2.2 and the obvious observation that shifts of Følner sequences are also Følner sequences in any group. Now, assume the statement holds for polynomials of degree $m-1$ , where $2\leq m\leq q-1$ and let $p(x)\in K[x]\setminus K$ have degree $m$ , i.e., $p(x)=q_{0}+q_{1}x+\dots+q_{m}x^{m}$ , $q_{0},\dots,q_{m}\in K$ and $q_{m}\neq 0$ . First, we let $f\in H$ be such that $P_{A}f=0$ and set $a_{u}=A_{p(u)}f,$ $u\in K$ . Then, for any $b\in K^{*}$ , we have that

\langle a_{u+b},a_{u}\rangle=\langle A_{p(u+b)-p(u)}f,f\rangle.

Observe that

p(u+b)-p(u)=q_{m}\sum_{k=0}^{m-1}\binom{m}{k}u^{k}\cdot b^{m-k}+r_{b}(u),

where $\deg(r_{b}(x))\leq m-2$ . Therefore,

p(u+b)-p(u)=m\cdot(q_{m}b)u^{m-1}+r^{\prime}_{b}(u),

where $\deg(r_{b}^{\prime}(x))\leq m-2$ , and since $q_{m}b\neq 0$ , the above argument shows that the polynomial $g_{b}(x)=p(x+b)-p(x)$ has degree $m-1$ in $K[x]$ .

We note that an issue arises in allowing the polynomial’s degree to be $q$ , in which case if, for example, $p(x)=x^{q}$ , then $g_{b}(x)=b^{q}$ is a constant, because $(x+b)^{q}=x^{q}+b^{q}$ in a field of characteristic $q$ .

Returning to the proof, by the induction hypothesis and the assumption on $f$ , we see that for any $b\neq 0$ ,

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\langle a_{u+b},a_{u}% \rangle=\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\langle A_{g_{b}(u)% }f,f\rangle=\langle P_{A}f,f\rangle=0.

Thus, an application of the van der Corput trick as in Lemma 2.5 gives us that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}A_{p(u)}f=0,

in $H$ , when $P_{A}f=0$ . Finally, for a general $f\in H$ we can write $f=P_{A}f+(f-P_{A}f)$ and from the above and linearity it follows that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}A_{p(u)}f=\lim_{N\to\infty}% \frac{1}{|F_{N}|}\sum_{u\in F_{N}}A_{p(u)}P_{A}f=P_{A}f.

∎

Remark 3.3.

Note that the same proof in the case of $\text{char(K)}=0$ (for example when $K=\mathbb{Q}$ ), gives the same result for polynomials of arbitrarily large degree, because then it always holds that $x\mapsto p(x+b)-p(x)$ is a polynomial of degree equal to $\deg(p(x))-1$ , when $b\neq 0$ .

We will now give the proof of Theorem 1.13, the statement of which we recall for the reader’s convenience.

Theorem 1.13.

Let $K$ , $(F_{N})_{N\in\mathbb{N}}$ , $p(x)\in K[x]\setminus K$ , $(X,\mathscr{X},\mu)$ and $(T_{g})_{g\in\mathscr{A}_{K}}$ be as in the beginning of this section. Then, given any $f\in H$ we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}M_{u}A_{-p(u)}f=Pf,

where the limit is in the strong topology of $H$ .

Proof.

Let $f\in H$ and assume that $P_{A}f=0$ . For $u\in K^{*}$ we now set $a_{u}=M_{u}A_{-p(u)}f$ and then, for any $b\in K^{*}$ we have that

\langle a_{ub},a_{u}\rangle=\langle A_{-p(ub)+p(u)/b}f,M_{1/b}f\rangle.

If $p(x)=q_{0}+q_{1}x+\dots+q_{m}x^{m}$ , $q_{0},\dots,q_{m}\in K$ and $q_{m}\neq 0$ ( $m<q$ if $\text{char}(K)=q$ ), then

p(ub)-p(u)/b=q_{0}\frac{b-1}{b}+u\left(q_{1}\frac{b^{2}-1}{b}\right)+\dots+u^{% m}\left(q_{m}\frac{b^{m+1}-1}{b}\right),

which, for $b\notin\{0,1,-1\}$ fixed, is also a polynomial of degree $m$ . Thus, applying Theorem 3.2 we have that for $b\notin\{0,1,-1\}$ ,

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\langle a_{ub},a_{u}\rangle% =\langle P_{A}f,M_{1/b}f\rangle=0.

Once again, the van der Corput lemma implies that for $P_{A}f=0$ ,

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}M_{u}A_{-p(u)}f=0,

and this allows us to conclude just like in the case of Theorem 3.2, after decomposing a general $f\in H$ as $f=P_{A}f+(f-P_{A}f)$ . ∎

Using some quantitative bounds for the set of return times, which can be extracted from the proof of Corollary 3.1, and the variant of Furstenberg’s correspondence principle established in Theorem $2.8$ of [3], we can recover the following precise version of Theorem 1.12. The proof is a straightforward adaptation of the proof of Theorem $2.5$ from Theorem $2.10$ in [3], which amounts to the special case that $p(x)=x$ .

Theorem 3.4.

Let $K$ be a countable field, $p(x)\in K[x]\setminus K$ an admissible polynomial and $(F_{N})_{N\in\mathbb{N}}$ be a double Følner sequence in $K$ . Let $E\subset K$ with $\overline{\mathop{}\!\mathrm{d}}_{(F_{N})}(E)>0.$ Then, for any $\epsilon>0$ we have that

\underline{\mathop{}\!\mathrm{d}}_{(F_{N})}\left(\{u\in K^{*}:\overline{% \mathop{}\!\mathrm{d}}_{(F_{N})}\left((E-p(u))\cap(E/u)\right)\geq(\overline{% \mathop{}\!\mathrm{d}}_{(F_{N})}(E))^{2}-\epsilon\}\right)>0.

In less precise terms, for each element of a large set of $u\in K^{*}$ there is a large set of $v\in K^{*}$ satisfying $\{v+p(u),vu\}\subset E$ .

To conclude the results of this section we give a precise statement of Theorem 1.11.

Theorem 3.5.

Let $K$ be a countable field, $(F_{N})_{N\in\mathbb{N}}$ a double Følner sequence in $K$ and $p(x)\in K[x]\setminus K$ an admissible polynomial. Then, for any finite colouring $K=C_{1}\cup\dots\cup C_{r}$ , there exists a colour $C_{j}$ such that

\overline{\mathop{}\!\mathrm{d}}_{(F_{N})}\left(\{u\in C_{j}:\overline{\mathop% {}\!\mathrm{d}}_{(F_{N})}\left(\{v\in K:\{u,p(u)+v,uv\}\subset C_{j}\}\right)% \}\right)>0.

The proof of Theorem 3.5 is based on the “colouring trick” of (and is almost identical to) the proof of Theorem $4.1$ in [3], and therefore is omitted. The only difference being that we rely on Corollary 3.1, while in [3] the authors relied on its special case of a linear polynomial.

It seems like our methods are not rigid enough to deal with non-admissible polynomials according to Definition 1.10 because of the comment in the proof of Theorem 3.2, so we make the following natural questions.

Question 3.6.

Does Corollary 3.1 hold if $p(x)\in K[x]$ is not admissible?

Question 3.7.

Does Theorem 3.5 (or a vague version as in Theorem 1.11) hold for non-admissible polynomials $p(x)\in K[x]$ ?

We note that a positive answer to Question 3.6 would also imply a positive answer to Question 3.7 by the same argument that is used for the case of admissible polynomials.

4 A finite fields version of Theorem 1.12

In this section we will adapt the proof of Theorem 1.12 to the finite fields setting and prove Theorem 1.14.

For a finite field $F$ we consider its group of affine transformations, $\mathscr{A}_{F}$ , which consists of the maps of the form $x\mapsto ux+v,$ where $u\in F^{*}$ and $v\in F$ . We also let $(X,\mathscr{X},\mu)$ be a probability space on which $\mathscr{A}_{F}$ acts by measure preserving transformations, with $(T_{g})_{g\in\mathscr{A}_{F}}$ denoting the action. As before, we let $S_{A}=\{A_{u}:u\in F\}$ , where $A_{u}(x)=x+u$ and $S_{M}=\{M_{u}:u\in F^{*}\}$ , where $M_{u}(x)=xu$ . Also, in an abuse of notation, if $(U_{g})_{g\in\mathscr{A}_{F}}$ is the Koopman representation of $\mathscr{A}_{F}$ on $L^{2}(X,\mu)$ we write $A_{u}$ for $U_{A_{u}}$ and $M_{u}$ for $U_{M_{u}}$ , where for example, for $f\in L^{2}(X,\mu)$ we have that $U_{A_{u}}f(x)=f(T_{A_{u}}^{-1}x)=f(T_{A_{-u}}x)$ .

Moreover, if $P_{A}$ is the orthogonal projection onto the space of functions invariant under the subgroup $S_{A}$ , we see that $P_{A}f(x)=\frac{1}{|F|}\sum_{u\in F}A_{u}f(x)$ and if $P_{M}$ is the projection onto the space of functions invariant under $S_{M}$ , then $P_{M}f(x)=\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}f(x)$ . We will begin with a finitistic version of the polynomial mean ergodic theorem of Section 3 and then prove an analogue of Theorem 1.13. As in the infinite case, $P_{A}$ and $P_{M}$ exhibit commuting behavior (see the proof of Theorem $5.1$ in [3]).

Proposition 4.1.

For $f\in L^{2}(X,\mu)$ and $P_{A}$ , $P_{M}$ as above, we have that $P_{A}P_{M}f=P_{M}P_{A}f$ .

Thus, $P_{A}P_{M}$ is an orthogonal projection onto the subspace of functions invariant under $\mathscr{A}_{F}$ . The promised finitistic analogue of Theorem 3.2 is this.

Proposition 4.2.

\left\|\frac{1}{|F|}\sum_{u\in F}A_{p(u)}f-P_{A}f\right\|_{2}^{2}\leq\frac{q-1% }{|F|^{1/2^{q-2}}}\left\|f-P_{A}f\right\|_{2}^{2}.

Proof.

If $p(x)=ax+b$ , $a,b\in F$ and $a\neq 0$ , this is obvious, for $p(F)=\{au+b:u\in F\}=F$ , whence it is enough to make a change of variables and use the definition of $P_{A}$ . Assume now that the conclusion holds for polynomials of degree at most $q<r-1$ and let $p(x)\in F[x]$ be a polynomial of degree $q+1\leq r-1$ , where $\text{char}(F)=r$ , some $r\in\mathbbm{P}$ . Then,

\frac{1}{|F|}\sum_{u\in F}A_{p(u)}f=\frac{1}{|F|}\sum_{u\in F}A_{p(u)}P_{A}f+% \frac{1}{|F|}\sum_{u\in F}A_{p(u)}\tilde{f},

where $\tilde{f}=f-P_{A}f$ , so that $P_{A}\tilde{f}=0.$ Clearly,

\frac{1}{|F|}\sum_{u\in F}A_{p(u)}P_{A}f=P_{A}f.

On the other hand, by Proposition 2.7 it follows that

\left\|\frac{1}{|F|}\sum_{u\in F}A_{p(u)}\tilde{f}\right\|_{2}^{2}=\frac{1}{|F% |}\sum_{v\in F}\frac{1}{|F|}\sum_{u\in F}\langle A_{p(u+v)-p(u)}\tilde{f},% \tilde{f}\rangle.

(4.1)

Since $\deg(p(x))=q+1\leq r-1$ , the polynomial $p(x+v)-p(x)$ has degree $q$ for any $v\neq 0$ (this would no longer be true if the degree of $p(x)$ was $r$ just like the infinite field case), and since $P_{A}\tilde{f}=0$ , the induction hypothesis implies that

\left\|\frac{1}{|F|}\sum_{u\in F}A_{p(u+v)-p(u)}\tilde{f}\right\|_{2}^{2}\leq% \frac{q-1}{|F|^{1/2^{q-2}}}\left\|\tilde{f}\right\|_{2}^{2}.

(4.2)

Finally, we see that

\frac{1}{|F|}\sum_{v\in F}\frac{1}{|F|}\sum_{u\in F}\langle A_{p(u+v)-p(u)}% \tilde{f},\tilde{f}\rangle\leq\frac{1}{|F|}\left\|\tilde{f}\right\|_{2}^{2}+% \frac{1}{|F|}\sum_{v\in F^{*}}\frac{1}{|F|}\sum_{u\in F}\langle A_{p(u+v)-p(u)% }\tilde{f},\tilde{f}\rangle,

which, by an application of the Cauchy-Schwarz inequality is bounded above by

\frac{1}{|F|}\left\|\tilde{f}\right\|_{2}^{2}+\left\|\frac{1}{|F|}\sum_{u\in F% }A_{p(u+v)-p(u)}\tilde{f}\right\|_{2}\left\|\tilde{f}\right\|_{2}.

(4.3)

Using (4.2) in (4.3) and then by (4.1) it follows that

\left\|\frac{1}{|F|}\sum_{u\in F}A_{p(u)}\tilde{f}\right\|_{2}^{2}\leq\frac{1}% {|F|}\left\|\tilde{f}\right\|_{2}^{2}+\frac{\sqrt{q-1}}{|F|^{1/2^{q-1}}}\left% \|\tilde{f}\right\|_{2}^{2}\leq\frac{q}{|F|^{1/2^{q-1}}}\left\|\tilde{f}\right% \|_{2}^{2}.

∎

We isolate the following estimate that appears in the proof of the finitistic analogue of Corollary 3.1, that is, Theorem 4.4 below. This estimate is the finitistic analogue of Theorem 1.13 for functions orthogonal to the space of functions fixed under the action of $S_{A}$ .

Proposition 4.3.

Let $F$ be a finite field and assume that $\mathscr{A}_{F}$ acts on $(X,\mathscr{X},\mu)$ as in the beginning of this section. Let also $p(x)\in F[x]\setminus F$ be an admissible polynomial of degree $q:=\deg(p(x))$ . Let $f=\mathbbm{1}_{C}-P_{A}\mathbbm{1}_{C}$ for some $C\in\mathscr{X}$ . Then,

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-p(u)}f\right\|_{2}^{2}<2(q+2% )\mu(C)/|F^{*}|^{1/2^{q-1}}.

(4.4)

Proof.

From Proposition 2.7 we have that

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-p(u)}f\right\|_{2}^{2}=\frac% {1}{|F^{*}|}\sum_{u\in F^{*}}\frac{1}{|F^{*}|}\sum_{v\in F^{*}}\langle M_{uv}A% _{-p(uv)}f,M_{u}A_{-p(u)f}\rangle=\\ \frac{1}{|F^{*}|}\sum_{v\in F^{*}}\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\langle A_% {-p(uv)+p(u)/v}f,M_{1/v}f\rangle.

(4.5)

Now, for $v=\pm-1$ (in fact for any $v\in F^{*}$ , but this wouldn’t lead to a practically useful bound) it is easy to see that

\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\langle A_{-p(uv)+p(u)/v}f,M_{1/v}f\rangle% \leq\left\|f\right\|_{2}^{2}.

(4.6)

On the other hand, for any $v\in F^{*},$ $v\neq\pm 1$ , we have

\left|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\langle A_{-p(uv)+p(u)/v}f,M_{1/v}f% \rangle\right|\leq\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}A_{-p(uv)+p(u)/v}f% \right\|_{2}\left\|f\right\|_{2}.

(4.7)

Moreover,

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}A_{-p(uv)+p(u)/v}f\right\|_{2}\leq\\ \left\|\frac{|F|}{|F^{*}|}\frac{1}{|F|}\sum_{u\in F}A_{-p(uv)+p(u)/v}f\right\|% _{2}+\left\|\frac{1}{|F^{*}|}A_{-p(0)+p(0)/v}f\right\|_{2}.

(4.8)

But, if $v\not\in\{0,1,-1\}$ , then $-p(uv)+p(u)/v$ is a polynomial of same degree as $p(u)$ , and so by Proposition 4.2 and because $P_{A}f=0$ , (4.8) becomes²²2We used that $|F|\big{/}|F^{*}|\left(\sqrt{q-1}\big{/}|F|^{1/2^{q-1}}\right)+1\big{/}|F^{*}|% \leq q\big{/}|F^{*}|^{1/2^{q-1}}$ , whenever $|F|\geq 3$ .

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}A_{-p(uv)+p(u)/v}f\right\|_{2}\leq% \frac{q}{|F^{*}|^{1/2^{q-1}}}\left\|f\right\|_{2}.

Using this in (4.7) we get that (for $v\notin\{0,1,-1\})$

\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\langle A_{-p(uv)+p(u)/v}f,M_{1/v}f\rangle% \leq\frac{q}{|F^{*}|^{1/2^{q-1}}}\left\|f\right\|_{2}^{2}.

(4.9)

Combining (4.6) and (4.9) it follows from (4.5) that

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-p(u)}f\right\|_{2}^{2}\leq(q% +2)\left\|f\right\|_{2}^{2}/|F^{*}|^{1/2^{q-1}}.

It is shown in the proof of Theorem $5.1$ in [3] that $\left\|f\right\|_{2}\leq\sqrt{2\mu(C)}$ . Therefore, the latter inequality readily implies (4.4) and so we conclude. ∎

Theorem 4.4.

Let $F$ be a finite field and assume that $\mathscr{A}_{F}$ acts on $(X,\mathscr{X},\mu)$ as in the beginning of this section. Let also $p(x)\in F[x]\setminus F$ be an admissible polynomial of degree $q:=\deg(p(x))$ . Then, for any set $B\in\mathscr{X}$ , such that $(\mu(B))^{2}>2(q+2)\big{/}|F^{*}|^{1/2^{q-1}}$ , there exists $u\in F^{*}$ so that $\mu(B\cap M_{u}A_{-p(u)}B)>0.$

If, in addition, the action of $S_{A}$ is ergodic, then for any sets $B,C\in\mathscr{X}$ which satisfy $\mu(B)\mu(C)>2(q+2)\big{/}|F^{*}|^{1/2^{q-1}}$ , there is some $u\in F^{*}$ with $\mu(B\cap M_{u}A_{-p(u)}C)>0.$

Remark.

For the case $p(x)=x$ , that is, when $q=1$ , the bounds in this statement coincide with those that Bergelson and Moreira found in [3].

Proof.

Let $B,C\in\mathscr{X}$ . For the second conclusion it suffices to prove the following averages are positive (for the first conclusion we prove the same thing with $B=C$ )

	$\displaystyle\frac{1}{\|F^{}\|}\sum_{u\in F^{}}\mu(B\cap M_{u}A_{-p(u)}C)\ =\$	$\displaystyle\langle\mathbbm{1}_{B},\frac{1}{\|F^{}\|}\sum_{u\in F^{}}M_{u}A_{% -p(u)}\mathbbm{1}_{C}\rangle=$
	$\displaystyle\langle\mathbbm{1}_{B},\frac{1}{\|F^{}\|}\sum_{u\in F^{}}M_{u}A_{% -p(u)}P_{A}\mathbbm{1}_{C}\rangle\ +$	$\displaystyle\ \langle\mathbbm{1}_{B},\frac{1}{\|F^{}\|}\sum_{u\in F^{}}M_{u}A% _{-p(u)}f\rangle,$		(4.10)

where $f=\mathbbm{1}_{C}-P_{A}\mathbbm{1}_{C}$ . Now, we observe that

\langle\mathbbm{1}_{B},\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-p(u)}P_{A}% \mathbbm{1}_{C}\rangle=\langle\mathbbm{1}_{B},\frac{1}{|F^{*}|}\sum_{u\in F^{*% }}M_{u}P_{A}\mathbbm{1}_{C}\rangle=\langle\mathbbm{1}_{B},P_{M}P_{A}\mathbbm{1% }_{C}\rangle.

(4.11)

If $S_{A}$ acts ergodically, then $P_{A}\mathbbm{1}_{C}=\mu(C)$ and so (4.11) becomes

\langle\mathbbm{1}_{B},\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-p(u)}P_{A}% \mathbbm{1}_{C}\rangle=\mu(B)\mu(C).

(4.12)

If $B=C$ and we don’t assume ergodicity, then $P_{M}P_{A}\mathbbm{1}_{B}=P\mathbbm{1}_{B}$ , where $P$ is the projection onto the space of functions invariant under $\mathscr{A}_{F}$ by Proposition 4.1. Therefore $P1=1$ and it follows by the Cauchy-Schwarz inequality that

\langle\mathbbm{1}_{B},\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-p(u)}P_{A}% \mathbbm{1}_{B}\rangle=\langle\mathbbm{1}_{B},P\mathbbm{1}_{B}\rangle=\left\|P% \mathbbm{1}_{B}\right\|_{2}^{2}\geq(\mu(B))^{2}.

(4.13)

For the last averages in (4.10) another application of Cauchy-Schwarz’s inequality gives that

\left|\langle\mathbbm{1}_{B},\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-p(u)}f% \rangle\right|\leq\sqrt{\mu(B)}\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A% _{-p(u)}f\right\|_{2}.

(4.14)

So, from (4.4) in Proposition 4.3 the inequality in (4.14) now becomes

\left|\langle\mathbbm{1}_{B},\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-p(u)}f% \rangle\right|\leq\sqrt{2(q+2)\mu(B)\mu(C)}\big{/}|F^{*}|^{1/2^{q}}.

In conclusion, (4.10) implies that

\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\mu(B\cap M_{u}A_{-p(u)}C)\geq\langle% \mathbbm{1}_{B},P_{M}P_{A}\mathbbm{1}_{C}\rangle-\sqrt{2(q+2)\mu(B)\mu(C)}\big% {/}|F^{*}|^{1/2^{q}}.

(4.15)

As we have alluded to in the beginning of this proof, there are now two routs. If $S_{A}$ acts ergodically, then (4.15) becomes

\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\mu(B\cap M_{u}A_{-p(u)}C)\geq\mu(B)\mu(C)-% \sqrt{2(q+2)\mu(B)\mu(C)}\big{/}|F^{*}|^{1/2^{q}},

(4.16)

and this is positive whenever $\mu(B)\mu(C)>2(q+2)\big{/}|F^{*}|^{1/2^{q-1}}$ . If we don’t assume ergodicity and $B=C$ , then we have

\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\mu(B\cap M_{u}A_{-p(u)}B)\geq(\mu(B))^{2}-% \sqrt{2(q+2)}\mu(B)\big{/}|F^{*}|^{1/2^{q}},

(4.17)

which is positive precisely when $(\mu(B))^{2}>2(q+2)\big{/}|F^{*}|^{1/2^{q-1}}.$ ∎

Some quantitative bounds for the set of return times in the previous theorem–which will be used in the proof of Theorem 1.14 given below and in Section 5–are the following.

Corollary 4.5.

Let $F$ be a finite field and assume that $\mathscr{A}_{F}$ acts on $(X,\mathscr{X},\mu)$ by m.p.t. Let also $p(x)\in F[x]\setminus F$ be an admissible polynomial of degree $q:=\deg(p(x))$ , $B\in\mathscr{X}$ and $\delta<\mu(B)$ . Then, the set of return times $D:=\{u\in F^{*}:\mu(B\cap M_{u}A_{-p(u)}B)>\delta\}$ satisfies

\frac{|D|}{|F^{*}|}\geq\frac{(\mu(B))^{2}-\sqrt{2(q+2)}\mu(B)\big{/}|F^{*}|^{1% /2^{q}}-\delta}{\mu(B)}.

(4.18)

If, in addition, the action of $S_{A}$ is ergodic, then for any $B,C\in\mathscr{X}$ and $\delta<\min{\{\mu(B),\mu(C)\}}$ , the set $D^{\prime}:=\{u\in F^{*}:\mu(B\cap M_{u}A_{-p(u)}C)>\delta\}$ satisfies

\frac{|D^{\prime}|}{|F^{*}|}\geq\frac{\mu(B)\mu(C)-\sqrt{2(q+2)\mu(B)\mu(C)}% \big{/}|F^{*}|^{1/2^{q}}-\delta}{\min{\{\mu(B),\mu(C)\}}}.

(4.19)

Proof.

By (4.17) we know that

\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\mu(B\cap M_{u}A_{-p(u)}B)\geq(\mu(B))^{2}-% \sqrt{2(q+2)}\mu(B)\big{/}|F^{*}|^{1/2^{q}}.

At the same time, $\mu(B\cap M_{u}A_{-p(u)}B)\leq\mu(B)$ implies that

\frac{1}{|F^{*}|}\sum_{u\in F^{*}}\mu(B\cap M_{u}A_{-p(u)}B)\leq\frac{|D|}{|F^% {*}|}\mu(B)+\left(1-\frac{|D|}{|F^{*}|}\right)\delta=\delta+\frac{|D|}{|F^{*}|% }(\mu(B)-\delta).

Combining the two inequalities we see that

(\mu(B))^{2}-\sqrt{2(q+2)}\mu(B)\big{/}|F^{*}|^{1/2^{q}}\leq\delta+\frac{|D|}{% |F^{*}|}(\mu(B)-\delta)

and thus

\frac{|D|}{|F^{*}|}\mu(B)\geq(\mu(B))^{2}-\sqrt{2(q+2)}\mu(B)\big{/}|F^{*}|^{1% /2^{q}}-\delta,

which is (4.18). For the ergodic case we use (4.16) instead of (4.17) and the rest is similar. ∎

We shall conclude this section by proving Theorem 1.14.

Theorem 1.14.

Let $F$ be a finite field. Then, if $p(x)\in F[x]$ is an admissible polynomial over $F$ of degree $q:=\deg(p(x))$ and $E,G\subset F$ with $|E||G|>2(q+2)|F|^{2-(1/2^{q-1})}$ , there are $u,v\in F^{*}$ , so that $vu\in E$ and $p(u)+v\in G$ .

Remark.

To give a better taste of the bounds, if we are looking for patterns of the form $\{uv,u+v^{2}\}$ in a subset $E$ of a field of order $3^{6}=729$ , then our method demands that $|E|>2\sqrt{2\ 3^{9}}\approx 396$ , and for a field of order $3^{7}=2187$ , that $|E|>2\sqrt{2}\ 3^{21/4}\approx 904$ .

Proof.

Consider the action by affine transformations of $\mathscr{A}_{F}$ on $F$ with the normalised counting measure $\mu$ , i.e. $\mu(B)=|B|/|F|$ , for any $B\subset F$ . Then the action of $S_{A}$ is ergodic. Now, for $s<\min{\{|E|,|G|\}}$ , we let $\delta=s/|F|$ and $D:=\{u\in F^{*}:\mu(E\cap M_{u}A_{-p(u)}G)>\delta\}.$ By Corollary 4.5 we know that

\frac{|D|}{|F^{*}|}\geq\frac{\mu(E)\mu(G)-\sqrt{2(q+2)\mu(E)\mu(G)}\big{/}|F^{% *}|^{1/2^{q}}-\delta}{\min{\{\mu(E),\mu(G)\}}}.

This means that

|D|\geq\frac{|E||G||F^{*}|/|F|-|F^{*}|^{1-1/2^{q}}\sqrt{2(q+2)|E||G|}-s|F^{*}|% }{\min{\{|E|,|G|\}}}.

(4.20)

Observe that for $u\in D$ we have that

\frac{s}{|F|}=\delta\leq\mu(E\cap M_{u}A_{-p(u)}G)=\frac{\left|M_{1/u}E\cap A_% {-p(u)}G\right|}{|F|},

which means that for each $u\in D$ there are $s$ elements $v\in F$ , such that $vu\in E$ and $v+p(u)\in G$ . ∎

5 A new “colouring trick” and partition regularity for finite fields

In this section we will adapt the infinite “colouring trick” presented in Section $4$ of [3] in order to recover a partition regularity result for finite fields, namely Theorem 1.15, from weaker density results established in Section 4; essentially from the proof of Theorem 1.14. We recall Theorem 1.15 for convenience.

Theorem 1.15.

Let $r,q\in\mathbb{N}$ be fixed. Then, there is $n(r,q)\in\mathbb{N}$ , so that for a finite field $F$ with $|F|\geq n(r,q)$ and $\text{char}(F)>q$ and a polynomial $p(x)\in F[x]$ of $\deg(p(x))=q$ , any colouring $F=C_{1}\cup\dots\cup C_{r}$ contains monochromatic triples of the form $\{u,p(u)+v,uv\}$ .

Proof.

Let $r\in\mathbb{N}$ , $r>1$ , be fixed and let $F$ be any finite field with $|F|\geq n(r,q)$ , for $n(r,q)$ to be determined later. For an $r$ -colouring of such a field, we can permute the colours if necessary and assume that $|C_{1}|\geq|C_{2}|\geq\dots\geq|C_{r}|$ . Clearly then, $|C_{1}|\geq|F|\big{/}r$ . Next, we pick a number $1\leq r^{\prime}\leq r$ in the following manner. If $|C_{2}|<|F|\big{/}r^{4}$ , we set $r^{\prime}=1$ . Else, we have that $|C_{2}|\geq|F|\big{/}r^{4}$ and $r^{\prime}\geq 2$ . Then, we either have that $|C_{3}|\geq|F|\big{/}r^{8}$ , whence $r^{\prime}\geq 2$ or not and let $r^{\prime}=2$ . In this fashion we set

r^{\prime}:=\max{\Big{\{}1\leq j\leq r:|C_{1}|\geq|F|\big{/}r\ ,\ |C_{2}|\geq|% F|\big{/}r^{4}\ ,\ \dots\ ,\ |C_{j}|\geq|F|\big{/}r^{2^{j}}\Big{\}}}.

D=\{u\in F^{*}:\nu(C\cap M_{u}A_{-p(u)}C)>\delta\},

the size of which we can bound below by Corollary 4.5, which implies that

|D|\geq\frac{(\nu(C))^{2}|F^{*}|-\nu(C)\sqrt{2(q+2)}|F^{*}|^{1-1/2^{q}}-s}{\nu% (C)}.

(5.1)

Next, we show that

|D|>|F|-(|C_{1}|+\dots+|C_{r^{\prime}}|)=|C_{r^{\prime}+1}|+\dots+|C_{r}|.

(5.2)

Observe that by the definition of $r^{\prime}$ it follows that

|C_{r^{\prime}+1}|+\dots+|C_{r}|\leq(r-r^{\prime})|F|\big{/}r^{2^{(r^{\prime}+% 1)}}<|F|\big{/}r^{2^{(r^{\prime}+1)}-1}.

(5.3)

Combining (5.1) with (5.3), we see that (5.2) follows from

\nu(C)|F^{*}|-\sqrt{2(q+2)}|F^{*}|^{1-1/2^{q}}-s/\nu(C)>|F|\big{/}r^{2^{(r^{% \prime}+1)}-1},

or equivalently that,

\nu(C)>\sqrt{2(q+2)}\big{/}|F^{*}|^{1/2^{q}}+1\big{/}r^{2^{(r^{\prime}+1)}-1}+% s\big{/}\left(|F^{*}|\nu(C)\right)+1\big{/}\left(|F^{*}|r^{2^{(r^{\prime}+1)}-% 1}\right).

(5.4)

Using the definition of $C$ and $r^{\prime}$ it holds that

\nu(C)=\frac{|C_{1}|\cdots|C_{r^{\prime}}|}{|F^{r^{\prime}}|}\geq\frac{1}{r}% \cdot\frac{1}{r^{4}}\cdot\frac{1}{r^{8}}\cdots\frac{1}{r^{2^{r^{\prime}}}}=% \frac{1}{r^{(1+4+8+\dots+2^{r^{\prime}})}}.

Now, one can see that³³3For $r^{\prime}\geq 2$ we have that $2^{r^{\prime}+1}-\left(2^{r^{\prime}}+\dots+2^{2}\right)=4$

\frac{1}{r^{(1+4+\dots+2^{r^{\prime}})}}-\frac{1}{r^{2^{(r^{\prime}+1)}-1}}=% \frac{r^{2^{(r^{\prime}+1)}-1-(2^{r^{\prime}}+\dots+2^{2}+1)}-1}{r^{2^{(r^{% \prime}+1)}-1}}=\frac{r^{2}-1}{r^{2^{(r^{\prime}+1)}-1}},

when $r^{\prime}\geq 2$ . If $r^{\prime}=1$ , then the equation becomes $1\big{/}r-1\big{/}r^{3}=\left(r^{2}-1\right)\big{/}r^{3}$ . Finally, (5.4) follows from

\frac{r^{2}-1}{r^{2^{(r^{\prime}+1)}-1}}\geq\sqrt{2(q+2)}\big{/}|F^{*}|^{1/2^{% q}}+s\big{/}\left(|F^{*}|\nu(C)\right)+1\big{/}\left(|F^{*}|r^{2^{(r^{\prime}+% 1)}-1}\right),

(5.5)

which holds for $|F|\geq n(r,q)$ , with $n(r,q)$ large enough, since the RHS goes to $0$ as $|F|\to\infty$ , for $r,q$ fixed. By (5.2) we know that $D\cap\left(C_{1}\cup\dots\cup C_{r^{\prime}}\right)\neq\emptyset$ as

\left|D\cap\left(C_{1}\cup\dots\cup C_{r^{\prime}}\right)\right|\geq|D|-|C_{r^% {\prime}+1}|-\dots-|C_{r}|.

Thus, there must exist $u\in C_{1}\cup\dots\cup C_{r^{\prime}}$ , such that $\nu(C\cap M_{u}A_{-p(u)}C)>s\big{/}|F^{*}|$ . Then, if $u\in C_{j}$ , for $1\leq j\leq r^{\prime}$ , by the definition of $C$ and the measure $\nu$ we will also have that

\frac{|C_{j}/u\cap\left(C_{j}-p(u)\right)|}{|F|}=\mu(C_{j}\cap M_{u}A_{-p(u)}C% _{j})>\frac{s}{|F^{*}|}>\frac{s}{|F|}

(5.6)

and hence $C_{j}/u\cap(C_{j}-p(u))\neq\emptyset$ . This implies the existence of $u,v\in F$ with $u\neq 0$ such that $\{u,p(u)+v,uv\}\subset C_{j}$ . In particular, for each $u\in D\cap\left(C_{1}\cup\dots\cup C_{r^{\prime}}\right)$ there are, by (5.6), at least $s$ monochromatic triples $\{u,p(u)+v,uv\}$ . ∎

Remark 5.1.

The observant reader will have noticed that the proof above actually gives that

\left|D\cap\left(C_{1}\cup\dots\cup C_{r^{\prime}}\right)\right|\geq|F^{*}|% \left(\frac{r^{2}-1}{r^{2^{(r^{\prime}+1)}-1}}-\frac{\sqrt{2(q+2)}}{|F^{*}|^{1% /2^{q}}}-\frac{s}{|F^{*}|\nu(C)}-\frac{1}{|F^{*}|r^{2^{(r^{\prime}+1)}-1}}% \right).

Therefore, for any finite field with $|F^{*}|\geq n(r,q)$ we see that

\left|D\cap\left(C_{1}\cup\dots\cup C_{r^{\prime}}\right)\right|\geq c_{r,q}% \cdot|F|,

where, whenever $n(r,q)$ is large enough,

c_{r,q}=\frac{r^{2}-1}{r^{2^{(r^{\prime}+1)}-1}}-\frac{\sqrt{2(q+2)}}{n(r,q)^{% 1/2^{q}}}-\frac{s}{n(r,q)\cdot\nu(C)}-\frac{1}{n(r,q)\cdot r^{2^{(r^{\prime}+1% )}-1}}>0

is a constant that does not depend on $|F|$ . Using the concluding comments of the previous proof, as $s=\delta|F^{*}|$ we have a total of $c^{\prime}_{r,q}|F|^{2}$ monochromatic triples of the form $\{u,u+v,uv\}$ , where $c^{\prime}_{r,q}>0$ is a constant that does not depend on $|F|$ .

6 Proof of Theorem 1.16

Throughout this short section we will assume that $K$ is a countable field and $(F_{N})_{N\in\mathbb{N}}$ is a double Følner sequence in $K$ . We also let $(T_{g})_{g\in\mathscr{A}_{K}}$ denote an action of $\mathscr{A}_{K}$ on some probability space $(X,\mathscr{X},\mu)$ by measure preserving transformations. For reference, our main goal is to prove the next result, part of which was initially stated as Theorem 1.16.

Theorem 6.1.

Let $K$ , $(F_{N})_{N\in\mathbb{N}}$ , $(X,\mathscr{X},\mu)$ and $(T_{g})_{g\in\mathscr{A}_{K}}$ be as above. Also, we (crucially) further assume that the action of the additive subgroup $S_{A}=\{A_{u}:u\in K\}$ is ergodic. Then, given any $B\in\mathscr{X}$ , we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(B\cap A_{-u}B\cap M_{1/% u}B)\geq(\mu(B))^{3}.

If, in addition, the action of $S_{M}$ is ergodic, then for any $B_{1},B_{2},B_{3}\in\mathscr{X}$ we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(B_{1}\cap A_{-u}B_{2}% \cap M_{1/u}B_{3})\geq\mu(B_{1})\mu(B_{2})\mu(B_{3}).

The proof is based on the following (double) ergodic theorem.

Theorem 6.2.

Let $K$ , $(F_{N})_{N\in\mathbb{N}}$ , $(X,\mathscr{X},\mu)$ and $(T_{g})_{g\in\mathscr{A}_{K}}$ be as in the beginning of this section. We further assume that the action of the additive subgroup $S_{A}$ is ergodic. Then, for any $f,g\in L^{\infty}(X,\mu)$ we have that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}M_{u}A_{-u}f\cdot M_{u}g=P_% {M}g\cdot P_{A}f,

where the limit is in $L^{2}$ .

Proof.

Without loss of generality we assume that $f$ and $g$ are real-valued functions. We begin by decomposing $f$ as $f=P_{A}f+\tilde{f}$ , where $\tilde{f}=f-P_{A}f$ . Then,

\frac{1}{|F_{N}|}\sum_{u\in F_{N}}M_{u}A_{-u}f\cdot M_{u}g=\frac{1}{|F_{N}|}% \sum_{u\in F_{N}}M_{u}A_{-u}P_{A}f\cdot M_{u}g+\frac{1}{|F_{N}|}\sum_{u\in F_{% N}}M_{u}A_{-u}\tilde{f}\cdot M_{u}g.

(6.1)

As $P_{A}f$ is a constant by the ergodicity of $S_{A}$ , it follows by (the ergodic) Theorem 2.4 that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}M_{u}A_{-u}P_{A}f\cdot M_{u% }g=P_{M}g\cdot P_{A}f.

Hence, the proof will follow from (6.1) if we can show that

\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}M_{u}A_{-u}\tilde{f}\cdot M% _{u}g=0.

To this end, we let $a_{u}=M_{u}A_{-u}\tilde{f}\cdot M_{u}g$ , for $u\in K^{*}$ . By the van der Corput trick (see Lemma 2.5) for $(K^{*},\cdot)$ it suffices to show that

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}\limsup_{N\to\infty}\left|% \frac{1}{|F_{N}|}\sum_{u\in F_{N}}\langle a_{ub},a_{u}\rangle\right|=0.

(6.2)

To this end we note that for $b\neq 0$ ,

	$\displaystyle\langle a_{ub},a_{u}\rangle=\langle M_{ub}A_{-ub}\tilde{f}\cdot M% _{ub}g,M_{u}A_{-u}\tilde{f}\cdot M_{u}g\rangle$	$\displaystyle=$
	$\displaystyle\langle M_{b}A_{-ub}\tilde{f}\cdot M_{b}g,A_{-u}\tilde{f}\cdot g\rangle$	$\displaystyle=\int_{X}g\cdot M_{b}g\cdot M_{b}A_{-ub}\tilde{f}\cdot A_{-u}% \tilde{f}\ d\mu,$

where we have used that $M_{v}$ preserves $\mu$ . Hence, using the equality $M_{u}A_{v}=A_{uv}M_{u}$ (see 2.1), for all $u,v\in K^{*}$ , we have

\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\langle a_{ub},a_{u}\rangle=\frac{1}{|F_{N}|% }\sum_{u\in F_{N}}\int_{X}g\cdot M_{b}g\cdot A_{-ub^{2}}M_{b}\tilde{f}\cdot A_% {-u}\tilde{f}\ d\mu

and so it suffices to show that

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}\limsup_{N\to\infty}\left|% \frac{1}{|F_{N}|}\sum_{u\in F_{N}}\int_{X}g\cdot M_{b}g\cdot A_{-u}\tilde{f}% \cdot A_{-ub^{2}}M_{b}\tilde{f}\right|=0.

(6.3)

By Cauchy-Schwarz’s inequality and Lemma 2.8 the convergence in (6.3) follows from

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}\limsup_{N\to\infty}\left\|% \frac{1}{|F_{N}|}\sum_{u\in F_{N}}A_{-u}\tilde{f}\cdot A_{-ub^{2}}M_{b}\tilde{% f}\right\|^{2}_{2}=0.

Now, using Proposition 2.6 with $(G,\cdot)=(K,+)$ and $a_{u}(b)=A_{-u}\tilde{f}\cdot A_{-ub^{2}}M_{b}\tilde{f}$ , for any $u,b\in K$ , $b\neq 0$ , we reduce this to showing that

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}\limsup_{N\to\infty}\frac{1% }{|F_{N}|}\sum_{u\in F_{N}}\langle a_{u+d}(b),a_{u}(b)\rangle=0,

(6.4)

for any $d\neq 0$ . As before we see that

\langle a_{u+d}(b),a_{u}(b)\rangle=\int_{X}A_{u(b^{2}-1)-d}\tilde{f}\cdot A_{-% db^{2}}M_{b}\tilde{f}\cdot A_{u(b^{2}-1)}\tilde{f}\cdot M_{b}\tilde{f}\ d\mu.

Now, since $A_{u(b^{2}-1)-d}\tilde{f}\cdot A_{u(b^{2}-1)}\tilde{f}=A_{u(b^{2}-1)}\left(% \tilde{f}\cdot A_{-d}\tilde{f}\right)$ and for $b\notin\{-1,1\}$ , $p(x)=(b^{2}-1)x$ is a polynomial of degree $1$ in $K[x]$ , we may use the mean ergodic Theorem 2.4 to obtain that the averages in (6.4) become

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}\int_{X}P_{A}(\tilde{f}% \cdot A_{-d}\tilde{f})\cdot A_{-db^{2}}M_{b}\tilde{f}\cdot M_{b}\tilde{f}\ d\mu.

(6.5)

As $S_{A}$ is ergodic, the projection $P_{A}(\tilde{f}\cdot A_{-d}\tilde{f})$ is a constant and so, using (2.1) and the invariance of $\mu$ under $M_{v}$ once again, (6.5) becomes

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}P_{A}(\tilde{f}\cdot A_{-d}% \tilde{f})\int_{X}A_{-db}\tilde{f}\cdot\tilde{f}\ d\mu.

(6.6)

Because $(F_{M})_{M\in\mathbb{N}}$ is a double Følner sequence in $K$ and $d\neq 0$ it follows by Proposition 2.2 and the mean ergodic theorem that

\lim_{M\to\infty}\frac{1}{|F_{M}|}\sum_{b\in F_{M}}\int_{X}A_{-db}\tilde{f}% \cdot\tilde{f}\ d\mu=\int_{X}P_{A}\tilde{f}\cdot\tilde{f}\ d\mu=0,

by the definition of $\tilde{f}$ . Therefore, the limit in (6.6) equals zero and so (6.2) follows. ∎

From Theorem 6.2 we can readily recover Theorem 6.1.

Proof of Theorem 6.1.

For $B\in\mathscr{X}$ we see that

\displaystyle\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\mu(B\cap A_{-% u}B\cap M_{1/u}B)=\lim_{N\to\infty}\frac{1}{|F_{N}|}\sum_{u\in F_{N}}\int_{X}M% _{u}\mathbbm{1}_{B}\cdot M_{u}A_{-u}\mathbbm{1}_{B}\cdot\mathbbm{1}_{B}\ d\mu,

as in the proof of Corollary 3.1. By Theorem 6.2 for $f=g=\mathbbm{1}_{B}$ , this limit becomes

\int_{X}P_{A}\mathbbm{1}_{B}\cdot P_{M}\mathbbm{1}_{B}\cdot\mathbbm{1}_{B}\ d% \mu=P_{A}\mathbbm{1}_{B}\int_{X}P_{M}\mathbbm{1}_{B}\cdot\mathbbm{1}_{B}\ d\mu% \geq(\mu(B))^{3},

(6.7)

because $P_{A}\mathbbm{1}_{B}=\mu(B)$ , $P_{M}$ is an orthogonal projection and $P_{M}1=1$ .

For the second part, if in addition $S_{M}$ acts ergodically, then $P_{M}\mathbbm{1}_{B}=\mu(B)$ and the same method gives the result. ∎

7 Generalization of Shkredov’s theorem

This section is devoted to the proof of Theorem 1.18, which generalizes a result due to Shkredov pertaining to finite fields of prime order, as mentioned in Section 1.2. We actually prove the following slightly more general theorem.

Theorem 7.1.

Let $F$ be any finite field. Let also $B_{1},B_{2},B_{3}\subset F^{*}$ be any sets satisfying $|B_{1}||B_{2}||B_{3}|>7|F|^{5/2}$ . Then, there exists $u,v\in F^{*}$ such that $v\in B_{1},u+v\in B_{2}$ and $uv\in B_{3}$ .

We have stated Theorem 7.1 for subsets of $F^{*}$ because working with an indicator function $g=\mathbbm{1}_{B}$ of a set $B\subset F^{*}$ allows us to use inequalities like $\mu(B)\leq P_{M}g(x)\leq(|F|/|F^{*}|)\mu(B)$ , for all $x\neq 0$ , which simplifies the proof. However, we do not lose generality as our main result, Theorem 1.18, is an immediate corollary of Theorem 7.1.

Proof that Theorem 7.1 implies Theorem 1.18.

Let $B_{1},B_{2},B_{3}\subset F$ be any sets satisfying $|B_{1}||B_{2}||B_{3}|>8|F|^{5/2}$ and let $B^{\prime}_{i}=B_{i}\cap F^{*}\subset F^{*}$ , for $i=1,2,3$ . Then,

|B^{\prime}_{1}||B^{\prime}_{2}||B^{\prime}_{3}|\geq(|B_{1}|-1)(|B_{2}|-1)(|B_% {3}|-1)

and the right hand side is larger than

|B_{1}||B_{2}||B_{3}|-|B_{1}||B_{2}|-|B_{1}||B_{3}|-|B_{2}||B_{3}|\geq|B_{1}||% B_{2}||B_{3}|-3|F|^{2}>7|F|^{5/2},

where the last inequality holds because $3|F|^{2}\leq|F|^{5/2}$ , for any field of order at least $9$ . Then the result follows by an application of Theorem 7.1 for the sets $B^{\prime}_{1},B^{\prime}_{2},B^{\prime}_{3}$ . ∎

We now proceed to prove Theorem 7.1. This proof is an effort to a “finitise” the proof of Theorem 1.16. However, there are some additional technicalities here, because quantities that vanish in the infinite setting are replaced by “error” terms which are bounded (and go to $0$ asymptotically as $|F|$ increases to $\infty$ ).

As in the infinite setting, the proof of Theorem 7.1 relies on a finitistic version of the double ergodic theorem of Theorem 6.2, which is stated in Proposition 7.3 below. In order to ease the discussion, we first prove the following estimate that appears in the proof of the latter.

Proposition 7.2.

Let $F$ be any finite field and $f=\mathbbm{1}_{B}-\mu(B)$ for some $B\subset F^{*}$ . Then,

\frac{1}{|F^{*}|}\sum_{v\in F^{*}}\left\|\frac{1}{|F^{*}|}\sum_{u\in F}M_{v}A_% {-uv}f\cdot A_{-u}f\right\|^{2}_{2}\leq\frac{6}{|F|}\left\|f\right\|^{4}_{2}.

Proof.

By Proposition 2.7 we have that for any $v\in F^{*}$

\left\|\sum_{u\in F}M_{v}A_{-uv}f\cdot A_{-u}f\right\|^{2}_{2}=\sum_{u,w\in F}% \langle M_{v}A_{-(u+w)v}f\cdot A_{-(u+w)}f\ ,\ M_{v}A_{-uv}f\cdot A_{-u}f\rangle.

Now, as $M_{v}A_{-(u+w)v}=A_{-(u+w)v^{2}}M_{v}$ and $M_{v}A_{-uv}=A_{-uv^{2}}M_{v}$ by (2.1) and $A_{uv^{2}}$ preserves $\mu$ , we see that

\left\|\sum_{u\in F}M_{v}A_{-uv}f\cdot A_{-u}f\right\|^{2}_{2}=\sum_{u,w\in F}% \langle A_{-wv^{2}}M_{v}f\cdot A_{u(v^{2}-1)-w}f\ ,\ M_{v}f\cdot A_{u(v^{2}-1)% }f\rangle.

Observe that we can rewrite this as

\left\|\sum_{u\in F}M_{v}A_{-uv}f\cdot A_{-u}f\right\|^{2}_{2}=\sum_{u,w\in F}% \langle A_{u(v^{2}-1)}\left(f\cdot A_{-w}f\right)\ ,\ M_{v}\left(f\cdot A_{-wv% }f\right)\rangle.

(7.1)

Whenever $v^{2}\neq 1$ we have that

$\displaystyle\sum_{u,w\in F}\langle A_{u(v^{2}-1)}\left(f\cdot A_{-w}f\right)% \ ,\ M_{v}\left(f\cdot A_{-wv}f\right)\rangle$	$\displaystyle=$
$\displaystyle\sum_{w\in F}\langle\|F\|\cdot P_{A}\left(f\cdot A_{-w}f\right)\ ,% \ M_{v}\left(f\cdot A_{-wv}f\right)\rangle$	$\displaystyle=$	by definition of $P_{A}$
$\displaystyle\sum_{w\in F}\|F\|\cdot\int_{X}f\cdot A_{-w}f\ d\mu\int_{X}M_{v}% \left(f\cdot A_{-wv}f\right)\ d\mu$	$\displaystyle=$	by ergodicity of $S_{A}$
$\displaystyle\sum_{w\in F}\|F\|\cdot\int_{X}f\cdot A_{-w}f\ d\mu\int_{X}f\cdot A% _{-wv}f\ d\mu$	.	$\displaystyle\quad\text{by invariance of $M_{v}$}.$	(7.2)

Using (2.2) in (7.1) we see that

\frac{1}{|F^{*}|}\sum_{v\in F^{*}}\left\|\frac{1}{|F^{*}|}\sum_{u\in F}M_{v}A_% {-uv}f\cdot A_{-u}f\right\|^{2}_{2}=\\ \frac{|F|}{|F^{*}|^{3}}\sum_{v\notin\{0,1,-1\}}\sum_{w\in F}\int_{X}f\cdot A_{% -w}f\ d\mu\int_{X}f\cdot A_{-wv}f\ d\mu\ +\\ \frac{|F|}{|F^{*}|^{3}}\sum_{w\in F}\left(\langle f\cdot A_{-w}f\ ,\ f\cdot A_% {-w}f+M_{-1}\left(f\cdot A_{w}f\right)\rangle\right).

(7.3)

Moreover,

\sum_{w\in F}\langle f\cdot A_{-w}f\ ,\ f\cdot A_{-w}f\rangle=\langle f^{2}\ ,% \ \sum_{w\in F}A_{-w}f^{2}\rangle=|F|\cdot\left\|f\right\|^{4}_{2}

(7.4)

and similarly,

\sum_{w\in F}\langle f\cdot A_{-w}f\ ,\ M_{-1}(f\cdot A_{w}f)\rangle=\langle f% \cdot M_{-1}f\ ,\ \sum_{w\in F}A_{-w}(f\cdot M_{-1}f)\rangle\leq|F|\cdot\left% \|f\right\|^{4}_{2}.

(7.5)

Now, for each $w\neq 0$ , we have that

\sum_{v\in F}\int_{X}f\cdot A_{-w}f\ d\mu\int_{X}f\cdot A_{-wv}f\ d\mu=\int_{X% }f\cdot A_{-w}f\ d\mu\int_{X}f\cdot P_{A}f\ d\mu=0

and so

\sum_{v\in F}\sum_{w\in F}\int_{X}f\cdot A_{-w}f\ d\mu\int_{X}f\cdot A_{-wv}f% \ d\mu=\sum_{v\in F}\left(\int_{X}f^{2}\ d\mu\right)^{2}=|F|\cdot\left\|f% \right\|^{4}_{2}.

Therefore,

\sum_{v\notin\{0,1,-1\}}\sum_{w\in F}\int_{X}f\cdot A_{-w}f\ d\mu\int_{X}f% \cdot A_{-wv}f\ d\mu=\\ |F|\cdot\left\|f\right\|^{4}_{2}\ -\sum_{v\in\{0,1,-1\}}\sum_{w\in F}\int_{X}f% \cdot A_{-w}f\ d\mu\int_{X}f\cdot A_{-wv}f\ d\mu\leq 2\cdot|F|\cdot\left\|f% \right\|^{4}_{2}.

(7.6)

The last inequality follows because the rightmost sum vanishes for $v=0$ and is non-negative when $v=1$ . In view of (7.6), the equality in (7.3) is replaced by

\frac{1}{|F^{*}|}\sum_{v\in F^{*}}\left\|\frac{1}{|F^{*}|}\sum_{u\in F}M_{v}A_% {-uv}f\cdot A_{-u}f\right\|^{2}_{2}\leq 2\frac{|F|^{2}}{|F^{*}|^{3}}\left\|f% \right\|^{4}_{2}+2\frac{|F|^{2}}{|F^{*}|^{3}}\left\|f\right\|^{4}_{2}\leq\frac% {6}{|F|}\left\|f\right\|^{4}_{2},

where in the first inequality we also used (7.4) and (7.5) and the last inequality holds whenever $|F|\geq 8$ . ∎

We now prove Proposition 7.3.

Proposition 7.3.

Let $F$ be any finite field and let $f=\mathbbm{1}_{B}-\mu(B)$ for some $B\subset F^{*}$ and $g=\mathbbm{1}_{C}$ , for some $C\subset F^{*}$ . Then

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}g\right\|_{2}^% {2}\leq\\ \frac{7}{\sqrt{|F|}}\mu(B)\mu(C).

(7.7)

Proof.

By Proposition 2.7 and the fact that $M_{u}$ preserves $\mu$ for all $u\in F^{*}$ we see that

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}g\right\|_{2}^% {2}=\frac{1}{|F^{*}|^{2}}\sum_{u,v\in F^{*}}\langle M_{v}A_{-uv}f\cdot M_{v}g% \ ,\ A_{-u}f\cdot g\rangle.

As all functions are real-valued, the above can be rewritten as

\langle g\ ,\ \frac{1}{|F^{*}|}\sum_{v\in F^{*}}M_{v}g\cdot\left(\frac{1}{|F^{% *}|}\sum_{u\in F^{*}}M_{v}A_{-uv}f\cdot A_{-u}f\right)\rangle.

Hence, using the Cauchy-Schwarz inequality we see that

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}g\right\|_{2}^% {2}\leq\left\|g\right\|_{2}\left\|\frac{1}{|F^{*}|}\sum_{v\in F^{*}}M_{v}g% \cdot\left(\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{v}A_{-uv}f\cdot A_{-u}f\right)% \right\|_{2}.

(7.8)

By the triangle inequality, the right hand side in (7.8) is less than or equal to

\left\|g\right\|_{2}\left\|\frac{1}{|F^{*}|}\sum_{v\in F^{*}}M_{v}g\cdot\left(% \frac{1}{|F^{*}|}\sum_{u\in F}M_{v}A_{-uv}f\cdot A_{-u}f\right)\right\|_{2}+% \left\|g\right\|_{2}\left\|\frac{1}{|F^{*}|^{2}}\sum_{v\in F^{*}}M_{v}g\cdot M% _{v}f\cdot f\right\|_{2}

and then

\left\|g\right\|_{2}\left\|\frac{1}{|F^{*}|^{2}}\sum_{v\in F^{*}}M_{v}g\cdot M% _{v}f\cdot f\right\|_{2}=\frac{1}{|F^{*}|}\left\|g\right\|_{2}\left\|P_{M}(f% \cdot g)\cdot f\right\|_{2}\leq\frac{|F|}{|F^{*}|^{2}}\left\|g\right\|^{2}_{2}% \left\|f\right\|^{2}_{2},

as $g(0)=0$ and so $P_{M}(f\cdot g)\leq\left(|F|/|F^{*}|\right)\langle f,g\rangle\leq\left(|F|/|F^% {*}|\right)\left\|f\right\|\left\|g\right\|$ , by the comments after Theorem 7.1 and the Cauchy-Schwarz inequality. Therefore,

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}g\right\|_{2}^% {2}\leq\\ \left\|g\right\|_{2}\left\|\frac{1}{|F^{*}|}\sum_{v\in F^{*}}M_{v}g\cdot\left(% \frac{1}{|F^{*}|}\sum_{u\in F}M_{v}A_{-uv}f\cdot A_{-u}f\right)\right\|_{2}+% \frac{|F|}{|F^{*}|^{2}}\left\|g\right\|^{2}_{2}\left\|f\right\|^{2}_{2}.\

(7.9)

By an application of Cauchy-Schwarz’s inequality for sums of products we have that

$\displaystyle\left\\|\frac{1}{\|F^{}\|}\sum_{v\in F^{}}M_{v}g\cdot\left(\frac{1% }{\|F^{*}\|}\sum_{u\in F}M_{v}A_{-uv}f\cdot A_{-u}f\right)\right\\|^{2}_{2}$	$\displaystyle\leq$
$\displaystyle\int_{X}\frac{1}{\|F^{}\|}\sum_{v\in F^{}}(M_{v}g)^{2}\cdot\frac{% 1}{\|F^{}\|}\sum_{v\in F^{}}\left(\frac{1}{\|F^{*}\|}\sum_{u\in F}M_{v}A_{-uv}f% \cdot A_{-u}f\right)^{2}\ d\mu$	$\displaystyle=$
$\displaystyle\int_{X}P_{M}g\cdot\frac{1}{\|F^{}\|}\sum_{v\in F^{}}\left(\frac{% 1}{\|F^{*}\|}\sum_{u\in F}M_{v}A_{-uv}f\cdot A_{-u}f\right)^{2}\ d\mu$	$\displaystyle\leq$
$\displaystyle\frac{\|F\|}{\|F^{}\|}\mu(C)\ \cdot\frac{1}{\|F^{}\|}\sum_{v\in F^{}% }\left\\|\frac{1}{\|F^{}\|}\sum_{u\in F}M_{v}A_{-uv}f\cdot A_{-u}f\right\\|^{2}_{2}$	$\displaystyle\ .$	(7.10)

By Proposition 7.2 we see that

\frac{1}{|F^{*}|}\sum_{v\in F^{*}}\left\|\frac{1}{|F^{*}|}\sum_{u\in F}M_{v}A_% {-uv}f\cdot A_{-u}f\right\|^{2}_{2}\leq\frac{6}{|F|}\left\|f\right\|^{4}_{2}.

Using this in (7.10) and the bound in (7.9) we have that

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}g\right\|_{2}^% {2}\leq\frac{\sqrt{6}}{\sqrt{|F|}}\frac{\sqrt{|F|}}{\sqrt{|F^{*}|}}\left\|g% \right\|^{2}_{2}\left\|f\right\|^{2}_{2}+\frac{|F|}{|F^{*}|^{2}}\left\|g\right% \|^{2}_{2}\left\|f\right\|^{2}_{2}\leq\frac{\sqrt{6}+1}{\sqrt{|F^{*}|}}\left\|% g\right\|^{2}_{2}\left\|f\right\|^{2}_{2}

Finally, it follows by the definition of $f$ that $\left\|f\right\|^{2}_{2}\leq 2\mu(B)$ , as shown in the proof of Theorem $5.1$ in [3]. In conclusion, (7.9) becomes

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}g\right\|_{2}^% {2}\leq\frac{8}{\sqrt{|F|}}\mu(B)\mu(C),

since $2(\sqrt{6}+1)\sqrt{|F|\big{/}|F^{*}|}\leq 8,$ whenever $|F|\geq 8$ . ∎

We are finally in the position to prove the main result of this section, Theorem 7.1.

Proof of Theorem 7.1.

Using the same notation as in Section 4, the assumption of Theorem 7.1 can be rewritten as $\mu(B_{1})\mu(B_{2})\mu(B_{3})>7/\sqrt{|F|}$ and its conclusion is equivalent to the existence of $u\in F^{*}$ so that $\mu(B_{1}\cap A_{-u}B_{2}\cap M_{1/u}B_{3})>0,$ where $\mu$ is the normalised counting measure on $F$ . It will thus suffice to show that $\sum_{u\in F^{*}}\mu(B_{1}\cap A_{-u}B_{2}\cap M_{1/u}B_{3})>0.$ Using the fact that $M_{u}$ preserves $\mu$ for all $u\in F^{*}$ , this is equivalent to

\langle\mathbbm{1}_{B_{3}}\ ,\ \frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}% \mathbbm{1}_{B_{2}}\cdot M_{u}\mathbbm{1}_{B_{1}}\rangle>0.

(7.11)

We let $f=\mathbbm{1}_{B_{2}}-P_{A}\mathbbm{1}_{B_{2}}$ . Observe that $P_{A}f=0$ and $P_{A}\mathbbm{1}_{B_{2}}=\mu(B_{2})$ is a constant. Then,

\langle\mathbbm{1}_{B_{3}}\ ,\ \frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}% \mathbbm{1}_{B_{2}}\cdot M_{u}\mathbbm{1}_{B_{1}}\rangle=\\ \mu(B_{2})\langle\mathbbm{1}_{B_{3}}\ ,\ \frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{% u}\mathbbm{1}_{B_{1}}\rangle+\langle\mathbbm{1}_{B_{3}}\ ,\ \frac{1}{|F^{*}|}% \sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}\mathbbm{1}_{B_{1}}\rangle=\\ \mu(B_{2})\langle\mathbbm{1}_{B_{3}}\ ,P_{M}\mathbbm{1}_{B_{1}}\rangle+\langle% \mathbbm{1}_{B_{3}}\ ,\ \frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_% {u}\mathbbm{1}_{B_{1}}\rangle.

(7.12)

As $B_{1}\subset F^{*}$ it follows by the comments after Theorem 7.1 that

\mu(B_{2})\langle\mathbbm{1}_{B_{3}}\ ,P_{M}\mathbbm{1}_{B_{1}}\rangle\geq\mu(% B_{1})\mu(B_{2})\mu(B_{3}).

Using this in (7.12), we reduce (7.11) to showing that

\left|\langle\mathbbm{1}_{B_{3}}\ ,\ \frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_% {-u}f\cdot M_{u}\mathbbm{1}_{B_{1}}\rangle\right|<\mu(B_{1})\mu(B_{2})\mu(B_{3% }).

Applying the Cauchy-Schwarz inequality the latter follows from showing that

\left\|\mathbbm{1}_{B_{3}}\right\|\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{% u}A_{-u}f\cdot M_{u}\mathbbm{1}_{B_{1}}\right\|_{2}<\mu(B_{1})\mu(B_{2})\mu(B_% {3}).

(7.13)

In Proposition 7.3 we showed that

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}\mathbbm{1}_{B% _{1}}\right\|_{2}^{2}\leq\frac{7}{\sqrt{|F|}}\mu(B_{1})\mu(B_{2})

and since $\left\|\mathbbm{1}_{B_{3}}\right\|=\sqrt{\mu(B_{3})}$ , we see that (7.13) holds whenever

\frac{\sqrt{7}}{|F|^{1/4}}\sqrt{\mu(B_{1})\mu(B_{2})\mu(B_{3})}<\mu(B_{1})\mu(% B_{2})\mu(B_{3}),

which is equivalent to our main assumption, namely that $7\big{/}\sqrt{|F|}<\mu(B_{1})\mu(B_{2})\mu(B_{3})$ . ∎

As a corollary of the proof we get the following quantitative result.

Corollary 7.4.

Let $F$ be any finite field. Let also $B_{1},B_{2},B_{3}\subset F^{*}$ be any sets satisfying $|B_{1}||B_{2}||B_{3}|>7|F|^{5/2}$ . Then, for each $s<\ell:=\min{\{|B_{1}|,|B_{2}|,|B_{3}|\}}$ there is a set $D\subset F^{*}$ of cardinality

|D|\geq\frac{|B_{1}||B_{2}||B_{3}||F^{*}|\big{/}|F|^{2}-\sqrt{7|B_{1}||B_{2}||% B_{3}||F^{*}|^{2}\big{/}|F|^{3/2}}-s|F^{*}|}{\ell},

so that for each $u\in D$ there are $s$ choices for $v\in F$ such that $v\in B_{1}$ , $u+v\in B_{2}$ and $uv\in B_{3}$ .

Proof.

Let $\delta=s/|F|$ for any $s$ as above and let

D=\{u\in F^{*}:\mu(B_{3}\cap M_{u}A_{-u}B_{2}\cap M_{u}B_{1})>\delta\}.

Similarly to the proof of Corollary 4.5, it follows from the proof of Theorem 7.1 that

\frac{|D|}{|F^{*}|}\geq\frac{\mu(B_{1})\mu(B_{2})\mu(B_{3})-\sqrt{7\mu(B_{1})% \mu(B_{2})\mu(B_{3})\big{/}|F|^{1/2}}-\delta}{m},

(7.14)

where $m:=\min{\{\mu(B_{1}),\mu(B_{2}),\mu(B_{3})\}}$ . By the definition of $\mu$ , (7.14) is equivalent to

|D|\geq\frac{|B_{1}||B_{2}||B_{3}||F^{*}|\big{/}|F|^{2}-\sqrt{7|B_{1}||B_{2}||% B_{3}||F^{*}|^{2}\big{/}|F|^{3/2}}-s|F^{*}|}{\ell}.

(7.15)

Finally, we see that for each $u\in D$ ,

\frac{s}{|F|}\leq\mu(B_{3}\cap M_{u}A_{-u}B_{2}\cap M_{u}B_{1})=\mu(M_{1/u}B_{% 3}\cap A_{-u}B_{2}\cap B_{1})=\frac{\left|M_{1/u}B_{3}\cap A_{-u}B_{2}\cap B_{% 1}\right|}{|F|}

and thus there are $s$ choices for $v\in F$ satisfying $v\in B_{1},v+u\in B_{2}$ and $vu\in B_{3}$ . ∎

Remark 7.5.

The proof of Corollary 7.4 shows in particular that if $A\subset F$ satisfies $|A|\geq\alpha|F|$ , for some $\alpha\in(0,1)$ , then $|D|\geq c_{\alpha}|F|$ , for some constant $c_{\alpha}>0$ that does not depend on $F$ . This follows by taking $B_{1}=B_{2}=B_{3}=A$ above and choosing $s=\alpha^{\prime}|F|$ for some $\alpha^{\prime}<\alpha$ and $n\in\mathbb{N}$ large enough so that the right hand side in (7.15) is positive whenever $|F|>n$ . Thus, there are $s|D|\geq c^{\prime}_{\alpha}|F|^{2}$ triples $\{v,v+u,vu\}\subset A$ , where $c^{\prime}_{\alpha}>0$ is another constant that does not depend on $|F|$ .

8 A conditional generalisation of Green and Sanders’ theorem

In Section 5 we devised a finitistic “colouring trick” to prove Theorem 1.15 from Corollary 4.5. Now, using a similar argument and a finitistic version of Conjecture 1.17 as our basis we will prove a generalisation of Green and Sanders’ theorem about “monochromatic sums and products” in finite fields as mentioned in the introduction.

Before stating the aforementioned conjecture, we make another related conjecture that would generalise a special case of Theorem 7.1.

Conjecture 8.1.

Let $F$ be any finite field and assume that $\mathscr{A}_{F}$ acts by m.p.t. on a probability space $(X,\mathscr{X},\nu)$ . Let $B\in\mathscr{X}$ be a set with $\nu(B)>\left(c\big{/}|F|\right)^{a}$ , for some constants $a,c>0$ . Then, there exists $u\in F^{*}$ such that

\nu(B\cap A_{-u}B\cap M_{1/u}B)>0.

Remark.

Observe that when $X=F$ and $\nu=\mu$ , the counting measure on $F$ , Theorem 7.1 with $B_{1}=B_{2}=B_{3}$ is a special of this conjecture with $a=1/6$ . However, for this special case we knew that the additive action of $S_{A}$ is ergodic, which seems to have been heavily used in the proof of Theorem 7.1, and is no longer true in the general case.

For the purpose of proving the generalisation of Green and Sanders’ theorem, that is, Conjecture 1.19, we actually need only consider a special case of Conjecture 8.1 with $X=F^{m}$ and $\nu=\mu^{m}$ , some $m\in\mathbb{N}$ , where $\mu$ is the counting measure on $F$ , and $B=B_{1}\times\dots\times B_{m}\subset F^{m}$ is a set with $\nu(B)>\left(c\big{/}|F|\right)^{a}$ , for some constants $a,c>0$ .

A way one could try to prove the aforementioned special case of Conjecture 8.1 would start by decomposing $g=\mathbbm{1}_{B}$ as $P_{A}g+f$ , where $f=g-P_{A}g$ . Then, following Section 7 and considering the inner product $\langle f,g\rangle=\frac{1}{|F^{m}|}\sum_{x\in F^{m}}f(x)\cdot\overline{g(x)}$ , one would have to show that

\frac{1}{|F|}\sum_{u\in F^{*}}\langle g,M_{u}A_{-u}P_{A}g\cdot M_{u}g\rangle+% \frac{1}{|F|}\sum_{u\in F^{*}}\langle g,M_{u}A_{-u}f\cdot M_{u}g\rangle>0.

(8.1)

This time $P_{A}g$ is not necessarily a constant, however we still have that

\frac{1}{|F|}\sum_{u\in F^{*}}\langle g,M_{u}A_{-u}P_{A}g\cdot M_{u}g\rangle=% \langle g,P_{M}(P_{A}g\cdot g)\rangle\geq(\nu(B))^{4}.

Indeed, as $P_{A}g\leq 1$ and $P_{M}$ is an orthogonal projection with $P_{M}1=1$ we have

\langle g,P_{M}(P_{A}g\cdot g)\rangle\geq\langle P_{A}g\cdot g,P_{M}(P_{A}g% \cdot g)\rangle=\left\|P_{M}(P_{A}g\cdot g)\right\|^{2}_{2}\geq\left(\int_{F^{% m}}P_{A}g\cdot g\ d\nu\right)^{2},

where the last inequality is Cauchy-Schwarz. Then, arguing similarly for $P_{A}$ we have

\left(\int_{F^{m}}P_{A}g\cdot g\ d\nu\right)^{2}\geq\left(\int_{F^{m}}g\ d\nu% \right)^{4}=(\nu(B))^{4}.

Therefore, the proof would follow from the following statement, which is precisely what we are going to use.

Conjecture 8.2.

Let $F$ be any finite field and let $m\in\mathbb{N}$ . Consider the coordinate-wise affine action of $\mathscr{A}_{F}$ by m.p.t. on $(F^{m},\nu)$ , where $\nu=\mu^{m}=\mu\times\dots\times\mu$ . Let $f=\mathbbm{1}_{B}-P_{A}(\mathbbm{1}_{B})$ , where $B=B_{1}\times\dots\times B_{m}\subset F^{m}$ and $g=\mathbbm{1}_{B}$ . Then,

\left\|\frac{1}{|F^{*}|}\sum_{u\in F^{*}}M_{u}A_{-u}f\cdot M_{u}g\right\|_{2}% \leq\frac{c}{|F|^{b}}\left\|f\right\|_{2}\left\|g\right\|_{2},

for some $b,c>0$ .

As a corollary of Conjecture 8.2 we get the following estimates on the set of return times in the special case of Conjecture 8.1 that we need. The (conditional) proof is a straightforward adjustment of the proof of Corollary 7.4 and so we omit it.

Conjecture 8.3.

Let $F$ be a finite field and $m\in\mathbb{N}$ . Assume that $\mathscr{A}_{F}$ acts on $(F^{m},\nu)$ by m.p.t. as above. Let $B=B_{1}\times\dots\times B_{m}\subset F^{m}$ and $\delta<\nu(B)$ . Then, the set

D:=\{u\in F^{*}:\nu(B\cap A_{-u}B\cap M_{1/u}B)>\delta\},

satisfies

\frac{|D|}{|F^{*}|}\geq\frac{(\nu(B))^{4}-c\cdot(\nu(B))^{3/2}\big{/}|F|^{b}-% \delta}{\nu(B)}.

(8.2)

We are now in a position to apply a version of the finitary “colouring trick” and recover Conjecture 1.19, which we recall for convenience.

Conjecture 1.19.

Let $r\in\mathbb{N}$ be a number of colours. Then, there is $n(r)\in\mathbb{N}$ , so that for any finite field $F$ with $|F|\geq n(r)$ , any colouring $F=C_{1}\cup\cdots\cup C_{r}$ contains $d_{r}|F|^{2}$ monochromatic quadruples $\{u,v,u+v,uv\}$ , where $d_{r}>0$ is some constant that does not depend on $|F|$ .

Remark 8.4.

Setting $d_{r}^{\prime}=d_{r}/r$ we get a colour class containing at least $d_{r}^{\prime}|F|^{2}$ monochromatic patterns of the form $\{u,v,u+v,uv\}$ . Moreover, the proof gives an upper bound smaller than $n(r)=r^{4^{(r+2)}}$ for the $r$ -Ramsey number for monochromatic patterns $\{u,v,u+v,uv\}$ in this setting. That is, this conditional proof guarantees that for any $r$ -colouring of a finite field $F$ with $|F|\geq r^{4^{(r+2)}}$ , one of the colours must contain a non-trivial quadruple $\{u,v,u+v,uv\}$ .

Proof.

Let $r\in\mathbb{N}$ , $r>1$ , be fixed and let $F$ be any finite field with $|F|\geq n(r)$ , for $n(r)$ to be determined later. For an $r$ -colouring of such a field we can permute the colours if necessary and assume that $|C_{1}|\geq|C_{2}|\geq\dots\geq|C_{r}|$ . Clearly, then, $|C_{1}|\geq|F|\big{/}r$ . Next, we pick a number $1\leq r^{\prime}\leq r$ in the following manner. If $|C_{2}|<|F|\big{/}r^{16}$ , we set $r^{\prime}=1$ . Else, we have that $|C_{2}|\geq|F|\big{/}r^{16}$ and $r^{\prime}\geq 2$ . Then, we either have that $|C_{3}|\geq|F|\big{/}r^{64}$ , whence $r^{\prime}\geq 2$ or not and let $r^{\prime}=2$ . Proceeding in this fashion we set

r^{\prime}:=\max{\Big{\{}1\leq j\leq r:|C_{1}|\geq|F|\big{/}r\ ,\ |C_{2}|\geq|% F|\big{/}r^{16}\ ,\ \dots\ ,\ |C_{j}|\geq|F|\big{/}r^{4^{j}}\Big{\}}}.

Let $C=C_{1}\times\dots\times C_{r^{\prime}}$ . We consider the natural measure preserving action of $\mathscr{A}_{F}$ on $F^{r^{\prime}}$ (defined coordinate-wise), with the counting measure $\nu$ given by $\nu(E)=|E|/|F^{r^{\prime}}|$ , for any $E\subset F^{r^{\prime}}$ . So, for $C_{1},\dots,C_{r^{\prime}}\subset F$ we have that $\nu(C_{1}\times\cdots\times C_{r^{\prime}})=\mu(C_{1})\cdots\mu(C_{r^{\prime}})$ , where $\mu$ is the normalised counting measure on $F$ . For any $\delta:=s\big{/}|F^{*}|<\nu(C)$ let

D=\{u\in F^{*}:\nu(C\cap A_{-u}C\cap M_{1/u}C)>\delta\}.

Then, by Corollary 8.3 we have that

\frac{|D|}{|F^{*}|}\geq\frac{(\nu(C))^{4}-c\cdot(\nu(C))^{3/2}\big{/}|F^{*}|^{% b}-\delta}{\nu(C)},

which implies that

|D|\geq(\nu(C))^{3}|F^{*}|-c\cdot|F^{*}|^{1-b}-\frac{|F^{*}|\delta}{\nu(C)}.

(8.3)

We want to bound below the size of $D\setminus\left(C_{r^{\prime}+1}\cup\cdots\cup C_{r}\right)$ , because, for any element $u$ in this set, it holds that $u\in C_{1}\cup\cdots\cup C_{r^{\prime}}$ and also that $\nu(C\cap A_{-u}C\cap M_{1/u}C)>\delta$ . Then, if $u\in C_{j}$ , for $1\leq j\leq r^{\prime}$ , by the definition of $C$ and the measure $\nu$ we have that $\mu(C_{j}\cap A_{-u}C_{j}\cap M_{1/u}C)>\delta$ and hence $|C_{j}\cap C_{j}/u\cap(C_{j}-u)|>s$ , which implies the existence of at least $s-$ elements $v\in F^{*}$ such that $\{u,v,u+v,uv\}\subset C_{j}$ . To this end, by the choice of $r^{\prime}$ we have

|C_{r^{\prime}+1}|+\dots+|C_{r}|\leq(r-r^{\prime})|F|\big{/}r^{4^{(r^{\prime}+% 1)}}<|F|\big{/}r^{4^{(r^{\prime}+1)}-1}.

(8.4)

Using the definition of $C$ and $r^{\prime}$ it holds that

\nu(C)=\frac{|C_{1}|\cdots|C_{r^{\prime}}|}{|F^{r^{\prime}}|}\geq\frac{1}{r}% \cdot\frac{1}{r^{16}}\cdot\frac{1}{r^{64}}\cdots\frac{1}{r^{4^{r^{\prime}}}}=% \frac{1}{r^{(1+16+64+\dots+4^{r^{\prime}})}}.

(8.5)

Now,

\left|D\setminus\left(C_{r^{\prime}+1}\cup\cdots\cup C_{r}\right)\right|\geq|D% |-\left(|C_{r^{\prime}+1}|+\dots+|C_{r}|\right)

and so by (8.3), (8.4) and (8.5) we see that

\left|D\setminus\left(C_{r^{\prime}+1}\cup\cdots\cup C_{r}\right)\right|\geq|F% ^{*}|\big{/}r^{3(1+16+64+\dots+4^{r^{\prime}})}-c\cdot|F^{*}|^{1-b}-\frac{|F^{% *}|\delta}{\nu(C)}-|F|\big{/}r^{4^{(r^{\prime}+1)}-1}.

(8.6)

The quantity at the right hand side of (8.6) can be rewritten as

|F^{*}|\left(1\big{/}r^{3(1+16+64+\dots+4^{r^{\prime}})}-1\big{/}r^{4^{(r^{% \prime}+1)}-1}-\delta\big{/}\nu(C)\right)-c\cdot|F^{*}|^{1-b}-1\big{/}r^{4^{(r% ^{\prime}+1)}-1}.

Now, one can see that⁴⁴4For $r^{\prime}\geq 2$ we have that $4^{(r^{\prime}+1)}-1-3\left(4^{r^{\prime}}+\dots+4^{2}+1\right)=12$

\frac{1}{r^{3(1+16+\dots+4^{r^{\prime}})}}-\frac{1}{r^{4^{(r^{\prime}+1)}-1}}=% \frac{r^{4^{(r^{\prime}+1)}-1-3(4^{r^{\prime}}+\dots+4^{2}+1)}-1}{r^{4^{(r^{% \prime}+1)}-1}}=\frac{r^{12}-1}{r^{4^{(r^{\prime}+1)}-1}}.

Therefore, the right hand side of (8.6) is greater than or equal to

|F^{*}|\left(\frac{r^{12}-1}{r^{4^{(r^{\prime}+1)}-1}}-\delta\cdot r^{(1+16+% \dots+4^{r^{\prime}})}\right)-c\cdot|F^{*}|^{1-b}-1\big{/}r^{4^{(r^{\prime}+1)% }-1}=c_{r}\cdot|F^{*}|,

(8.7)

which follows by setting

c_{r}=\frac{r^{12}-1}{r^{4^{(r^{\prime}+1)}-1}}-\delta\cdot r^{(1+16+\dots+4^{% r^{\prime}})}-c\big{/}|F^{*}|^{b}-1\big{/}\left(|F^{*}|r^{4^{(r^{\prime}+1)}-1% }\right).

Recall that $|F|\geq n(r)$ . We choose $n(r)$ large enough to guarantee that $c_{r}>0$ . Since $\delta=s\big{/}|F^{*}|$ and for any $u\in D\setminus\left(C_{r^{\prime}+1}\cup\cdots\cup C_{r}\right)$ we have at least $s$ monochromatic quadruples $\{u,v,u+v,uv\}$ , it follows by (8.7) that there are in total at least

s\cdot c_{r}\cdot|F^{*}|=\delta\cdot c_{r}\cdot|F^{*}|^{2}=d_{r}|F|^{2}\

monochromatic patterns of the form $\{u,v,u+v,uv\}$ , where $d_{r}>0$ is a constant that does not depend on the size of $F$ . ∎

REFERENCES

References

[1] V. Bergelson. Combinatorial and diophantine applications of ergodic theory. In B. Hasselblatt and A. Katok, editors, Handbook of Dynamical Systems, volume 1B, pages 745–841. Elsevier, 2006.
[2] V. Bergelson and J. Moreira. Van der Corput’s difference theorem: Some modern developments. Indagationes Mathematicae, 27(2), 437-479, 2016.
[3] V. Bergelson and J. Moreira. Ergodic theorem involving additive and multiplicative groups of a field and $\{x+y,xy\}$ patterns. Ergodic Theory and Dynamical Systems, 37(3), pp.673-692, 2017.
[4] M. Bowen and M. Sabok. Monochromatic products and sums in the rationals. arXiv preprint arXiv:2210.12290, 2022
[5] J. Cilleruelo. Combinatorial problems in finite fields and Sidon sets. Combinatorica, 32(5):497–511, 2012. 1, 3, 22
[6] H. Furstenberg. Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J. d’Analyse Math., 31:204–256, 1977.
[7] B. Green and T. Sanders. Monochromatic sums and products. Discrete Analysis, pages 1–43, 2016:5.
[8] B. Hanson. Capturing forms in dense subsets of finite fields. Acta Arith., 160(3):277– 284, 2013. 3, 22
[9] N. Hindman, I. Leader, and D. Strauss. Open problems in partition regularity. Combin. Probab. Comput., 12(5-6):571–583, 2003. Special issue on Ramsey theory.
[10] B. Host and B. Kra. Nilpotent structures in ergodic theory, volume 236 of Mathematical Surveys and Monographs. American Mathematical Society, Providence, RI, 2018
[11] J. Moreira, Monochromatic sums and products in $\mathbb{N}$ , Ann.of Math.185 (2017), 1069– 1090.
[12] I. D. Shkredov. On monochromatic solutions of some nonlinear equations in $\mathbb{Z}/p\mathbb{Z}$ . Mat. Zametki, 88(4):625–634, 2010.

Department of Mathematics, University of Warwick

E-mail address : [email protected]

	$\displaystyle\frac{1}{\|F^{}\|}\sum_{u\in F^{}}\mu(B\cap M_{u}A_{-p(u)}C)\ =\$	$\displaystyle\langle\mathbbm{1}_{B},\frac{1}{\|F^{}\|}\sum_{u\in F^{}}M_{u}A_{% -p(u)}\mathbbm{1}_{C}\rangle=$
	$\displaystyle\langle\mathbbm{1}_{B},\frac{1}{\|F^{}\|}\sum_{u\in F^{}}M_{u}A_{% -p(u)}P_{A}\mathbbm{1}_{C}\rangle\ +$	$\displaystyle\ \langle\mathbbm{1}_{B},\frac{1}{\|F^{}\|}\sum_{u\in F^{}}M_{u}A% _{-p(u)}f\rangle,$		(4.10)