ErdΕ‘s-Rogers functions for arbitrary pairs of graphs

Dhruv Mubayi Department of Mathematics, Statistics and Computer Science, University of Illinois, Chicago, IL 60607. Email: [email protected]. Research partially supported by NSF Awards DMS-1952767 and DMS-2153576 and a Simons Fellowship.    Jacques VerstraΓ«te Department of Mathematics, University of California, San Diego, CA, 92093-0112 USA. Email: [email protected]. Research supported by NSF award DMS-1952786.
Abstract

Let fF,G⁒(n)subscript𝑓𝐹𝐺𝑛f_{F,G}(n)italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) be the largest size of an induced F𝐹Fitalic_F-free subgraph that every n𝑛nitalic_n-vertex G𝐺Gitalic_G-free graph is guaranteed to contain. We prove that for any triangle-free graph F𝐹Fitalic_F,

fF,K3⁒(n)=fK2,K3⁒(n)1+o⁒(1)=n12+o⁒(1).subscript𝑓𝐹subscript𝐾3𝑛subscript𝑓subscript𝐾2subscript𝐾3superscript𝑛1π‘œ1superscript𝑛12π‘œ1f_{F,K_{3}}(n)=f_{K_{2},K_{3}}(n)^{1+o(1)}=n^{\frac{1}{2}+o(1)}.italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) start_POSTSUPERSCRIPT 1 + italic_o ( 1 ) end_POSTSUPERSCRIPT = italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_o ( 1 ) end_POSTSUPERSCRIPT .

Along the way we give a slight improvement of a construction of ErdΕ‘s-Frankl-RΓΆdl for the Brown-ErdΕ‘s-SΓ³s (3⁒rβˆ’3,3)3π‘Ÿ33(3r-3,3)( 3 italic_r - 3 , 3 )-problem when rπ‘Ÿritalic_r is large.

In contrast to our result for K3subscript𝐾3K_{3}italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, for any K4subscript𝐾4K_{4}italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT-free graph F𝐹Fitalic_F containing a cycle, we prove there exists cF>0subscript𝑐𝐹0c_{F}>0italic_c start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT > 0 such that

fF,K4⁒(n)>fK2,K4⁒(n)1+cF=n13+cF+o⁒(1).subscript𝑓𝐹subscript𝐾4𝑛subscript𝑓subscript𝐾2subscript𝐾4superscript𝑛1subscript𝑐𝐹superscript𝑛13subscriptπ‘πΉπ‘œ1f_{F,K_{4}}(n)>f_{K_{2},K_{4}}(n)^{1+c_{F}}=n^{\frac{1}{3}+c_{F}+o(1)}.italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) > italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) start_POSTSUPERSCRIPT 1 + italic_c start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_POSTSUPERSCRIPT = italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 3 end_ARG + italic_c start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT + italic_o ( 1 ) end_POSTSUPERSCRIPT .

For every graph G𝐺Gitalic_G, we prove that there exists Ξ΅G>0subscriptπœ€πΊ0\varepsilon_{G}>0italic_Ξ΅ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT > 0 such that whenever F𝐹Fitalic_F is a non-empty graph such that G𝐺Gitalic_G is not contained in any blowup of F𝐹Fitalic_F, then fF,G⁒(n)=O⁒(n1βˆ’Ξ΅G)subscript𝑓𝐹𝐺𝑛𝑂superscript𝑛1subscriptπœ€πΊf_{F,G}(n)=O(n^{1-\varepsilon_{G}})italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) = italic_O ( italic_n start_POSTSUPERSCRIPT 1 - italic_Ξ΅ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ). On the other hand, for graph G𝐺Gitalic_G that is not a clique, and every Ξ΅>0πœ€0\varepsilon>0italic_Ξ΅ > 0, we exhibit a G𝐺Gitalic_G-free graph F𝐹Fitalic_F such that fF,G⁒(n)=Ω⁒(n1βˆ’Ξ΅)subscript𝑓𝐹𝐺𝑛Ωsuperscript𝑛1πœ€f_{F,G}(n)=\Omega(n^{1-\varepsilon})italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) = roman_Ξ© ( italic_n start_POSTSUPERSCRIPT 1 - italic_Ξ΅ end_POSTSUPERSCRIPT ).

1 Introduction

Say that a graph is F𝐹Fitalic_F-free if it contains no subgraph isomorphic to F𝐹Fitalic_F. Denote by fF,G⁒(n)subscript𝑓𝐹𝐺𝑛f_{F,G}(n)italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) the maximum mπ‘šmitalic_m such that every n𝑛nitalic_n-vertex G𝐺Gitalic_G-free graph contains an induced F𝐹Fitalic_F-free subgraph on at least mπ‘šmitalic_m vertices. Hence the assertion fF,G⁒(n)<bsubscript𝑓𝐹𝐺𝑛𝑏f_{F,G}(n)<bitalic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) < italic_b means that there exists an n𝑛nitalic_n vertex G𝐺Gitalic_G-free graph H𝐻Hitalic_H such that every vertex subset of H𝐻Hitalic_H of size b𝑏bitalic_b contains a copy of F𝐹Fitalic_F. The case F=Ks𝐹subscript𝐾𝑠F=K_{s}italic_F = italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and G=Kt𝐺subscript𝐾𝑑G=K_{t}italic_G = italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the ErdΕ‘s-RogersΒ [7] function fs,t⁒(n)subscript𝑓𝑠𝑑𝑛f_{s,t}(n)italic_f start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT ( italic_n ). Classical results in Ramsey TheoryΒ [1, 10] give r⁒(3,t)=Θ⁒(t2/log⁑t)π‘Ÿ3π‘‘Ξ˜superscript𝑑2𝑑r(3,t)=\Theta(t^{2}/\log t)italic_r ( 3 , italic_t ) = roman_Θ ( italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / roman_log italic_t ), which shows fK2,K3⁒(n)=Θ⁒(n⁒log⁑n)subscript𝑓subscript𝐾2subscript𝐾3π‘›Ξ˜π‘›π‘›f_{K_{2},K_{3}}(n)=\Theta(\sqrt{n\log n})italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = roman_Θ ( square-root start_ARG italic_n roman_log italic_n end_ARG ). We prove that roughly the same holds for fF,K3⁒(n)subscript𝑓𝐹subscript𝐾3𝑛f_{F,K_{3}}(n)italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) for any triangle-free graph F𝐹Fitalic_F:

Theorem 1.

For any triangle-free graph F𝐹Fitalic_F containing at least one edge,

fF,K3⁒(n)=n12+O⁒(log⁑log⁑nlog⁑n)=fK2,K3⁒(n)1+o⁒(1).subscript𝑓𝐹subscript𝐾3𝑛superscript𝑛12𝑂𝑛𝑛subscript𝑓subscript𝐾2subscript𝐾3superscript𝑛1π‘œ1f_{F,K_{3}}(n)=n^{\frac{1}{2}+O(\sqrt{\frac{\log\log n}{\log n}})}=f_{K_{2},K_% {3}}(n)^{1+o(1)}.italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_O ( square-root start_ARG divide start_ARG roman_log roman_log italic_n end_ARG start_ARG roman_log italic_n end_ARG end_ARG ) end_POSTSUPERSCRIPT = italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) start_POSTSUPERSCRIPT 1 + italic_o ( 1 ) end_POSTSUPERSCRIPT .

Our bound in TheoremΒ 1 is much larger than fK2,K3⁒(n)=Θ⁒(n⁒log⁑n)subscript𝑓subscript𝐾2subscript𝐾3π‘›Ξ˜π‘›π‘›f_{K_{2},K_{3}}(n)=\Theta(\sqrt{n\log n})italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = roman_Θ ( square-root start_ARG italic_n roman_log italic_n end_ARG ), and therefore the following problem seems natural.

Problem 1.

Find a triangle free F𝐹Fitalic_F for which fF,K3⁒(n)/fK2,K3⁒(n)β†’βˆžβ†’subscript𝑓𝐹subscript𝐾3𝑛subscript𝑓subscript𝐾2subscript𝐾3𝑛f_{F,K_{3}}(n)/f_{K_{2},K_{3}}(n)\rightarrow\inftyitalic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) / italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) β†’ ∞.

A large pseudorandom triangle free graph with many edges seems an obvious choice for F𝐹Fitalic_F in ProblemΒ 1. Perhaps the simpler F=Kt,t𝐹subscript𝐾𝑑𝑑F=K_{t,t}italic_F = italic_K start_POSTSUBSCRIPT italic_t , italic_t end_POSTSUBSCRIPT is another example. More generally, for each sβ‰₯3𝑠3s\geq 3italic_s β‰₯ 3, one can ask whether there exists a Kssubscript𝐾𝑠K_{s}italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT-free F𝐹Fitalic_F for which fF,Ks⁒(n)/fKsβˆ’1,Ks⁒(n)β†’βˆžβ†’subscript𝑓𝐹subscript𝐾𝑠𝑛subscript𝑓subscript𝐾𝑠1subscript𝐾𝑠𝑛f_{F,K_{s}}(n)/f_{K_{s-1},K_{s}}(n)\rightarrow\inftyitalic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) / italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_s - 1 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) β†’ ∞.

Unlike the case of triangles, it appears that for sβ‰₯4𝑠4s\geq 4italic_s β‰₯ 4, it is difficult to determine for each Kssubscript𝐾𝑠K_{s}italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT-free graph F𝐹Fitalic_F a constant c=c⁒(F)𝑐𝑐𝐹c=c(F)italic_c = italic_c ( italic_F ) such that fF,Ks⁒(n)=nc+o⁒(1)subscript𝑓𝐹subscript𝐾𝑠𝑛superscriptπ‘›π‘π‘œ1f_{F,K_{s}}(n)=n^{c+o(1)}italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_n start_POSTSUPERSCRIPT italic_c + italic_o ( 1 ) end_POSTSUPERSCRIPT. The second author and MattheusΒ [14] proved fK2,K4⁒(n)=n1/3+o⁒(1)subscript𝑓subscript𝐾2subscript𝐾4𝑛superscript𝑛13π‘œ1f_{K_{2},K_{4}}(n)=n^{1/3+o(1)}italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_n start_POSTSUPERSCRIPT 1 / 3 + italic_o ( 1 ) end_POSTSUPERSCRIPT whereas it is well-known that fK3,K4=n1/2+o⁒(1)subscript𝑓subscript𝐾3subscript𝐾4superscript𝑛12π‘œ1f_{K_{3},K_{4}}=n^{1/2+o(1)}italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_n start_POSTSUPERSCRIPT 1 / 2 + italic_o ( 1 ) end_POSTSUPERSCRIPT. WeΒ [16] recently proved fK3,K4⁒(n)=O⁒(n⁒log⁑n)subscript𝑓subscript𝐾3subscript𝐾4𝑛𝑂𝑛𝑛f_{K_{3},K_{4}}(n)=O(\sqrt{n}\log n)italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_O ( square-root start_ARG italic_n end_ARG roman_log italic_n ) and the proof can be extended to prove that for every K4subscript𝐾4K_{4}italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT-free graph F𝐹Fitalic_F, we have fF,K4⁒(n)=O⁒(n⁒log⁑n)subscript𝑓𝐹subscript𝐾4𝑛𝑂𝑛𝑛f_{F,K_{4}}(n)=O(\sqrt{n}\log n)italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_O ( square-root start_ARG italic_n end_ARG roman_log italic_n ). Perhaps this can be improved for triangle-free F𝐹Fitalic_F as follows.

Problem 2.

Is it true that for every triangle-free graph F𝐹Fitalic_F there exists Ξ΅=Ξ΅F>0πœ€subscriptπœ€πΉ0\varepsilon=\varepsilon_{F}>0italic_Ξ΅ = italic_Ξ΅ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT > 0 such that fF,K4⁒(n)<n1/2βˆ’Ξ΅subscript𝑓𝐹subscript𝐾4𝑛superscript𝑛12πœ€f_{F,K_{4}}(n)<n^{1/2-\varepsilon}italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) < italic_n start_POSTSUPERSCRIPT 1 / 2 - italic_Ξ΅ end_POSTSUPERSCRIPT?

Regardless of whether ProblemΒ 2 has an affirmative answer, one might suspect that there exists a sequence of triangle-free graphs where the exponent tends to 1/2121/21 / 2. We propose the following.

Problem 3.

Prove (or disprove) that fKt,t,K4⁒(n)=n1/2+ot⁒(1)subscript𝑓subscript𝐾𝑑𝑑subscript𝐾4𝑛superscript𝑛12subscriptπ‘œπ‘‘1f_{K_{t,t},K_{4}}(n)=n^{1/2+o_{t}(1)}italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t , italic_t end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_n start_POSTSUPERSCRIPT 1 / 2 + italic_o start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( 1 ) end_POSTSUPERSCRIPT.

The method of proof of TheoremΒ 5 yields fKt,t,K4⁒(n)>n2/5βˆ’ot⁒(1)subscript𝑓subscript𝐾𝑑𝑑subscript𝐾4𝑛superscript𝑛25subscriptπ‘œπ‘‘1f_{K_{t,t},K_{4}}(n)>n^{2/5-o_{t}(1)}italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t , italic_t end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) > italic_n start_POSTSUPERSCRIPT 2 / 5 - italic_o start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( 1 ) end_POSTSUPERSCRIPT.

Our next result shows that for sβ‰₯4𝑠4s\geq 4italic_s β‰₯ 4, we can find substantially larger F𝐹Fitalic_F-free sets in Kssubscript𝐾𝑠K_{s}italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT-free graphs than their conjecturedΒ [15] minimum independence number, which is n1/(sβˆ’1)+o⁒(1)superscript𝑛1𝑠1π‘œ1n^{1/(s-1)+o(1)}italic_n start_POSTSUPERSCRIPT 1 / ( italic_s - 1 ) + italic_o ( 1 ) end_POSTSUPERSCRIPT.

Theorem 2.

Let sβ‰₯4𝑠4s\geq 4italic_s β‰₯ 4 and let F𝐹Fitalic_F be any graph containing a cycle. Then there exists a constant cF>0subscript𝑐𝐹0c_{F}>0italic_c start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT > 0 such that

fF,Ks⁒(n)=Ω⁒(n1sβˆ’1+cF).subscript𝑓𝐹subscript𝐾𝑠𝑛Ωsuperscript𝑛1𝑠1subscript𝑐𝐹f_{F,K_{s}}(n)=\Omega(n^{\frac{1}{s-1}+c_{F}}).italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = roman_Ξ© ( italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s - 1 end_ARG + italic_c start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) .

If F𝐹Fitalic_F is a cycle, then this bound is almost tight for K4subscript𝐾4K_{4}italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT, using the following proposition. Write r⁒(H,t)π‘Ÿπ»π‘‘r(H,t)italic_r ( italic_H , italic_t ) for the ramsey number of H𝐻Hitalic_H versus a clique on t𝑑titalic_t vertices.

Proposition 1.

For any graphs F𝐹Fitalic_F and G𝐺Gitalic_G,

fF,G⁒(r⁒(G,t)βˆ’1)<r⁒(F,t).subscriptπ‘“πΉπΊπ‘ŸπΊπ‘‘1π‘ŸπΉπ‘‘f_{F,G}(r(G,t)-1)<r(F,t).italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_r ( italic_G , italic_t ) - 1 ) < italic_r ( italic_F , italic_t ) .

Indeed, let H𝐻Hitalic_H be a G𝐺Gitalic_G-free graph on r⁒(G,t)βˆ’1π‘ŸπΊπ‘‘1r(G,t)-1italic_r ( italic_G , italic_t ) - 1 vertices with no independent set of size t𝑑titalic_t. Then the maximum F𝐹Fitalic_F-free subset of H𝐻Hitalic_H has size less than m:=r⁒(F,t)assignπ‘šπ‘ŸπΉπ‘‘m:=r(F,t)italic_m := italic_r ( italic_F , italic_t ) for any set of mπ‘šmitalic_m vertices in H𝐻Hitalic_H must contain either a copy of F𝐹Fitalic_F or an independent set of size t𝑑titalic_t.

When F=C2⁒k𝐹subscript𝐢2π‘˜F=C_{2k}italic_F = italic_C start_POSTSUBSCRIPT 2 italic_k end_POSTSUBSCRIPT or F=C2⁒kβˆ’1𝐹subscript𝐢2π‘˜1F=C_{2k-1}italic_F = italic_C start_POSTSUBSCRIPT 2 italic_k - 1 end_POSTSUBSCRIPT we have r⁒(F,t)=O⁒(tk/(kβˆ’1)/(log⁑t)1/(kβˆ’1))π‘ŸπΉπ‘‘π‘‚superscriptπ‘‘π‘˜π‘˜1superscript𝑑1π‘˜1r(F,t)=O(t^{k/(k-1)}/(\log t)^{1/(k-1)})italic_r ( italic_F , italic_t ) = italic_O ( italic_t start_POSTSUPERSCRIPT italic_k / ( italic_k - 1 ) end_POSTSUPERSCRIPT / ( roman_log italic_t ) start_POSTSUPERSCRIPT 1 / ( italic_k - 1 ) end_POSTSUPERSCRIPT ) ([12, 20]). Moreover, recent results ofΒ [14], yield r⁒(K4,t)=Ω⁒(t3/log4⁑t)π‘Ÿsubscript𝐾4𝑑Ωsuperscript𝑑3superscript4𝑑r(K_{4},t)=\Omega(t^{3}/\log^{4}t)italic_r ( italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT , italic_t ) = roman_Ξ© ( italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT / roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_t ). Putting these together in PropositionΒ 1 yields

fF,K4⁒(n)=O⁒(nk3⁒kβˆ’3⁒(log⁑n)4⁒kβˆ’33⁒kβˆ’3)Β forΒ Β F∈{C2⁒k,C2⁒kβˆ’1}.subscript𝑓𝐹subscript𝐾4𝑛𝑂superscriptπ‘›π‘˜3π‘˜3superscript𝑛4π‘˜33π‘˜3Β forΒ Β F∈{C2⁒k,C2⁒kβˆ’1}f_{F,K_{4}}(n)=O(n^{\frac{k}{3k-3}}(\log n)^{\frac{4k-3}{3k-3}})\qquad\hbox{ % for }\qquad\hbox{ $F\in\{C_{2k},C_{2k-1}\}$}.italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_O ( italic_n start_POSTSUPERSCRIPT divide start_ARG italic_k end_ARG start_ARG 3 italic_k - 3 end_ARG end_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT divide start_ARG 4 italic_k - 3 end_ARG start_ARG 3 italic_k - 3 end_ARG end_POSTSUPERSCRIPT ) for italic_F ∈ { italic_C start_POSTSUBSCRIPT 2 italic_k end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT 2 italic_k - 1 end_POSTSUBSCRIPT } . (1)

The constant in TheoremΒ 2 satisfies cF=Θ⁒(1/k)subscriptπ‘πΉΞ˜1π‘˜c_{F}=\Theta(1/k)italic_c start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT = roman_Θ ( 1 / italic_k ) for F=Ck𝐹subscriptπΆπ‘˜F=C_{k}italic_F = italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and with (1) this gives

fCk,K4⁒(n)=fK2,K4⁒(n)1+Θ⁒(1k)+o⁒(1)=n13+Θ⁒(1k)+o⁒(1).subscript𝑓subscriptπΆπ‘˜subscript𝐾4𝑛subscript𝑓subscript𝐾2subscript𝐾4superscript𝑛1Θ1π‘˜π‘œ1superscript𝑛13Θ1π‘˜π‘œ1f_{C_{k},K_{4}}(n)=f_{K_{2},K_{4}}(n)^{1+\Theta(\frac{1}{k})+o(1)}=n^{\frac{1}% {3}+\Theta(\frac{1}{k})+o(1)}.italic_f start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) start_POSTSUPERSCRIPT 1 + roman_Θ ( divide start_ARG 1 end_ARG start_ARG italic_k end_ARG ) + italic_o ( 1 ) end_POSTSUPERSCRIPT = italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 3 end_ARG + roman_Θ ( divide start_ARG 1 end_ARG start_ARG italic_k end_ARG ) + italic_o ( 1 ) end_POSTSUPERSCRIPT . (2)

This shows that there are graphs F𝐹Fitalic_F for which fF,K4⁒(n)subscript𝑓𝐹subscript𝐾4𝑛f_{F,K_{4}}(n)italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) does not have the same exponent as fK2,K4⁒(n)subscript𝑓subscript𝐾2subscript𝐾4𝑛f_{K_{2},K_{4}}(n)italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) or fK3,K4⁒(n)subscript𝑓subscript𝐾3subscript𝐾4𝑛f_{K_{3},K_{4}}(n)italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ), in contrast to the case of fF,K3⁒(n)=fK2,K3⁒(n)1+o⁒(1)subscript𝑓𝐹subscript𝐾3𝑛subscript𝑓subscript𝐾2subscript𝐾3superscript𝑛1π‘œ1f_{F,K_{3}}(n)=f_{K_{2},K_{3}}(n)^{1+o(1)}italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) start_POSTSUPERSCRIPT 1 + italic_o ( 1 ) end_POSTSUPERSCRIPT from Theorem 1. Using the graphs constructed inΒ [14], and following the analysis along the lines of Janzer and SudakovΒ [9], Balogh et al.Β [2] improved the upper bound in (1) slightly in the case of even cycles, by showing

fC2⁒k,K4⁒(n)=O⁒(nk3⁒kβˆ’2⁒(log⁑n)6⁒k3⁒kβˆ’2).subscript𝑓subscript𝐢2π‘˜subscript𝐾4𝑛𝑂superscriptπ‘›π‘˜3π‘˜2superscript𝑛6π‘˜3π‘˜2f_{C_{2k},K_{4}}(n)=O(n^{\frac{k}{3k-2}}(\log n)^{\frac{6k}{3k-2}}).italic_f start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 2 italic_k end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_O ( italic_n start_POSTSUPERSCRIPT divide start_ARG italic_k end_ARG start_ARG 3 italic_k - 2 end_ARG end_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT divide start_ARG 6 italic_k end_ARG start_ARG 3 italic_k - 2 end_ARG end_POSTSUPERSCRIPT ) .

They also showed for complete multipartite graphs

fKs1,…,sr,Kr+2⁒(n)=O⁒(n2⁒sβˆ’34⁒sβˆ’5⁒(log⁑n)3),subscript𝑓subscript𝐾subscript𝑠1…subscriptπ‘ π‘ŸsubscriptπΎπ‘Ÿ2𝑛𝑂superscript𝑛2𝑠34𝑠5superscript𝑛3f_{K_{s_{1},...,s_{r}},K_{r+2}}(n)=O(n^{\frac{2s-3}{4s-5}}(\log n)^{3}),italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_s start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT italic_r + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_O ( italic_n start_POSTSUPERSCRIPT divide start_ARG 2 italic_s - 3 end_ARG start_ARG 4 italic_s - 5 end_ARG end_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) ,

where s=βˆ‘si𝑠subscript𝑠𝑖s=\sum s_{i}italic_s = βˆ‘ italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. In the special case of 4-cycles this gives fC4,K4⁒(n)=O⁒(n5/11)subscript𝑓subscript𝐢4subscript𝐾4𝑛𝑂superscript𝑛511f_{C_{4},K_{4}}(n)=O(n^{5/11})italic_f start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_O ( italic_n start_POSTSUPERSCRIPT 5 / 11 end_POSTSUPERSCRIPT ).

We now address general ErdΕ‘s-Rogers functions fF,G⁒(n)subscript𝑓𝐹𝐺𝑛f_{F,G}(n)italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ). For a given G𝐺Gitalic_G, the first natural question is when fF,G⁒(n)subscript𝑓𝐹𝐺𝑛f_{F,G}(n)italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) can be n1βˆ’o⁒(1)superscript𝑛1π‘œ1n^{1-o(1)}italic_n start_POSTSUPERSCRIPT 1 - italic_o ( 1 ) end_POSTSUPERSCRIPT as |V⁒(F)|β†’βˆžβ†’π‘‰πΉ|V(F)|\rightarrow\infty| italic_V ( italic_F ) | β†’ ∞. A blowup of a graph F𝐹Fitalic_F is obtained by replacing each vertex v𝑣vitalic_v of F𝐹Fitalic_F with an independent set Ivsubscript𝐼𝑣I_{v}italic_I start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT and adding all edges between Iusubscript𝐼𝑒I_{u}italic_I start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT and Ivsubscript𝐼𝑣I_{v}italic_I start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT whenever {u,v}∈E⁒(F)𝑒𝑣𝐸𝐹\{u,v\}\in E(F){ italic_u , italic_v } ∈ italic_E ( italic_F ). The graph F𝐹Fitalic_F is a homomorphic image of G𝐺Gitalic_G if and only if some blowup of F𝐹Fitalic_F contains G𝐺Gitalic_G. Consequently, we say that F𝐹Fitalic_F is hom⁒(G)hom𝐺\mbox{hom}(G)hom ( italic_G )-free if no blowup of F𝐹Fitalic_F contains G𝐺Gitalic_G. For instance, if G𝐺Gitalic_G is bipartite and F𝐹Fitalic_F contains at least one edge, then blowups of F𝐹Fitalic_F contain arbitrarily large complete bipartite graphs, and therefore F𝐹Fitalic_F is not hom⁒(G)hom𝐺\mbox{hom}(G)hom ( italic_G )-free. This condition turns out to determine when ErdΕ‘s-Rogers functions fF,G⁒(n)subscript𝑓𝐹𝐺𝑛f_{F,G}(n)italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) can approach n1βˆ’o⁒(1)superscript𝑛1π‘œ1n^{1-o(1)}italic_n start_POSTSUPERSCRIPT 1 - italic_o ( 1 ) end_POSTSUPERSCRIPT as |V⁒(F)|β†’βˆžβ†’π‘‰πΉ|V(F)|\rightarrow\infty| italic_V ( italic_F ) | β†’ ∞:

Theorem 3.

For every graph G𝐺Gitalic_G, there exists Ξ΅G>0subscriptπœ€πΊ0\varepsilon_{G}>0italic_Ξ΅ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT > 0 such that if F𝐹Fitalic_F is any hom⁒(G)hom𝐺\mbox{hom}(G)hom ( italic_G )-free graph containing at least one edge, then

fF,G⁒(n)=O⁒(n1βˆ’Ξ΅G).subscript𝑓𝐹𝐺𝑛𝑂superscript𝑛1subscriptπœ€πΊf_{F,G}(n)=O(n^{1-\varepsilon_{G}}).italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) = italic_O ( italic_n start_POSTSUPERSCRIPT 1 - italic_Ξ΅ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) .

On the other hand, if G𝐺Gitalic_G is not a clique, then for any Ξ΅>0πœ€0\varepsilon>0italic_Ξ΅ > 0 there exists a G𝐺Gitalic_G-free graph F𝐹Fitalic_F such that fF,G⁒(n)=Ω⁒(n1βˆ’Ξ΅)subscript𝑓𝐹𝐺𝑛Ωsuperscript𝑛1πœ€f_{F,G}(n)=\Omega(n^{1-\varepsilon})italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) = roman_Ξ© ( italic_n start_POSTSUPERSCRIPT 1 - italic_Ξ΅ end_POSTSUPERSCRIPT ).

If G𝐺Gitalic_G is a clique, then every G𝐺Gitalic_G-free graph is also hom⁒(G)hom𝐺\mbox{hom}(G)hom ( italic_G )-free, hence the first part of TheoremΒ 3 applies to all G𝐺Gitalic_G-free graphs F𝐹Fitalic_F when G𝐺Gitalic_G is a clique. As mentioned earlier, in the case G=K4𝐺subscript𝐾4G=K_{4}italic_G = italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT, it turns out fF,G⁒(n)=O⁒(n1/2β‹…log⁑n)subscript𝑓𝐹𝐺𝑛𝑂⋅superscript𝑛12𝑛f_{F,G}(n)=O(n^{1/2}\cdot\log n)italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) = italic_O ( italic_n start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT β‹… roman_log italic_n ) due to our results inΒ [16], so we may take Ξ΅K4β‰₯1/2subscriptπœ€subscript𝐾412\varepsilon_{K_{4}}\geq 1/2italic_Ξ΅ start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT β‰₯ 1 / 2. It appears to be difficult to determine the largest possible value of Ξ΅Gsubscriptπœ€πΊ\varepsilon_{G}italic_Ξ΅ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT for each graph G𝐺Gitalic_G in Theorem 3.

2 Proof of Theorem 1

Ajtai, KomlΓ³s and SzemerΓ©diΒ [1] and ShearerΒ [17] proved that r⁒(3,t)=O⁒(t2/log⁑t)π‘Ÿ3𝑑𝑂superscript𝑑2𝑑r(3,t)=O(t^{2}/\log t)italic_r ( 3 , italic_t ) = italic_O ( italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / roman_log italic_t ). Using the random triangle-free process, KimΒ [10] (see also Fiz Pontiveros, Griffths and MorrisΒ [8] and Bohman and KeevashΒ [4]) showed r⁒(3,t)=Ω⁒(t2/log⁑t)π‘Ÿ3𝑑Ωsuperscript𝑑2𝑑r(3,t)=\Omega(t^{2}/\log t)italic_r ( 3 , italic_t ) = roman_Ξ© ( italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / roman_log italic_t ), thereby determining the order of magnitude of r⁒(3,t)π‘Ÿ3𝑑r(3,t)italic_r ( 3 , italic_t ). Consequently, for any non-empty graph F𝐹Fitalic_F,

fF,K3⁒(n)β‰₯fK2,K3⁒(n)=Θ⁒(n⁒log⁑n).subscript𝑓𝐹subscript𝐾3𝑛subscript𝑓subscript𝐾2subscript𝐾3π‘›Ξ˜π‘›π‘›f_{F,K_{3}}(n)\geq f_{K_{2},K_{3}}(n)=\Theta(\sqrt{n\log n}).italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) β‰₯ italic_f start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = roman_Θ ( square-root start_ARG italic_n roman_log italic_n end_ARG ) .

To prove Theorem 1 we employ a construction of ErdΕ‘s, Frankl and RΓΆdlΒ [6] of a linear triangle-free R𝑅Ritalic_R-uniform N𝑁Nitalic_N-vertex hypergraph. In the appendix, we give present a minor modification of their construction which gives a bound that is better than the bound fromΒ [6] when R>log⁑N𝑅𝑁R>\log Nitalic_R > roman_log italic_N; they prove a lower bound N2/eO⁒(log⁑R⁒log⁑N)superscript𝑁2superscript𝑒𝑂𝑅𝑁N^{2}/e^{O(\log R\sqrt{\log N})}italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_e start_POSTSUPERSCRIPT italic_O ( roman_log italic_R square-root start_ARG roman_log italic_N end_ARG ) end_POSTSUPERSCRIPT while our bound is N2/eO⁒(log⁑R⁒log⁑N)superscript𝑁2superscript𝑒𝑂𝑅𝑁N^{2}/e^{O(\sqrt{\log R\log N})}italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_e start_POSTSUPERSCRIPT italic_O ( square-root start_ARG roman_log italic_R roman_log italic_N end_ARG ) end_POSTSUPERSCRIPT.

Theorem 4.

(Proposition A in Appendix) For any R,Nβ‰₯3𝑅𝑁3R,N\geq 3italic_R , italic_N β‰₯ 3 and Nβ‰₯Rβ‰₯log⁑N𝑁𝑅𝑁N\geq R\geq\log Nitalic_N β‰₯ italic_R β‰₯ roman_log italic_N, there exists an N𝑁Nitalic_N-vertex R𝑅Ritalic_R-uniform hypergraph H𝐻Hitalic_H with the following properties:

(i)i\mathrm{(i)}( roman_i ) |E⁒(H)|β‰₯N2/R8⁒logR⁑N𝐸𝐻superscript𝑁2superscript𝑅8subscript𝑅𝑁|E(H)|\geq N^{2}/R^{8\sqrt{\log_{R}N}}| italic_E ( italic_H ) | β‰₯ italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_R start_POSTSUPERSCRIPT 8 square-root start_ARG roman_log start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT italic_N end_ARG end_POSTSUPERSCRIPT
(ii)ii\mathrm{(ii)}( roman_ii ) H𝐻Hitalic_H is linear, that is, for any distinct edges e,f∈H𝑒𝑓𝐻e,f\in Hitalic_e , italic_f ∈ italic_H, |e∩f|≀1𝑒𝑓1|e\cap f|\leq 1| italic_e ∩ italic_f | ≀ 1.
(iii)iii\mathrm{(iii)}( roman_iii ) H𝐻Hitalic_H is triangle-free, that is, for any three distinct edges e,f,g∈H𝑒𝑓𝑔𝐻e,f,g\in Hitalic_e , italic_f , italic_g ∈ italic_H, if |e∩f|=|f∩g|=|g∩e|=1𝑒𝑓𝑓𝑔𝑔𝑒1|e\cap f|=|f\cap g|=|g\cap e|=1| italic_e ∩ italic_f | = | italic_f ∩ italic_g | = | italic_g ∩ italic_e | = 1 then |e∩f∩g|=1𝑒𝑓𝑔1|e\cap f\cap g|=1| italic_e ∩ italic_f ∩ italic_g | = 1.

Proof of TheoremΒ 1. We are to prove that

fF,K3⁒(n)=n12+O⁒(log⁑log⁑nlog⁑n).subscript𝑓𝐹subscript𝐾3𝑛superscript𝑛12𝑂𝑛𝑛f_{F,K_{3}}(n)=n^{\frac{1}{2}+O(\sqrt{\frac{\log\log n}{\log n}})}.italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_O ( square-root start_ARG divide start_ARG roman_log roman_log italic_n end_ARG start_ARG roman_log italic_n end_ARG end_ARG ) end_POSTSUPERSCRIPT .

Let t=|V⁒(F)|𝑑𝑉𝐹t=|V(F)|italic_t = | italic_V ( italic_F ) |. We apply TheoremΒ 4 with R=⌈3⁒t⁒log⁑t⁒log⁑NβŒ‰π‘…3𝑑𝑑𝑁R=\lceil 3t\log t\log N\rceilitalic_R = ⌈ 3 italic_t roman_log italic_t roman_log italic_N βŒ‰, where t=|V⁒(F)|𝑑𝑉𝐹t=|V(F)|italic_t = | italic_V ( italic_F ) |. Then (i) yields

|E⁒(H)|β‰₯N2R8⁒logR⁑N=N2βˆ’O⁒(log⁑log⁑Nlog⁑N).𝐸𝐻superscript𝑁2superscript𝑅8subscript𝑅𝑁superscript𝑁2𝑂𝑁𝑁|E(H)|\geq\frac{N^{2}}{R^{8\sqrt{\log_{R}N}}}=N^{2-O(\sqrt{\frac{\log\log N}{% \log N}})}.| italic_E ( italic_H ) | β‰₯ divide start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUPERSCRIPT 8 square-root start_ARG roman_log start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT italic_N end_ARG end_POSTSUPERSCRIPT end_ARG = italic_N start_POSTSUPERSCRIPT 2 - italic_O ( square-root start_ARG divide start_ARG roman_log roman_log italic_N end_ARG start_ARG roman_log italic_N end_ARG end_ARG ) end_POSTSUPERSCRIPT . (3)

Let G𝐺Gitalic_G be the graph whose vertex set is E⁒(H)𝐸𝐻E(H)italic_E ( italic_H ) and where E⁒(G)={e,f∈E⁒(H):e∩fβ‰ βˆ…}𝐸𝐺conditional-set𝑒𝑓𝐸𝐻𝑒𝑓E(G)=\{e,f\in E(H):e\cap f\neq\emptyset\}italic_E ( italic_G ) = { italic_e , italic_f ∈ italic_E ( italic_H ) : italic_e ∩ italic_f β‰  βˆ… }. For each vertex v∈V⁒(H)𝑣𝑉𝐻v\in V(H)italic_v ∈ italic_V ( italic_H ), the set Kv={e∈E⁒(H):v∈e}subscript𝐾𝑣conditional-set𝑒𝐸𝐻𝑣𝑒K_{v}=\{e\in E(H):v\in e\}italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT = { italic_e ∈ italic_E ( italic_H ) : italic_v ∈ italic_e } induces a clique in G𝐺Gitalic_G. If for some distinct v,w∈V⁒(H)𝑣𝑀𝑉𝐻v,w\in V(H)italic_v , italic_w ∈ italic_V ( italic_H ) there exist distinct e,f∈Kv∩Kw𝑒𝑓subscript𝐾𝑣subscript𝐾𝑀e,f\in K_{v}\cap K_{w}italic_e , italic_f ∈ italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ∩ italic_K start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT, then by definition v,w∈e∩f𝑣𝑀𝑒𝑓v,w\in e\cap fitalic_v , italic_w ∈ italic_e ∩ italic_f, which contradicts that H𝐻Hitalic_H is linear. Therefore |V(Kv)∩|V(Kw)|≀1|V(K_{v})\cap|V(K_{w})|\leq 1| italic_V ( italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ∩ | italic_V ( italic_K start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) | ≀ 1 for all distinct v,w∈V⁒(H)𝑣𝑀𝑉𝐻v,w\in V(H)italic_v , italic_w ∈ italic_V ( italic_H ), and the cliques Kvsubscript𝐾𝑣K_{v}italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT are edge-disjoint in G𝐺Gitalic_G. Similarly, since H𝐻Hitalic_H is triangle-free, every triangle in G𝐺Gitalic_G is contained in a clique Kvsubscript𝐾𝑣K_{v}italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT for some v∈V⁒(H)𝑣𝑉𝐻v\in V(H)italic_v ∈ italic_V ( italic_H ).

Independently for v∈V⁒(H)𝑣𝑉𝐻v\in V(H)italic_v ∈ italic_V ( italic_H ), let Ο‡v:V⁒(Kv)β†’V⁒(F):subscriptπœ’π‘£β†’π‘‰subscript𝐾𝑣𝑉𝐹\chi_{v}:V(K_{v})\rightarrow V(F)italic_Ο‡ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT : italic_V ( italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) β†’ italic_V ( italic_F ) be a random coloring of Kvsubscript𝐾𝑣K_{v}italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT. Next, we remove all edges {x,y}π‘₯𝑦\{x,y\}{ italic_x , italic_y } of G⁒[Kv]𝐺delimited-[]subscript𝐾𝑣G[K_{v}]italic_G [ italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ] such that Ο‡v⁒(x)=Ο‡v⁒(y)subscriptπœ’π‘£π‘₯subscriptπœ’π‘£π‘¦\chi_{v}(x)=\chi_{v}(y)italic_Ο‡ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) = italic_Ο‡ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_y ) or Ο‡v⁒(x)⁒χv⁒(y)βˆ‰E⁒(F)subscriptπœ’π‘£π‘₯subscriptπœ’π‘£π‘¦πΈπΉ\chi_{v}(x)\chi_{v}(y)\not\in E(F)italic_Ο‡ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_x ) italic_Ο‡ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_y ) βˆ‰ italic_E ( italic_F ). In other words, we have placed a blowup of a copy of F𝐹Fitalic_F in each set Kvsubscript𝐾𝑣K_{v}italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT.

Since F𝐹Fitalic_F contains no triangle, the resulting graph Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is triangle-free. We now prove that Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT has no F𝐹Fitalic_F-free induced subgraph with at least N𝑁Nitalic_N vertices. To see this, fix a set Z𝑍Zitalic_Z of N𝑁Nitalic_N vertices of Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT. The probability that Z𝑍Zitalic_Z is an F𝐹Fitalic_F-free set of Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is

ℙ⁒(Z)β‰€βˆv∈V⁒(H)tβ‹…(1βˆ’1t)|Kv∩Z|.ℙ𝑍subscriptproduct𝑣𝑉𝐻⋅𝑑superscript11𝑑subscript𝐾𝑣𝑍{\mathbb{P}}(Z)\leq\prod_{v\in V(H)}t\cdot\Bigl{(}1-\frac{1}{t}\Bigr{)}^{|K_{v% }\cap Z|}.blackboard_P ( italic_Z ) ≀ ∏ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_H ) end_POSTSUBSCRIPT italic_t β‹… ( 1 - divide start_ARG 1 end_ARG start_ARG italic_t end_ARG ) start_POSTSUPERSCRIPT | italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ∩ italic_Z | end_POSTSUPERSCRIPT .

Since H𝐻Hitalic_H is R𝑅Ritalic_R-uniform,

βˆ‘v∈V⁒(H)|Kv∩Z|=βˆ‘e∈Z|e|=R⁒|Z|=R⁒N.subscript𝑣𝑉𝐻subscript𝐾𝑣𝑍subscript𝑒𝑍𝑒𝑅𝑍𝑅𝑁\sum_{v\in V(H)}|K_{v}\cap Z|=\sum_{e\in Z}|e|=R|Z|=RN.βˆ‘ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_H ) end_POSTSUBSCRIPT | italic_K start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ∩ italic_Z | = βˆ‘ start_POSTSUBSCRIPT italic_e ∈ italic_Z end_POSTSUBSCRIPT | italic_e | = italic_R | italic_Z | = italic_R italic_N .

Using (1βˆ’x)y≀eβˆ’x⁒ysuperscript1π‘₯𝑦superscript𝑒π‘₯𝑦(1-x)^{y}\leq e^{-xy}( 1 - italic_x ) start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ≀ italic_e start_POSTSUPERSCRIPT - italic_x italic_y end_POSTSUPERSCRIPT for 0≀x≀10π‘₯10\leq x\leq 10 ≀ italic_x ≀ 1 and yβ‰₯1𝑦1y\geq 1italic_y β‰₯ 1,

ℙ⁒(Z)≀tN⁒(1βˆ’1t)R⁒N≀eN⁒log⁑tβˆ’R⁒N/t<Nβˆ’2⁒N.ℙ𝑍superscript𝑑𝑁superscript11𝑑𝑅𝑁superscript𝑒𝑁𝑑𝑅𝑁𝑑superscript𝑁2𝑁{\mathbb{P}}(Z)\leq t^{N}\Bigl{(}1-\frac{1}{t}\Bigr{)}^{RN}\leq e^{N\log t-RN/% t}<N^{-2N}.blackboard_P ( italic_Z ) ≀ italic_t start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( 1 - divide start_ARG 1 end_ARG start_ARG italic_t end_ARG ) start_POSTSUPERSCRIPT italic_R italic_N end_POSTSUPERSCRIPT ≀ italic_e start_POSTSUPERSCRIPT italic_N roman_log italic_t - italic_R italic_N / italic_t end_POSTSUPERSCRIPT < italic_N start_POSTSUPERSCRIPT - 2 italic_N end_POSTSUPERSCRIPT .

The number of sets of size N𝑁Nitalic_N in Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is no more than

(N2N)≀N2⁒NN!.binomialsuperscript𝑁2𝑁superscript𝑁2𝑁𝑁{N^{2}\choose N}\leq\frac{N^{2N}}{N!}.( binomial start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_N end_ARG ) ≀ divide start_ARG italic_N start_POSTSUPERSCRIPT 2 italic_N end_POSTSUPERSCRIPT end_ARG start_ARG italic_N ! end_ARG .

Therefore the expected number of F𝐹Fitalic_F-free sets Z𝑍Zitalic_Z of size N𝑁Nitalic_N in Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is less than 1/N!1𝑁1/N!1 / italic_N !. We may therefore select Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT so as to contain no F𝐹Fitalic_F-free subgraph with at least N𝑁Nitalic_N vertices. Since Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is triangle-free, and n:=|V⁒(Gβˆ—)|=|E⁒(H)|assign𝑛𝑉superscript𝐺𝐸𝐻n:=|V(G^{*})|=|E(H)|italic_n := | italic_V ( italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT ) | = | italic_E ( italic_H ) |, the bound (3) gives

fF,K3⁒(n)<N=n12+O⁒(log⁑log⁑nlog⁑n).subscript𝑓𝐹subscript𝐾3𝑛𝑁superscript𝑛12𝑂𝑛𝑛f_{F,K_{3}}(n)<N=n^{\frac{1}{2}+O(\sqrt{\frac{\log\log n}{\log n}})}.italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) < italic_N = italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_O ( square-root start_ARG divide start_ARG roman_log roman_log italic_n end_ARG start_ARG roman_log italic_n end_ARG end_ARG ) end_POSTSUPERSCRIPT .

This proves the theorem. ∎

3 Proof of Theorem 2: Large CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT-free subsets

To prove Theorem 2, it is sufficient to prove the following theorem:

Theorem 5.

For any graph F𝐹Fitalic_F containing a cycle CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, there exists Ο΅k>1/100⁒ksubscriptitalic-Ο΅π‘˜1100π‘˜\epsilon_{k}>1/100kitalic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT > 1 / 100 italic_k such that

fF,K4⁒(n)=Ω⁒(n13+Ο΅k).subscript𝑓𝐹subscript𝐾4𝑛Ωsuperscript𝑛13subscriptitalic-Ο΅π‘˜f_{F,K_{4}}(n)=\Omega(n^{\frac{1}{3}+\epsilon_{k}}).italic_f start_POSTSUBSCRIPT italic_F , italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) = roman_Ξ© ( italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 3 end_ARG + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) .

To see that this implies Theorem 2, let H𝐻Hitalic_H be a Kssubscript𝐾𝑠K_{s}italic_K start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT-free graph where sβ‰₯5𝑠5s\geq 5italic_s β‰₯ 5. If H𝐻Hitalic_H has maximum degree d𝑑ditalic_d, then by TurΓ‘n’s Theorem, H𝐻Hitalic_H has an independent set of size at least n/(d+1)𝑛𝑑1n/(d+1)italic_n / ( italic_d + 1 ), and the neighborhood of a vertex of degree d𝑑ditalic_d induces a Ksβˆ’1subscript𝐾𝑠1K_{s-1}italic_K start_POSTSUBSCRIPT italic_s - 1 end_POSTSUBSCRIPT-free subgraph. By induction, setting Ξ±k⁒(4)=1/3+Ο΅ksubscriptπ›Όπ‘˜413subscriptitalic-Ο΅π‘˜\alpha_{k}(4)=1/3+\epsilon_{k}italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( 4 ) = 1 / 3 + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, for sβ‰₯5𝑠5s\geq 5italic_s β‰₯ 5, there exists Ξ±=Ξ±k⁒(sβˆ’1)>1/(sβˆ’2)𝛼subscriptπ›Όπ‘˜π‘ 11𝑠2\alpha=\alpha_{k}(s-1)>1/(s-2)italic_Ξ± = italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s - 1 ) > 1 / ( italic_s - 2 ) such that this Ksβˆ’1subscript𝐾𝑠1K_{s-1}italic_K start_POSTSUBSCRIPT italic_s - 1 end_POSTSUBSCRIPT-free subgraph has an F𝐹Fitalic_F-free subgraph with Ω⁒(dΞ±k⁒(sβˆ’1))Ξ©superscript𝑑subscriptπ›Όπ‘˜π‘ 1\Omega(d^{\alpha_{k}(s-1)})roman_Ξ© ( italic_d start_POSTSUPERSCRIPT italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s - 1 ) end_POSTSUPERSCRIPT ) vertices. Therefore we have an F𝐹Fitalic_F-free subgraph of size at least

Ω⁒(max⁑{dΞ±k⁒(sβˆ’1),nd+1}).Ξ©superscript𝑑subscriptπ›Όπ‘˜π‘ 1𝑛𝑑1\Omega(\max\Bigl{\{}d^{\alpha_{k}(s-1)},\frac{n}{d+1}\Bigr{\}}).roman_Ξ© ( roman_max { italic_d start_POSTSUPERSCRIPT italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s - 1 ) end_POSTSUPERSCRIPT , divide start_ARG italic_n end_ARG start_ARG italic_d + 1 end_ARG } ) .

Setting

Ξ±k⁒(s)=1βˆ’11+Ξ±k⁒(sβˆ’1),subscriptπ›Όπ‘˜π‘ 111subscriptπ›Όπ‘˜π‘ 1\alpha_{k}(s)=1-\frac{1}{1+\alpha_{k}(s-1)},italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s ) = 1 - divide start_ARG 1 end_ARG start_ARG 1 + italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s - 1 ) end_ARG ,

since Ξ±k⁒(4)>1/3subscriptπ›Όπ‘˜413\alpha_{k}(4)>1/3italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( 4 ) > 1 / 3 for all kβ‰₯3π‘˜3k\geq 3italic_k β‰₯ 3, by induction we have

Ξ±k⁒(s)>1βˆ’11+1sβˆ’2=1sβˆ’1subscriptπ›Όπ‘˜π‘ 1111𝑠21𝑠1\alpha_{k}(s)>1-\frac{1}{1+\frac{1}{s-2}}=\frac{1}{s-1}italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s ) > 1 - divide start_ARG 1 end_ARG start_ARG 1 + divide start_ARG 1 end_ARG start_ARG italic_s - 2 end_ARG end_ARG = divide start_ARG 1 end_ARG start_ARG italic_s - 1 end_ARG

as required. Moreover, if Ξ±k⁒(sβˆ’1)β‰₯1/(sβˆ’2)+Ο΅subscriptπ›Όπ‘˜π‘ 11𝑠2italic-Ο΅\alpha_{k}(s-1)\geq 1/(s-2)+\epsilonitalic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s - 1 ) β‰₯ 1 / ( italic_s - 2 ) + italic_Ο΅ where ϡ≀1italic-Ο΅1\epsilon\leq 1italic_Ο΅ ≀ 1, the calculation above yields

Ξ±k⁒(s)β‰₯1βˆ’sβˆ’2sβˆ’1+ϡ⁒(sβˆ’2)subscriptπ›Όπ‘˜π‘ 1𝑠2𝑠1italic-ϡ𝑠2\displaystyle\alpha_{k}(s)\geq 1-\frac{s-2}{s-1+\epsilon(s-2)}italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s ) β‰₯ 1 - divide start_ARG italic_s - 2 end_ARG start_ARG italic_s - 1 + italic_Ο΅ ( italic_s - 2 ) end_ARG =1sβˆ’1+ϡ⁒((sβˆ’3)+1sβˆ’1sβˆ’1+ϡ⁒(sβˆ’2))absent1𝑠1italic-ϡ𝑠31𝑠1𝑠1italic-ϡ𝑠2\displaystyle=\frac{1}{s-1}+\epsilon\left(\frac{(s-3)+\frac{1}{s-1}}{s-1+% \epsilon(s-2)}\right)= divide start_ARG 1 end_ARG start_ARG italic_s - 1 end_ARG + italic_Ο΅ ( divide start_ARG ( italic_s - 3 ) + divide start_ARG 1 end_ARG start_ARG italic_s - 1 end_ARG end_ARG start_ARG italic_s - 1 + italic_Ο΅ ( italic_s - 2 ) end_ARG )
>1sβˆ’1+ϡ⁒(sβˆ’32⁒(sβˆ’1))absent1𝑠1italic-ϡ𝑠32𝑠1\displaystyle>\frac{1}{s-1}+\epsilon\left(\frac{s-3}{2(s-1)}\right)> divide start_ARG 1 end_ARG start_ARG italic_s - 1 end_ARG + italic_Ο΅ ( divide start_ARG italic_s - 3 end_ARG start_ARG 2 ( italic_s - 1 ) end_ARG )
β‰₯1sβˆ’1+Ο΅4.absent1𝑠1italic-Ο΅4\displaystyle\geq\frac{1}{s-1}+\frac{\epsilon}{4}.β‰₯ divide start_ARG 1 end_ARG start_ARG italic_s - 1 end_ARG + divide start_ARG italic_Ο΅ end_ARG start_ARG 4 end_ARG .

With Ξ±k⁒(4)>1/3+1/100⁒ksubscriptπ›Όπ‘˜4131100π‘˜\alpha_{k}(4)>1/3+1/100kitalic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( 4 ) > 1 / 3 + 1 / 100 italic_k, this gives Ξ±k⁒(s)=1/(sβˆ’1)+Ξ©s⁒(1/k)subscriptπ›Όπ‘˜π‘ 1𝑠1subscriptΩ𝑠1π‘˜\alpha_{k}(s)=1/(s-1)+\Omega_{s}(1/k)italic_Ξ± start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_s ) = 1 / ( italic_s - 1 ) + roman_Ξ© start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( 1 / italic_k ) as kβ†’βˆžβ†’π‘˜k\rightarrow\inftyitalic_k β†’ ∞.


We will prove TheoremΒ 5 as follows: a given K4subscript𝐾4K_{4}italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT-free graph H𝐻Hitalic_H either has few kπ‘˜kitalic_k-cycles going through every vertex or has a vertex that lies in many kπ‘˜kitalic_k-cycles. In the former case, we apply standard results about hypergraph independent sets (LemmaΒ 3) to obtain a large CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT-free subset. In the latter case, we show that H𝐻Hitalic_H contains a dense bipartite graph and then use the dependent random choice technique to extract from this a large independent set in one of the parts. These assertions are stated in the next three lemmas.

For sets X,Yπ‘‹π‘ŒX,Yitalic_X , italic_Y of vertices in a graph G𝐺Gitalic_G, let e⁒(X,Y)π‘’π‘‹π‘Œe(X,Y)italic_e ( italic_X , italic_Y ) denote the number of edges {x,y}∈E⁒(G)π‘₯𝑦𝐸𝐺\{x,y\}\in E(G){ italic_x , italic_y } ∈ italic_E ( italic_G ) such that x∈Xπ‘₯𝑋x\in Xitalic_x ∈ italic_X and y∈Yπ‘¦π‘Œy\in Yitalic_y ∈ italic_Y.

Lemma 1.

Let G𝐺Gitalic_G be a graph of maximum degree d𝑑ditalic_d, and let Ξ΄>0𝛿0\delta>0italic_Ξ΄ > 0. Suppose the number of cycles of length kπ‘˜kitalic_k containing a vertex v0∈V⁒(G)subscript𝑣0𝑉𝐺v_{0}\in V(G)italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_V ( italic_G ) is at least δ⁒dkβˆ’1𝛿superscriptπ‘‘π‘˜1\delta d^{k-1}italic_Ξ΄ italic_d start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT. Then there exist sets X,YβŠ†V⁒(G)π‘‹π‘Œπ‘‰πΊX,Y\subseteq V(G)italic_X , italic_Y βŠ† italic_V ( italic_G ) such that e⁒(X,Y)β‰₯δ⁒|X|⁒|Y|/(2⁒log2⁑d)kπ‘’π‘‹π‘Œπ›Ώπ‘‹π‘Œsuperscript2subscript2π‘‘π‘˜e(X,Y)\geq\delta|X||Y|/(2\log_{2}d)^{k}italic_e ( italic_X , italic_Y ) β‰₯ italic_Ξ΄ | italic_X | | italic_Y | / ( 2 roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT and |X|,|Y|β‰₯δ⁒d/(log2⁑d)kβˆ’3π‘‹π‘Œπ›Ώπ‘‘superscriptsubscript2π‘‘π‘˜3|X|,|Y|\geq\delta d/(\log_{2}d)^{k-3}| italic_X | , | italic_Y | β‰₯ italic_Ξ΄ italic_d / ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT.

Proof.

Let π’žπ’ž\mathcal{C}caligraphic_C be the set of kπ‘˜kitalic_k-cycles containing v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. For each Οƒβˆˆπ’žπœŽπ’ž\sigma\in\mathcal{C}italic_Οƒ ∈ caligraphic_C, pick an ordering (Οƒ0,Οƒ1,…,Οƒkβˆ’1,Οƒ0)subscript𝜎0subscript𝜎1…subscriptπœŽπ‘˜1subscript𝜎0(\sigma_{0},\sigma_{1},\dots,\sigma_{k-1},\sigma_{0})( italic_Οƒ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_Οƒ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Οƒ start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT , italic_Οƒ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) of the vertices of ΟƒπœŽ\sigmaitalic_Οƒ, where {Οƒi,Οƒi+1}∈E⁒(Οƒ)subscriptπœŽπ‘–subscriptπœŽπ‘–1𝐸𝜎\{\sigma_{i},\sigma_{i+1}\}\in E(\sigma){ italic_Οƒ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Οƒ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT } ∈ italic_E ( italic_Οƒ ) with subscripts modulo kπ‘˜kitalic_k. Let Xi={Οƒi:Οƒβˆˆπ’ž}subscript𝑋𝑖conditional-setsubscriptπœŽπ‘–πœŽπ’žX_{i}=\{\sigma_{i}:\sigma\in\mathcal{C}\}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_Οƒ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_Οƒ ∈ caligraphic_C } for 1≀i≀kβˆ’11π‘–π‘˜11\leq i\leq k-11 ≀ italic_i ≀ italic_k - 1. Then for 2≀i≀kβˆ’22π‘–π‘˜22\leq i\leq k-22 ≀ italic_i ≀ italic_k - 2 there exist sets Xiβ€²βŠ†Xisuperscriptsubscript𝑋𝑖′subscript𝑋𝑖X_{i}^{\prime}\subseteq X_{i}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT βŠ† italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and ai∈{1,2,…,d}subscriptπ‘Žπ‘–12…𝑑a_{i}\in\{1,2,\dots,d\}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ { 1 , 2 , … , italic_d } such that every vertex of Xiβ€²superscriptsubscript𝑋𝑖′X_{i}^{\prime}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT has at least ai/2subscriptπ‘Žπ‘–2a_{i}/2italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / 2 and at most aisubscriptπ‘Žπ‘–a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT neighbors in Xiβˆ’1β€²superscriptsubscript𝑋𝑖1β€²X_{i-1}^{\prime}italic_X start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT, and the number of cycles Οƒβˆˆπ’žπœŽπ’ž\sigma\in\mathcal{C}italic_Οƒ ∈ caligraphic_C with Οƒi∈Xiβ€²subscriptπœŽπ‘–superscriptsubscript𝑋𝑖′\sigma_{i}\in X_{i}^{\prime}italic_Οƒ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT is at least δ⁒dkβˆ’1/(log2⁑d)kβˆ’3𝛿superscriptπ‘‘π‘˜1superscriptsubscript2π‘‘π‘˜3\delta d^{k-1}/(\log_{2}d)^{k-3}italic_Ξ΄ italic_d start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT / ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT. This can be done iteratively, starting by splitting X2subscript𝑋2X_{2}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT into sets X2⁒jsubscript𝑋2𝑗X_{2j}italic_X start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT such that every vertex of X2⁒jsubscript𝑋2𝑗X_{2j}italic_X start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT has at least d/2j+1𝑑superscript2𝑗1d/2^{j+1}italic_d / 2 start_POSTSUPERSCRIPT italic_j + 1 end_POSTSUPERSCRIPT and at most d/2j𝑑superscript2𝑗d/2^{j}italic_d / 2 start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT neighbors in X1subscript𝑋1X_{1}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, for 0≀j≀log2⁑d0𝑗subscript2𝑑0\leq j\leq\log_{2}d0 ≀ italic_j ≀ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d, and considering an X2β€²=X2⁒jsuperscriptsubscript𝑋2β€²subscript𝑋2𝑗X_{2}^{\prime}=X_{2j}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT = italic_X start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT for which at least |π’ž|/(log2⁑d)π’žsubscript2𝑑|\mathcal{C}|/(\log_{2}d)| caligraphic_C | / ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) of the cycles use an edge between X1subscript𝑋1X_{1}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and X2β€²superscriptsubscript𝑋2β€²X_{2}^{\prime}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT. Call this collection of cycles π’ž2subscriptπ’ž2\mathcal{C}_{2}caligraphic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Then repeat the argument for the pair X2β€²superscriptsubscript𝑋2β€²X_{2}^{\prime}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT and X3subscript𝑋3X_{3}italic_X start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, with collection of cycles π’ž2subscriptπ’ž2\mathcal{C}_{2}caligraphic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT so there exists X3β€²βŠ†X3superscriptsubscript𝑋3β€²subscript𝑋3X_{3}^{\prime}\subseteq X_{3}italic_X start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT βŠ† italic_X start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT and π’ž3βŠ‚π’ž2subscriptπ’ž3subscriptπ’ž2\mathcal{C}_{3}\subset\mathcal{C}_{2}caligraphic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT βŠ‚ caligraphic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT with |π’ž3|β‰₯|π’ž2|/log2⁑dsubscriptπ’ž3subscriptπ’ž2subscript2𝑑|\mathcal{C}_{3}|\geq|\mathcal{C}_{2}|/\log_{2}d| caligraphic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | β‰₯ | caligraphic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | / roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d. We continue to obtain Xiβ€²βŠ†Xisuperscriptsubscript𝑋𝑖′subscript𝑋𝑖X_{i}^{\prime}\subseteq X_{i}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT βŠ† italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all i≀kβˆ’2π‘–π‘˜2i\leq k-2italic_i ≀ italic_k - 2 and set π’žβ€²:=π’žkβˆ’2assignsuperscriptπ’žβ€²subscriptπ’žπ‘˜2\mathcal{C}^{\prime}:=\mathcal{C}_{k-2}caligraphic_C start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT := caligraphic_C start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT. Then

|π’žβ€²|β‰₯|π’ž|(log2⁑d)kβˆ’3β‰₯δ⁒dkβˆ’1(log2⁑d)kβˆ’3superscriptπ’žβ€²π’žsuperscriptsubscript2π‘‘π‘˜3𝛿superscriptπ‘‘π‘˜1superscriptsubscript2π‘‘π‘˜3|\mathcal{C}^{\prime}|\geq\frac{|\mathcal{C}|}{(\log_{2}d)^{k-3}}\geq\frac{% \delta d^{k-1}}{(\log_{2}d)^{k-3}}| caligraphic_C start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | β‰₯ divide start_ARG | caligraphic_C | end_ARG start_ARG ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT end_ARG β‰₯ divide start_ARG italic_Ξ΄ italic_d start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT end_ARG start_ARG ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT end_ARG

and for every Οƒβˆˆπ’žβ€²πœŽsuperscriptπ’žβ€²\sigma\in\mathcal{C}^{\prime}italic_Οƒ ∈ caligraphic_C start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT we have Οƒi∈Xiβ€²subscriptπœŽπ‘–superscriptsubscript𝑋𝑖′\sigma_{i}\in X_{i}^{\prime}italic_Οƒ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT for i≀kβˆ’2π‘–π‘˜2i\leq k-2italic_i ≀ italic_k - 2.

Let X=Xkβˆ’2′𝑋superscriptsubscriptπ‘‹π‘˜2β€²X=X_{k-2}^{\prime}italic_X = italic_X start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT and Y=Xkβˆ’1β€²π‘Œsuperscriptsubscriptπ‘‹π‘˜1β€²Y=X_{k-1}^{\prime}italic_Y = italic_X start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT. The number of cycles in π’žβ€²superscriptπ’žβ€²\mathcal{C}^{\prime}caligraphic_C start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT containing an edge {x,y}π‘₯𝑦\{x,y\}{ italic_x , italic_y } with x∈Xπ‘₯𝑋x\in Xitalic_x ∈ italic_X and y∈Yπ‘¦π‘Œy\in Yitalic_y ∈ italic_Y is at most a2⁒⋯⁒akβˆ’2≀dkβˆ’3subscriptπ‘Ž2β‹―subscriptπ‘Žπ‘˜2superscriptπ‘‘π‘˜3a_{2}\cdots a_{k-2}\leq d^{k-3}italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT β‹― italic_a start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT ≀ italic_d start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT as the maximum degree is d𝑑ditalic_d. Consequently,

|π’žβ€²|≀e⁒(X,Y)β‹…a2⁒⋯⁒akβˆ’2≀e⁒(X,Y)β‹…dkβˆ’3superscriptπ’žβ€²β‹…π‘’π‘‹π‘Œsubscriptπ‘Ž2β‹―subscriptπ‘Žπ‘˜2β‹…π‘’π‘‹π‘Œsuperscriptπ‘‘π‘˜3|\mathcal{C}^{\prime}|\leq e(X,Y)\cdot a_{2}\cdots a_{k-2}\leq e(X,Y)\cdot d^{% k-3}| caligraphic_C start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | ≀ italic_e ( italic_X , italic_Y ) β‹… italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT β‹― italic_a start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT ≀ italic_e ( italic_X , italic_Y ) β‹… italic_d start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT

and

dβ‹…min⁑{|X|,|Y|}β‰₯e⁒(X,Y)β‰₯|π’žβ€²|dkβˆ’3β‰₯δ⁒d2(log2⁑d)kβˆ’3.β‹…π‘‘π‘‹π‘Œπ‘’π‘‹π‘Œsuperscriptπ’žβ€²superscriptπ‘‘π‘˜3𝛿superscript𝑑2superscriptsubscript2π‘‘π‘˜3d\cdot\min\{|X|,|Y|\}\geq e(X,Y)\geq\frac{|\mathcal{C}^{\prime}|}{d^{k-3}}\geq% \frac{\delta d^{2}}{(\log_{2}d)^{k-3}}.italic_d β‹… roman_min { | italic_X | , | italic_Y | } β‰₯ italic_e ( italic_X , italic_Y ) β‰₯ divide start_ARG | caligraphic_C start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | end_ARG start_ARG italic_d start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT end_ARG β‰₯ divide start_ARG italic_Ξ΄ italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT end_ARG .

Therefore min⁑{|X|,|Y|}β‰₯δ⁒d/(log2⁑d)kβˆ’3π‘‹π‘Œπ›Ώπ‘‘superscriptsubscript2π‘‘π‘˜3\min\{|X|,|Y|\}\geq\delta d/(\log_{2}d)^{k-3}roman_min { | italic_X | , | italic_Y | } β‰₯ italic_Ξ΄ italic_d / ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT.

Next we prove that e⁒(X,Y)β‰₯δ⁒|X|⁒|Y|/(2⁒log2⁑d)kπ‘’π‘‹π‘Œπ›Ώπ‘‹π‘Œsuperscript2subscript2π‘‘π‘˜e(X,Y)\geq\delta|X||Y|/(2\log_{2}d)^{k}italic_e ( italic_X , italic_Y ) β‰₯ italic_Ξ΄ | italic_X | | italic_Y | / ( 2 roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT. By construction, for 2≀i≀kβˆ’22π‘–π‘˜22\leq i\leq k-22 ≀ italic_i ≀ italic_k - 2,

ai2⁒|Xiβ€²|≀e⁒(Xiβ€²,Xiβˆ’1β€²)≀d⁒|Xiβˆ’1β€²|subscriptπ‘Žπ‘–2superscriptsubscript𝑋𝑖′𝑒superscriptsubscript𝑋𝑖′superscriptsubscript𝑋𝑖1′𝑑superscriptsubscript𝑋𝑖1β€²\frac{a_{i}}{2}|X_{i}^{\prime}|\leq e(X_{i}^{\prime},X_{i-1}^{\prime})\leq d|X% _{i-1}^{\prime}|divide start_ARG italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG | italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | ≀ italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT , italic_X start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT ) ≀ italic_d | italic_X start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT |

and therefore ai≀2⁒d⁒|Xiβˆ’1β€²|/|Xiβ€²|subscriptπ‘Žπ‘–2𝑑superscriptsubscript𝑋𝑖1β€²superscriptsubscript𝑋𝑖′a_{i}\leq 2d|X_{i-1}^{\prime}|/|X_{i}^{\prime}|italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≀ 2 italic_d | italic_X start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | / | italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT |. Since |X1β€²|≀|N⁒(v0)|≀dsuperscriptsubscript𝑋1′𝑁subscript𝑣0𝑑|X_{1}^{\prime}|\leq|N(v_{0})|\leq d| italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | ≀ | italic_N ( italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) | ≀ italic_d and |Y|≀|N⁒(v0)|≀dπ‘Œπ‘subscript𝑣0𝑑|Y|\leq|N(v_{0})|\leq d| italic_Y | ≀ | italic_N ( italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) | ≀ italic_d,

a2⁒a3⁒⋯⁒akβˆ’2β‰€βˆi=2kβˆ’22⁒d⁒|Xiβˆ’1β€²||Xiβ€²|=(2⁒d)kβˆ’3⁒|X1β€²||Xkβˆ’2β€²|=(2⁒d)kβˆ’3⁒|X1β€²||X|≀(2⁒d)kβˆ’1|X|⁒|Y|.subscriptπ‘Ž2subscriptπ‘Ž3β‹―subscriptπ‘Žπ‘˜2superscriptsubscriptproduct𝑖2π‘˜22𝑑superscriptsubscript𝑋𝑖1β€²superscriptsubscript𝑋𝑖′superscript2π‘‘π‘˜3superscriptsubscript𝑋1β€²superscriptsubscriptπ‘‹π‘˜2β€²superscript2π‘‘π‘˜3superscriptsubscript𝑋1′𝑋superscript2π‘‘π‘˜1π‘‹π‘Œa_{2}a_{3}\cdots a_{k-2}\leq\prod_{i=2}^{k-2}2d\frac{|X_{i-1}^{\prime}|}{|X_{i% }^{\prime}|}=(2d)^{k-3}\frac{|X_{1}^{\prime}|}{|X_{k-2}^{\prime}|}=(2d)^{k-3}% \frac{|X_{1}^{\prime}|}{|X|}\leq\frac{(2d)^{k-1}}{|X||Y|}.italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT β‹― italic_a start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT ≀ ∏ start_POSTSUBSCRIPT italic_i = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 2 end_POSTSUPERSCRIPT 2 italic_d divide start_ARG | italic_X start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | end_ARG start_ARG | italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | end_ARG = ( 2 italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT divide start_ARG | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | end_ARG start_ARG | italic_X start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | end_ARG = ( 2 italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT divide start_ARG | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | end_ARG start_ARG | italic_X | end_ARG ≀ divide start_ARG ( 2 italic_d ) start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT end_ARG start_ARG | italic_X | | italic_Y | end_ARG .

Consequently,

e⁒(X,Y)β‰₯|π’žβ€²|a2⁒…⁒akβˆ’2β‰₯|π’žβ€²|⁒|X|⁒|Y|(2⁒d)kβˆ’1β‰₯Ξ΄2kβˆ’1⁒(log2⁑d)kβˆ’3⁒|X|⁒|Y|π‘’π‘‹π‘Œsuperscriptπ’žβ€²subscriptπ‘Ž2…subscriptπ‘Žπ‘˜2superscriptπ’žβ€²π‘‹π‘Œsuperscript2π‘‘π‘˜1𝛿superscript2π‘˜1superscriptsubscript2π‘‘π‘˜3π‘‹π‘Œe(X,Y)\geq\frac{|\mathcal{C}^{\prime}|}{a_{2}\dots a_{k-2}}\geq\frac{|\mathcal% {C}^{\prime}||X||Y|}{(2d)^{k-1}}\geq\frac{\delta}{2^{k-1}(\log_{2}d)^{k-3}}|X|% |Y|italic_e ( italic_X , italic_Y ) β‰₯ divide start_ARG | caligraphic_C start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | end_ARG start_ARG italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT … italic_a start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT end_ARG β‰₯ divide start_ARG | caligraphic_C start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT | | italic_X | | italic_Y | end_ARG start_ARG ( 2 italic_d ) start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT end_ARG β‰₯ divide start_ARG italic_Ξ΄ end_ARG start_ARG 2 start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k - 3 end_POSTSUPERSCRIPT end_ARG | italic_X | | italic_Y |

completing the proof. ∎

The following lemma is a standard consequence of the dependent random choice method and we omit the proof.

Lemma 2.

Let Ξ³β‰₯0𝛾0\gamma\geq 0italic_Ξ³ β‰₯ 0, sβ‰₯1𝑠1s\geq 1italic_s β‰₯ 1, and let X𝑋Xitalic_X and Yπ‘ŒYitalic_Y be disjoint sets of vertices in a graph, such that e⁒(X,Y)β‰₯γ⁒|X|⁒|Y|π‘’π‘‹π‘Œπ›Ύπ‘‹π‘Œe(X,Y)\geq\gamma|X||Y|italic_e ( italic_X , italic_Y ) β‰₯ italic_Ξ³ | italic_X | | italic_Y |. Then for any sβ‰₯1𝑠1s\geq 1italic_s β‰₯ 1, there exists a set ZβŠ†Yπ‘π‘ŒZ\subseteq Yitalic_Z βŠ† italic_Y such that

|Z|β‰₯12⁒γs⁒|Y|𝑍12superscriptπ›Ύπ‘ π‘Œ|Z|\geq\frac{1}{2}\gamma^{s}|Y|| italic_Z | β‰₯ divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_Ξ³ start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT | italic_Y |

and every pair of vertices in Z𝑍Zitalic_Z has at least γ⁒|X|⁒|Y|βˆ’1/s𝛾𝑋superscriptπ‘Œ1𝑠\gamma|X||Y|^{-1/s}italic_Ξ³ | italic_X | | italic_Y | start_POSTSUPERSCRIPT - 1 / italic_s end_POSTSUPERSCRIPT neighbors in X𝑋Xitalic_X.

Finally we need the following standard result about independent sets in hypergraphs first proved by SpencerΒ [19].

Lemma 3.

For every kβ‰₯2π‘˜2k\geq 2italic_k β‰₯ 2, every n𝑛nitalic_n-vertex kπ‘˜kitalic_k-uniform hypergraph with average degree d>0𝑑0d>0italic_d > 0 has an independent set of size at least (1βˆ’1/k)⁒n/d1/(kβˆ’1)11π‘˜π‘›superscript𝑑1π‘˜1(1-1/k)n/d^{1/(k-1)}( 1 - 1 / italic_k ) italic_n / italic_d start_POSTSUPERSCRIPT 1 / ( italic_k - 1 ) end_POSTSUPERSCRIPT.

We now have the necessary ingredients to prove TheoremΒ 5.

Proof of Theorem 5. For kβ‰₯3π‘˜3k\geq 3italic_k β‰₯ 3, let

Ο΅k=1100⁒(kβˆ’1).subscriptitalic-Ο΅π‘˜1100π‘˜1\epsilon_{k}=\frac{1}{100(k-1)}.italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 100 ( italic_k - 1 ) end_ARG .

Let H𝐻Hitalic_H be an n𝑛nitalic_n-vertex K4subscript𝐾4K_{4}italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT-free graph with maximum degree d𝑑ditalic_d. We will find a CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT-free subset of vertices in H𝐻Hitalic_H of size at least n1/3+Ο΅ksuperscript𝑛13subscriptitalic-Ο΅π‘˜n^{1/3+\epsilon_{k}}italic_n start_POSTSUPERSCRIPT 1 / 3 + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Suppose that β–³β–³\triangleβ–³ is the maximum number of copies of CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT that a vertex is in. Define

Ξ΄:=β–³dkβˆ’1.assign𝛿△superscriptπ‘‘π‘˜1\delta:=\frac{\triangle}{d^{k-1}}.italic_Ξ΄ := divide start_ARG β–³ end_ARG start_ARG italic_d start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT end_ARG .

We now obtain two different bounds on the maximum CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT-free set.

Bound 1. Let β„‹β„‹\mathcal{H}caligraphic_H be the kπ‘˜kitalic_k-uniform hypergraph with V⁒(β„‹)=V⁒(H)𝑉ℋ𝑉𝐻V(\mathcal{H})=V(H)italic_V ( caligraphic_H ) = italic_V ( italic_H ) and E⁒(β„‹)={V⁒(Ck):CkβŠ†H}𝐸ℋconditional-set𝑉subscriptπΆπ‘˜subscriptπΆπ‘˜π»E(\mathcal{H})=\{V(C_{k}):C_{k}\subseteq H\}italic_E ( caligraphic_H ) = { italic_V ( italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) : italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT βŠ† italic_H }. Then β„‹β„‹\mathcal{H}caligraphic_H has maximum degree (and hence average degree) at most β–³β–³\triangleβ–³ and LemmaΒ 3 implies that H𝐻Hitalic_H has an independent set of size at least

Ω⁒(nβ–³1kβˆ’1)=Ω⁒(nΞ΄1kβˆ’1⁒d).Ω𝑛superscriptβ–³1π‘˜1Ω𝑛superscript𝛿1π‘˜1𝑑\Omega\left(\frac{n}{\triangle^{\frac{1}{k-1}}}\right)=\Omega\left(\frac{n}{% \delta^{\frac{1}{k-1}}d}\right).roman_Ξ© ( divide start_ARG italic_n end_ARG start_ARG β–³ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_k - 1 end_ARG end_POSTSUPERSCRIPT end_ARG ) = roman_Ξ© ( divide start_ARG italic_n end_ARG start_ARG italic_Ξ΄ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_k - 1 end_ARG end_POSTSUPERSCRIPT italic_d end_ARG ) .

Bound 2. Let v0∈V⁒(H)subscript𝑣0𝑉𝐻v_{0}\in V(H)italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_V ( italic_H ) lie in β–³=δ⁒dkβˆ’1△𝛿superscriptπ‘‘π‘˜1\triangle=\delta d^{k-1}β–³ = italic_Ξ΄ italic_d start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT copies of CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. By LemmaΒ 1, there exist sets X,YβŠ†V⁒(H)π‘‹π‘Œπ‘‰π»X,Y\subseteq V(H)italic_X , italic_Y βŠ† italic_V ( italic_H ) such that

e(X,Y)β‰₯Ξ΄(2⁒log2⁑d)k|X||Y|=:Ξ³|X||Y|,e(X,Y)\geq\frac{\delta}{(2\log_{2}d)^{k}}|X||Y|=:\gamma|X||Y|,italic_e ( italic_X , italic_Y ) β‰₯ divide start_ARG italic_Ξ΄ end_ARG start_ARG ( 2 roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG | italic_X | | italic_Y | = : italic_Ξ³ | italic_X | | italic_Y | ,

where |X|β‰₯|Y|β‰₯γ⁒dπ‘‹π‘Œπ›Ύπ‘‘|X|\geq|Y|\geq\gamma d| italic_X | β‰₯ | italic_Y | β‰₯ italic_Ξ³ italic_d. By LemmaΒ 2 applied with s=3𝑠3s=3italic_s = 3, there exists ZβŠ†Yπ‘π‘ŒZ\subseteq Yitalic_Z βŠ† italic_Y with

|Z|β‰₯12⁒γ3⁒|Y|𝑍12superscript𝛾3π‘Œ|Z|\geq\frac{1}{2}\gamma^{3}|Y|| italic_Z | β‰₯ divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_Ξ³ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_Y |

such that every pair of vertices in Z𝑍Zitalic_Z has at least γ⁒|X|⁒|Y|βˆ’1/3β‰₯γ⁒|Y|2/3𝛾𝑋superscriptπ‘Œ13𝛾superscriptπ‘Œ23\gamma|X||Y|^{-1/3}\geq\gamma|Y|^{2/3}italic_Ξ³ | italic_X | | italic_Y | start_POSTSUPERSCRIPT - 1 / 3 end_POSTSUPERSCRIPT β‰₯ italic_Ξ³ | italic_Y | start_POSTSUPERSCRIPT 2 / 3 end_POSTSUPERSCRIPT common neighbors in X𝑋Xitalic_X. If Z𝑍Zitalic_Z is not an independent set in H𝐻Hitalic_H, then there exists {x,y}∈E⁒(H)π‘₯𝑦𝐸𝐻\{x,y\}\in E(H){ italic_x , italic_y } ∈ italic_E ( italic_H ) with x,y∈Zπ‘₯𝑦𝑍x,y\in Zitalic_x , italic_y ∈ italic_Z. Since H𝐻Hitalic_H is K4subscript𝐾4K_{4}italic_K start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT-free, N⁒(x)∩N⁒(y)𝑁π‘₯𝑁𝑦N(x)\cap N(y)italic_N ( italic_x ) ∩ italic_N ( italic_y ) is an independent set in H𝐻Hitalic_H of size at least γ⁒|Y|2/3β‰₯Ξ³5/3⁒d2/3𝛾superscriptπ‘Œ23superscript𝛾53superscript𝑑23\gamma|Y|^{2/3}\geq\gamma^{5/3}d^{2/3}italic_Ξ³ | italic_Y | start_POSTSUPERSCRIPT 2 / 3 end_POSTSUPERSCRIPT β‰₯ italic_Ξ³ start_POSTSUPERSCRIPT 5 / 3 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT 2 / 3 end_POSTSUPERSCRIPT. Otherwise, Z𝑍Zitalic_Z is an independent set in H𝐻Hitalic_H of size at least 12⁒γ3⁒|Y|β‰₯12⁒γ4⁒d12superscript𝛾3π‘Œ12superscript𝛾4𝑑\frac{1}{2}\gamma^{3}|Y|\geq\frac{1}{2}\gamma^{4}ddivide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_Ξ³ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_Y | β‰₯ divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_Ξ³ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_d. In particular, H𝐻Hitalic_H has a CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT-free induced subgraph of size at least

h⁒(d,Ξ³)=min⁑{Ξ³5/3⁒d2/3,12⁒γ4⁒d}.β„Žπ‘‘π›Ύsuperscript𝛾53superscript𝑑2312superscript𝛾4𝑑h(d,\gamma)=\min\left\{\gamma^{5/3}d^{2/3},\frac{1}{2}\gamma^{4}d\right\}.italic_h ( italic_d , italic_Ξ³ ) = roman_min { italic_Ξ³ start_POSTSUPERSCRIPT 5 / 3 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT 2 / 3 end_POSTSUPERSCRIPT , divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_Ξ³ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_d } .

It is also the case that G𝐺Gitalic_G always contains an independent set with at least n/(d+1)𝑛𝑑1n/(d+1)italic_n / ( italic_d + 1 ) vertices, by TurΓ‘n’s Theorem. If d≀n2/3βˆ’Ο΅k𝑑superscript𝑛23subscriptitalic-Ο΅π‘˜d\leq n^{2/3-\epsilon_{k}}italic_d ≀ italic_n start_POSTSUPERSCRIPT 2 / 3 - italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, this gives an independent set of size n1/3+Ο΅ksuperscript𝑛13subscriptitalic-Ο΅π‘˜n^{1/3+\epsilon_{k}}italic_n start_POSTSUPERSCRIPT 1 / 3 + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT in G𝐺Gitalic_G. If dβ‰₯n2/3+2⁒ϡk𝑑superscript𝑛232subscriptitalic-Ο΅π‘˜d\geq n^{2/3+2\epsilon_{k}}italic_d β‰₯ italic_n start_POSTSUPERSCRIPT 2 / 3 + 2 italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, then since the neighborhood of a vertex of degree d𝑑ditalic_d induces a triangle-free graph, this neighborhood contains an independent set of size at least d1/2β‰₯n1/3+Ο΅ksuperscript𝑑12superscript𝑛13subscriptitalic-Ο΅π‘˜d^{1/2}\geq n^{1/3+\epsilon_{k}}italic_d start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT β‰₯ italic_n start_POSTSUPERSCRIPT 1 / 3 + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT in G𝐺Gitalic_G. Therefore we assume n2/3βˆ’Ο΅k≀d≀n2/3+2⁒ϡksuperscript𝑛23subscriptitalic-Ο΅π‘˜π‘‘superscript𝑛232subscriptitalic-Ο΅π‘˜n^{2/3-\epsilon_{k}}\leq d\leq n^{2/3+2\epsilon_{k}}italic_n start_POSTSUPERSCRIPT 2 / 3 - italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ≀ italic_d ≀ italic_n start_POSTSUPERSCRIPT 2 / 3 + 2 italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. In that case, by Bounds 1 and 2, we obtain a CksubscriptπΆπ‘˜C_{k}italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT-free set of size at least

Ω⁒(max⁑{nΞ΄1kβˆ’1⁒d,h⁒(d,Ξ³)}).Ω𝑛superscript𝛿1π‘˜1π‘‘β„Žπ‘‘π›Ύ\Omega\left(\max\left\{\frac{n}{\delta^{\frac{1}{k-1}}d},h(d,\gamma)\right\}% \right).roman_Ξ© ( roman_max { divide start_ARG italic_n end_ARG start_ARG italic_Ξ΄ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_k - 1 end_ARG end_POSTSUPERSCRIPT italic_d end_ARG , italic_h ( italic_d , italic_Ξ³ ) } ) .

If Ξ΄<nβˆ’1/25𝛿superscript𝑛125\delta<n^{-1/25}italic_Ξ΄ < italic_n start_POSTSUPERSCRIPT - 1 / 25 end_POSTSUPERSCRIPT, then Bound 1 is at least

Ω⁒(nΞ΄1kβˆ’1⁒d)=Ω⁒(n13+125⁒(kβˆ’1)βˆ’2⁒ϡk)=Ω⁒(n13+Ο΅k)Ω𝑛superscript𝛿1π‘˜1𝑑Ωsuperscript𝑛13125π‘˜12subscriptitalic-Ο΅π‘˜Ξ©superscript𝑛13subscriptitalic-Ο΅π‘˜\Omega\left(\frac{n}{\delta^{\frac{1}{k-1}}d}\right)=\Omega\left(n^{\frac{1}{3% }+\frac{1}{25(k-1)}-2\epsilon_{k}}\right)=\Omega(n^{\frac{1}{3}+\epsilon_{k}})roman_Ξ© ( divide start_ARG italic_n end_ARG start_ARG italic_Ξ΄ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_k - 1 end_ARG end_POSTSUPERSCRIPT italic_d end_ARG ) = roman_Ξ© ( italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 3 end_ARG + divide start_ARG 1 end_ARG start_ARG 25 ( italic_k - 1 ) end_ARG - 2 italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) = roman_Ξ© ( italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 3 end_ARG + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT )

as Ο΅k<1/75⁒(kβˆ’1)subscriptitalic-Ο΅π‘˜175π‘˜1\epsilon_{k}<1/75(k-1)italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT < 1 / 75 ( italic_k - 1 ). So we may assume that Ξ΄β‰₯nβˆ’1/25𝛿superscript𝑛125\delta\geq n^{-1/25}italic_Ξ΄ β‰₯ italic_n start_POSTSUPERSCRIPT - 1 / 25 end_POSTSUPERSCRIPT, and, as n𝑛nitalic_n is sufficiently large, we may assume that Ξ³=Ξ΄/(2⁒log2⁑d)k>nβˆ’1/24𝛾𝛿superscript2subscript2π‘‘π‘˜superscript𝑛124\gamma=\delta/(2\log_{2}d)^{k}>n^{-1/24}italic_Ξ³ = italic_Ξ΄ / ( 2 roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT > italic_n start_POSTSUPERSCRIPT - 1 / 24 end_POSTSUPERSCRIPT. In this case, Ο΅k<1/100subscriptitalic-Ο΅π‘˜1100\epsilon_{k}<1/100italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT < 1 / 100 and d>n2/3βˆ’Ο΅k𝑑superscript𝑛23subscriptitalic-Ο΅π‘˜d>n^{2/3-\epsilon_{k}}italic_d > italic_n start_POSTSUPERSCRIPT 2 / 3 - italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT yield

Ξ³53⁒d23>nβˆ’572+49βˆ’2⁒ϡk3>n13+Ο΅kΒ andΒ Ξ³4⁒d>nβˆ’16+23βˆ’Ο΅k>2⁒n13+Ο΅kformulae-sequencesuperscript𝛾53superscript𝑑23superscript𝑛572492subscriptitalic-Ο΅π‘˜3superscript𝑛13subscriptitalic-Ο΅π‘˜Β andΒ superscript𝛾4𝑑superscript𝑛1623subscriptitalic-Ο΅π‘˜2superscript𝑛13subscriptitalic-Ο΅π‘˜\gamma^{\frac{5}{3}}d^{\frac{2}{3}}>n^{\frac{-5}{72}+\frac{4}{9}-\frac{2% \epsilon_{k}}{3}}>n^{\frac{1}{3}+\epsilon_{k}}\qquad\hbox{ and }\qquad\gamma^{% 4}d>n^{-\frac{1}{6}+\frac{2}{3}-\epsilon_{k}}>2n^{\frac{1}{3}+\epsilon_{k}}italic_Ξ³ start_POSTSUPERSCRIPT divide start_ARG 5 end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT divide start_ARG 2 end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT > italic_n start_POSTSUPERSCRIPT divide start_ARG - 5 end_ARG start_ARG 72 end_ARG + divide start_ARG 4 end_ARG start_ARG 9 end_ARG - divide start_ARG 2 italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT > italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 3 end_ARG + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and italic_Ξ³ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_d > italic_n start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 6 end_ARG + divide start_ARG 2 end_ARG start_ARG 3 end_ARG - italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT > 2 italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 3 end_ARG + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT

and therefore h⁒(d,Ξ³)>n1/3+Ο΅kβ„Žπ‘‘π›Ύsuperscript𝑛13subscriptitalic-Ο΅π‘˜h(d,\gamma)>n^{1/3+\epsilon_{k}}italic_h ( italic_d , italic_Ξ³ ) > italic_n start_POSTSUPERSCRIPT 1 / 3 + italic_Ο΅ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, completing the proof. ∎

4 Proof of TheoremΒ 3

A sunflower is a collection of sets every pair of which have the same intersection, called the core. We need the well-known Erdos-Rado sunflower lemma in the form below.

Lemma 4.

Fix t,m>0π‘‘π‘š0t,m>0italic_t , italic_m > 0. Every t𝑑titalic_t-uniform hypergraph with more than t!⁒(mβˆ’1)t𝑑superscriptπ‘š1𝑑t!(m-1)^{t}italic_t ! ( italic_m - 1 ) start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT edges has a sunflower of size mπ‘šmitalic_m.

Proof of TheoremΒ 3. Let |V⁒(G)|=kπ‘‰πΊπ‘˜|V(G)|=k| italic_V ( italic_G ) | = italic_k. We may assume that G𝐺Gitalic_G is not acyclic, since otherwise G𝐺Gitalic_G would be contained in a blowup of F𝐹Fitalic_F. Consider an n𝑛nitalic_n by n𝑛nitalic_n bipartite graph H𝐻Hitalic_H without cycles of length at most 2⁒k2π‘˜2k2 italic_k and where every vertex has degree d=n13⁒k𝑑superscript𝑛13π‘˜d=n^{\frac{1}{3k}}italic_d = italic_n start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 3 italic_k end_ARG end_POSTSUPERSCRIPT. Such graphs exist, for example the bipartite Ramanujan graphs of Lubotzky, Phillips and SarnakΒ [13], or even a random d𝑑ditalic_d-regular graph (if we are not fussy about the constant in the exponent). We now employ the methods ofΒ [15, 5]. Let Hβ€²superscript𝐻′H^{\prime}italic_H start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT be the restriction of the square of H𝐻Hitalic_H to one part of H𝐻Hitalic_H, so that Hβ€²superscript𝐻′H^{\prime}italic_H start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT has n𝑛nitalic_n vertices, and is a union of n𝑛nitalic_n edge-disjoint cliques K1,K2,…,Knsuperscript𝐾1superscript𝐾2…superscript𝐾𝑛K^{1},K^{2},\dots,K^{n}italic_K start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , … , italic_K start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT of order d𝑑ditalic_d. Since G𝐺Gitalic_G is not acyclic, every copy of G𝐺Gitalic_G in Hβ€²superscript𝐻′H^{\prime}italic_H start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT is contained in one of those cliques. In each of the cliques, take a random coloring with elements of V⁒(F)𝑉𝐹V(F)italic_V ( italic_F ), and put an edge between any two color classes corresponding to an edge of F𝐹Fitalic_F. Since F𝐹Fitalic_F is hom⁒(G)hom𝐺\mbox{hom}(G)hom ( italic_G )-free, this random graph Hβˆ—superscript𝐻H^{*}italic_H start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is G𝐺Gitalic_G-free. We claim (similarly to the proof of Theorem 1), that every set of at least (2⁒n⁒|V⁒(F)|⁒log⁑|V⁒(F)|)/d2𝑛𝑉𝐹𝑉𝐹𝑑(2n|V(F)|\log|V(F)|)/d( 2 italic_n | italic_V ( italic_F ) | roman_log | italic_V ( italic_F ) | ) / italic_d vertices of Hβˆ—superscript𝐻H^{*}italic_H start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT induces a copy of F𝐹Fitalic_F. The probability that such a set X𝑋Xitalic_X does not induce a copy of F𝐹Fitalic_F is at most

∏i=1n|V⁒(F)|β‹…(1βˆ’1|V⁒(F)|)|X∩V⁒(Ki)|.superscriptsubscriptproduct𝑖1𝑛⋅𝑉𝐹superscript11𝑉𝐹𝑋𝑉superscript𝐾𝑖\prod_{i=1}^{n}|V(F)|\cdot\Bigl{(}1-\frac{1}{|V(F)|}\Bigr{)}^{|X\cap V(K^{i})|}.∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | italic_V ( italic_F ) | β‹… ( 1 - divide start_ARG 1 end_ARG start_ARG | italic_V ( italic_F ) | end_ARG ) start_POSTSUPERSCRIPT | italic_X ∩ italic_V ( italic_K start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) | end_POSTSUPERSCRIPT .

Now we use

βˆ‘i=1n|X∩V⁒(Ki)|=d⁒|X|superscriptsubscript𝑖1𝑛𝑋𝑉subscript𝐾𝑖𝑑𝑋\sum_{i=1}^{n}|X\cap V(K_{i})|=d|X|βˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | italic_X ∩ italic_V ( italic_K start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) | = italic_d | italic_X |

and therefore the expected number of such X𝑋Xitalic_X is at most

(n|X|)β‹…|V⁒(F)|n⁒(1βˆ’1|V⁒(F)|)d⁒|X|<e|X|⁒log⁑nβˆ’d⁒|X|/|V⁒(F)|+n⁒log⁑|V⁒(F)|.β‹…binomial𝑛𝑋superscript𝑉𝐹𝑛superscript11𝑉𝐹𝑑𝑋superscript𝑒𝑋𝑛𝑑𝑋𝑉𝐹𝑛𝑉𝐹{n\choose|X|}\cdot|V(F)|^{n}\Bigl{(}1-\frac{1}{|V(F)|}\Bigr{)}^{d|X|}<e^{|X|% \log n-d|X|/|V(F)|+n\log|V(F)|}.( binomial start_ARG italic_n end_ARG start_ARG | italic_X | end_ARG ) β‹… | italic_V ( italic_F ) | start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( 1 - divide start_ARG 1 end_ARG start_ARG | italic_V ( italic_F ) | end_ARG ) start_POSTSUPERSCRIPT italic_d | italic_X | end_POSTSUPERSCRIPT < italic_e start_POSTSUPERSCRIPT | italic_X | roman_log italic_n - italic_d | italic_X | / | italic_V ( italic_F ) | + italic_n roman_log | italic_V ( italic_F ) | end_POSTSUPERSCRIPT .

This is vanishing since d⁒|X|/|V⁒(F)|>2⁒n⁒log⁑|V⁒(F)|𝑑𝑋𝑉𝐹2𝑛𝑉𝐹d|X|/|V(F)|>2n\log|V(F)|italic_d | italic_X | / | italic_V ( italic_F ) | > 2 italic_n roman_log | italic_V ( italic_F ) |. Therefore

fF,G⁒(n)=O⁒(n/d)=O⁒(n1βˆ’13⁒k)=O⁒(n1βˆ’13⁒|V⁒(G)|)subscript𝑓𝐹𝐺𝑛𝑂𝑛𝑑𝑂superscript𝑛113π‘˜π‘‚superscript𝑛113𝑉𝐺f_{F,G}(n)=O(n/d)=O(n^{1-\frac{1}{3k}})=O(n^{1-\frac{1}{3|V(G)|}})italic_f start_POSTSUBSCRIPT italic_F , italic_G end_POSTSUBSCRIPT ( italic_n ) = italic_O ( italic_n / italic_d ) = italic_O ( italic_n start_POSTSUPERSCRIPT 1 - divide start_ARG 1 end_ARG start_ARG 3 italic_k end_ARG end_POSTSUPERSCRIPT ) = italic_O ( italic_n start_POSTSUPERSCRIPT 1 - divide start_ARG 1 end_ARG start_ARG 3 | italic_V ( italic_G ) | end_ARG end_POSTSUPERSCRIPT )

and we may take Ξ΅G=1/3⁒|V⁒(G)|subscriptπœ€πΊ13𝑉𝐺\varepsilon_{G}=1/3|V(G)|italic_Ξ΅ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = 1 / 3 | italic_V ( italic_G ) | in Theorem 3.


We now prove the second statement of the theorem. Let r:=|V⁒(G)|βˆ’1assignπ‘Ÿπ‘‰πΊ1r:=|V(G)|-1italic_r := | italic_V ( italic_G ) | - 1. If G𝐺Gitalic_G is acyclic, then any n𝑛nitalic_n-vertex G𝐺Gitalic_G-free graph has an independent set I𝐼Iitalic_I of size linear in n𝑛nitalic_n, and I𝐼Iitalic_I is certainly F𝐹Fitalic_F-free for any nonempty F𝐹Fitalic_F so we are done. If G𝐺Gitalic_G is not 2-connected, then let F=Kr𝐹subscriptπΎπ‘ŸF=K_{r}italic_F = italic_K start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT so that F𝐹Fitalic_F is clearly G𝐺Gitalic_G-free. Suppose that H𝐻Hitalic_H is an n𝑛nitalic_n-vertex G𝐺Gitalic_G-free graph. Then no two rπ‘Ÿritalic_r-cliques in H𝐻Hitalic_H have a point in common, for otherwise the subgraph of H𝐻Hitalic_H induced by their union contains G𝐺Gitalic_G. Indeed, we can pick some vertex in the intersection of the two cliques to be a cut vertex of G𝐺Gitalic_G, and then easily embed G𝐺Gitalic_G in the union of the two cliques (the embedding is even easier if G𝐺Gitalic_G is not connected). Consequently, the rπ‘Ÿritalic_r-cliques in H𝐻Hitalic_H are pairwise vertex disjoint. Then H𝐻Hitalic_H has a KrsubscriptπΎπ‘ŸK_{r}italic_K start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT-free induced subgraph of size at least (1βˆ’1/r)⁒n11π‘Ÿπ‘›(1-1/r)n( 1 - 1 / italic_r ) italic_n and we are done.

We may henceforth assume that G𝐺Gitalic_G is 2-connected. Since G𝐺Gitalic_G is not a clique, let v,w𝑣𝑀v,witalic_v , italic_w be nonadjacent vertices in G𝐺Gitalic_G. Let G+superscript𝐺G^{+}italic_G start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT be the graph obtained from G𝐺Gitalic_G by adding all edges that are not already in G𝐺Gitalic_G between {v,w}𝑣𝑀\{v,w\}{ italic_v , italic_w } and NG⁒(v)βˆͺNG⁒(w)subscript𝑁𝐺𝑣subscript𝑁𝐺𝑀N_{G}(v)\cup N_{G}(w)italic_N start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) βˆͺ italic_N start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_w ). So G+βŠƒG𝐺superscript𝐺G^{+}\supset Gitalic_G start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT βŠƒ italic_G, and v𝑣vitalic_v and w𝑀witalic_w are clones in G+superscript𝐺G^{+}italic_G start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT. Let Gβˆ—=G+βˆ’{w}superscript𝐺superscript𝐺𝑀G^{*}=G^{+}-\{w\}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT = italic_G start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT - { italic_w } and let Gβˆ—βˆ—=G+βˆ’{v,w}=Gβˆ—βˆ’{v}superscript𝐺absentsuperscript𝐺𝑣𝑀superscript𝐺𝑣G^{**}=G^{+}-\{v,w\}=G^{*}-\{v\}italic_G start_POSTSUPERSCRIPT βˆ— βˆ— end_POSTSUPERSCRIPT = italic_G start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT - { italic_v , italic_w } = italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT - { italic_v }. So Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT has rπ‘Ÿritalic_r vertices and Gβˆ—βˆ—superscript𝐺absentG^{**}italic_G start_POSTSUPERSCRIPT βˆ— βˆ— end_POSTSUPERSCRIPT has rβˆ’1π‘Ÿ1r-1italic_r - 1 vertices.

Assume that t𝑑titalic_t is sufficiently large in terms of rπ‘Ÿritalic_r and set Ξ΄=1/5⁒r2𝛿15superscriptπ‘Ÿ2\delta=1/5r^{2}italic_Ξ΄ = 1 / 5 italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Apply PropositionΒ B to obtain a t𝑑titalic_t-vertex rπ‘Ÿritalic_r-uniform hypergraph Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT with girth larger than r+1π‘Ÿ1r+1italic_r + 1, and the property that for every s𝑠sitalic_s-set S𝑆Sitalic_S with t1βˆ’Ξ΄β‰€s≀tβˆ’1superscript𝑑1𝛿𝑠𝑑1t^{1-\delta}\leq s\leq t-1italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT ≀ italic_s ≀ italic_t - 1, the number of edges in Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT with exactly rβˆ’1π‘Ÿ1r-1italic_r - 1 vertices in S𝑆Sitalic_S is at least

110(srβˆ’1)(tβˆ’s)t1βˆ’r+12⁒r>(srβˆ’1)(tβˆ’s)t1βˆ’r+13⁒r=:qs.\frac{1}{10}{s\choose r-1}(t-s)t^{1-r+\frac{1}{2r}}>{s\choose r-1}(t-s)t^{1-r+% \frac{1}{3r}}=:q_{s}.divide start_ARG 1 end_ARG start_ARG 10 end_ARG ( binomial start_ARG italic_s end_ARG start_ARG italic_r - 1 end_ARG ) ( italic_t - italic_s ) italic_t start_POSTSUPERSCRIPT 1 - italic_r + divide start_ARG 1 end_ARG start_ARG 2 italic_r end_ARG end_POSTSUPERSCRIPT > ( binomial start_ARG italic_s end_ARG start_ARG italic_r - 1 end_ARG ) ( italic_t - italic_s ) italic_t start_POSTSUPERSCRIPT 1 - italic_r + divide start_ARG 1 end_ARG start_ARG 3 italic_r end_ARG end_POSTSUPERSCRIPT = : italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT .

Inside each hyperedge e𝑒eitalic_e of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT, place randomly a copy of Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT. More precisely, among all r!π‘Ÿr!italic_r ! ways to map the vertices of Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT to e𝑒eitalic_e, we pick one with probability 1/r!1π‘Ÿ1/r!1 / italic_r !. Let F𝐹Fitalic_F be the resulting graph with V⁒(F)=V⁒(Fβˆ—)𝑉𝐹𝑉superscript𝐹V(F)=V(F^{*})italic_V ( italic_F ) = italic_V ( italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT ) and E⁒(F)𝐸𝐹E(F)italic_E ( italic_F ) comprises the graph edges in all copies of Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT that lie in edges of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT.

As G𝐺Gitalic_G has r+1π‘Ÿ1r+1italic_r + 1 vertices, and Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is rπ‘Ÿritalic_r-uniform, there is no copy of G𝐺Gitalic_G in F𝐹Fitalic_F that lies entirely within an edge of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT. If a copy of G𝐺Gitalic_G in F𝐹Fitalic_F has two vertices x,yπ‘₯𝑦x,yitalic_x , italic_y that do not lie in the same edge of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT, then, since G𝐺Gitalic_G is 2-connected, there is a cycle in G𝐺Gitalic_G containing xπ‘₯xitalic_x and y𝑦yitalic_y and this cycle yields a hypergraph cycle in Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT of length at most r+1π‘Ÿ1r+1italic_r + 1 which does not exist by construction. We conclude that F𝐹Fitalic_F is G𝐺Gitalic_G-free.

Furthermore, we claim that for any s𝑠sitalic_s-set S𝑆Sitalic_S in F𝐹Fitalic_F, with t1βˆ’Ξ΄β‰€s≀tβˆ’1superscript𝑑1𝛿𝑠𝑑1t^{1-\delta}\leq s\leq t-1italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT ≀ italic_s ≀ italic_t - 1, there exists an edge e𝑒eitalic_e of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT with rβˆ’1π‘Ÿ1r-1italic_r - 1 vertices in S𝑆Sitalic_S and one vertex in V⁒(Fβˆ—)βˆ’S𝑉superscript𝐹𝑆V(F^{*})-Sitalic_V ( italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT ) - italic_S such that

the copy of Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT placed inside e𝑒eitalic_e induces a copy of Gβˆ—βˆ—superscript𝐺absentG^{**}italic_G start_POSTSUPERSCRIPT βˆ— βˆ— end_POSTSUPERSCRIPT within e∩S𝑒𝑆e\cap Sitalic_e ∩ italic_S. (4)

Indeed, (4) follows from the following argument. For each of the qssubscriptπ‘žπ‘ q_{s}italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT edges e𝑒eitalic_e of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT with exactly one vertex outside S𝑆Sitalic_S, the probability that e𝑒eitalic_e fails (4) is at most 1βˆ’1/r!11π‘Ÿ1-1/r!1 - 1 / italic_r !. Since any two such edges e,e′𝑒superscript𝑒′e,e^{\prime}italic_e , italic_e start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT share at most one vertex by the girth property of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT, the probability that all of these qssubscriptπ‘žπ‘ q_{s}italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT edges e𝑒eitalic_e fail (4) is at most (1βˆ’1/r!)qssuperscript11π‘Ÿsubscriptπ‘žπ‘ (1-1/r!)^{q_{s}}( 1 - 1 / italic_r ! ) start_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Consequently, the probability that there exists an s𝑠sitalic_s-set for which there is no e𝑒eitalic_e satisfying (4) is at most

βˆ‘s=t1βˆ’Ξ΄tβˆ’1(ts)⁒eβˆ’qs/r!=βˆ‘s=t1βˆ’Ξ΄tβˆ’1(ttβˆ’s)⁒eβˆ’qs/r!<t⁒(ttβˆ’s)⁒eβˆ’qs/r!<elog⁑t+(tβˆ’s)⁒log⁑tβˆ’qs/r!<1.superscriptsubscript𝑠superscript𝑑1𝛿𝑑1binomial𝑑𝑠superscript𝑒subscriptπ‘žπ‘ π‘Ÿsuperscriptsubscript𝑠superscript𝑑1𝛿𝑑1binomial𝑑𝑑𝑠superscript𝑒subscriptπ‘žπ‘ π‘Ÿπ‘‘binomial𝑑𝑑𝑠superscript𝑒subscriptπ‘žπ‘ π‘Ÿsuperscript𝑒𝑑𝑑𝑠𝑑subscriptπ‘žπ‘ π‘Ÿ1\sum_{s=t^{1-\delta}}^{t-1}{t\choose s}e^{-q_{s}/r!}=\sum_{s=t^{1-\delta}}^{t-% 1}{t\choose t-s}e^{-q_{s}/r!}<t{t\choose t-s}e^{-q_{s}/r!}<e^{\log t+(t-s)\log t% -q_{s}/r!}<1.βˆ‘ start_POSTSUBSCRIPT italic_s = italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ( binomial start_ARG italic_t end_ARG start_ARG italic_s end_ARG ) italic_e start_POSTSUPERSCRIPT - italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT / italic_r ! end_POSTSUPERSCRIPT = βˆ‘ start_POSTSUBSCRIPT italic_s = italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ( binomial start_ARG italic_t end_ARG start_ARG italic_t - italic_s end_ARG ) italic_e start_POSTSUPERSCRIPT - italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT / italic_r ! end_POSTSUPERSCRIPT < italic_t ( binomial start_ARG italic_t end_ARG start_ARG italic_t - italic_s end_ARG ) italic_e start_POSTSUPERSCRIPT - italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT / italic_r ! end_POSTSUPERSCRIPT < italic_e start_POSTSUPERSCRIPT roman_log italic_t + ( italic_t - italic_s ) roman_log italic_t - italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT / italic_r ! end_POSTSUPERSCRIPT < 1 .

The final inequality holds as Ξ΄=1/5⁒r2𝛿15superscriptπ‘Ÿ2\delta=1/5r^{2}italic_Ξ΄ = 1 / 5 italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT implies

1βˆ’r+13⁒r+(1βˆ’Ξ΄)⁒(rβˆ’1)>0.1π‘Ÿ13π‘Ÿ1π›Ώπ‘Ÿ101-r+\frac{1}{3r}+(1-\delta)(r-1)>0.1 - italic_r + divide start_ARG 1 end_ARG start_ARG 3 italic_r end_ARG + ( 1 - italic_Ξ΄ ) ( italic_r - 1 ) > 0 .

Hence we may assume that for all s𝑠sitalic_s-sets S𝑆Sitalic_S with t1βˆ’Ξ΄β‰€s≀tβˆ’1superscript𝑑1𝛿𝑠𝑑1t^{1-\delta}\leq s\leq t-1italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT ≀ italic_s ≀ italic_t - 1 there exists an edge e𝑒eitalic_e of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT with rβˆ’1π‘Ÿ1r-1italic_r - 1 vertices in S𝑆Sitalic_S and one vertex in V⁒(Fβˆ—)βˆ’S𝑉superscript𝐹𝑆V(F^{*})-Sitalic_V ( italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT ) - italic_S which satisfies (4).

Now let H𝐻Hitalic_H be any G𝐺Gitalic_G-free n𝑛nitalic_n-vertex graph. We are to find an F𝐹Fitalic_F-free set of size Ω⁒(n1βˆ’Ξ΅)Ξ©superscript𝑛1πœ€\Omega(n^{1-\varepsilon})roman_Ξ© ( italic_n start_POSTSUPERSCRIPT 1 - italic_Ξ΅ end_POSTSUPERSCRIPT ). Let R=r!+1π‘…π‘Ÿ1R=r!+1italic_R = italic_r ! + 1 and

T=t!⁒(R⁒(tβˆ’1rβˆ’1)βˆ’1)t.𝑇𝑑superscript𝑅binomial𝑑1π‘Ÿ11𝑑T=t!\left(R{t-1\choose r-1}-1\right)^{t}.italic_T = italic_t ! ( italic_R ( binomial start_ARG italic_t - 1 end_ARG start_ARG italic_r - 1 end_ARG ) - 1 ) start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT .

Set b=t1βˆ’Ξ΄π‘superscript𝑑1𝛿b=t^{1-\delta}italic_b = italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT. We claim the number of copies of F𝐹Fitalic_F in H𝐻Hitalic_H is at most

T⁒(nb).𝑇binomial𝑛𝑏T{n\choose b}.italic_T ( binomial start_ARG italic_n end_ARG start_ARG italic_b end_ARG ) . (5)

If (5) holds, then there are at most O⁒(nb)𝑂superscript𝑛𝑏O(n^{b})italic_O ( italic_n start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ) copies of F𝐹Fitalic_F in H𝐻Hitalic_H and we finish the proof as follows. Consider the t𝑑titalic_t-uniform hypergraph β„‹β„‹\mathcal{H}caligraphic_H with V⁒(β„‹)=V⁒(H)𝑉ℋ𝑉𝐻V(\mathcal{H})=V(H)italic_V ( caligraphic_H ) = italic_V ( italic_H ) and E⁒(β„‹)={V⁒(F):FβŠ†H}𝐸ℋconditional-set𝑉𝐹𝐹𝐻E(\mathcal{H})=\{V(F):F\subseteq H\}italic_E ( caligraphic_H ) = { italic_V ( italic_F ) : italic_F βŠ† italic_H }. The average degree of β„‹β„‹\mathcal{H}caligraphic_H is O⁒(nbβˆ’1)𝑂superscript𝑛𝑏1O(n^{b-1})italic_O ( italic_n start_POSTSUPERSCRIPT italic_b - 1 end_POSTSUPERSCRIPT ). By LemmaΒ 3, β„‹β„‹\mathcal{H}caligraphic_H contains an independent set I𝐼Iitalic_I of size Ω⁒(n1βˆ’(bβˆ’1)/(tβˆ’1))Ξ©superscript𝑛1𝑏1𝑑1\Omega(n^{1-(b-1)/(t-1)})roman_Ξ© ( italic_n start_POSTSUPERSCRIPT 1 - ( italic_b - 1 ) / ( italic_t - 1 ) end_POSTSUPERSCRIPT ). Since (bβˆ’1)/(tβˆ’1)<t1βˆ’Ξ΄/(tβˆ’1)<Ρ𝑏1𝑑1superscript𝑑1𝛿𝑑1πœ€(b-1)/(t-1)<t^{1-\delta}/(t-1)<\varepsilon( italic_b - 1 ) / ( italic_t - 1 ) < italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT / ( italic_t - 1 ) < italic_Ξ΅ for large t𝑑titalic_t, we conclude that I𝐼Iitalic_I is an F𝐹Fitalic_F-free set of size at least Ω⁒(n1βˆ’Ξ΅)Ξ©superscript𝑛1πœ€\Omega(n^{1-\varepsilon})roman_Ξ© ( italic_n start_POSTSUPERSCRIPT 1 - italic_Ξ΅ end_POSTSUPERSCRIPT ).

We now prove (5). Assume to the contrary. Then to each copy of F𝐹Fitalic_F we may associate any b𝑏bitalic_b-subset of its vertices. By pigeonhole, there exists a set C𝐢Citalic_C of b𝑏bitalic_b vertices in H𝐻Hitalic_H and at least T𝑇Titalic_T copies of F𝐹Fitalic_F, say F1,F2,…,FTsubscript𝐹1subscript𝐹2…subscript𝐹𝑇F_{1},F_{2},\dots,F_{T}italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_F start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT for which V⁒(Fi)∩V⁒(Fj)βŠ‡C𝐢𝑉subscript𝐹𝑖𝑉subscript𝐹𝑗V(F_{i})\cap V(F_{j})\supseteq Citalic_V ( italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∩ italic_V ( italic_F start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) βŠ‡ italic_C. Amongst these sets of size t𝑑titalic_t, LemmaΒ 4 gives a sunflower of size R⁒(tβˆ’1rβˆ’1)𝑅binomial𝑑1π‘Ÿ1R{t-1\choose r-1}italic_R ( binomial start_ARG italic_t - 1 end_ARG start_ARG italic_r - 1 end_ARG ) with core SβŠ‡C𝐢𝑆S\supseteq Citalic_S βŠ‡ italic_C. As t1βˆ’Ξ΄β‰€|S|≀tβˆ’1superscript𝑑1𝛿𝑆𝑑1t^{1-\delta}\leq|S|\leq t-1italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT ≀ | italic_S | ≀ italic_t - 1, by (4), for each of these R⁒(tβˆ’1rβˆ’1)𝑅binomial𝑑1π‘Ÿ1R{t-1\choose r-1}italic_R ( binomial start_ARG italic_t - 1 end_ARG start_ARG italic_r - 1 end_ARG ) copies A𝐴Aitalic_A of F𝐹Fitalic_F, there is a vertex vAsubscript𝑣𝐴v_{A}italic_v start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT outside S𝑆Sitalic_S that forms an edge eAsubscript𝑒𝐴e_{A}italic_e start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT in A𝐴Aitalic_A with rβˆ’1π‘Ÿ1r-1italic_r - 1 vertices in S𝑆Sitalic_S and vAsubscript𝑣𝐴v_{A}italic_v start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT plays the role of vertex v𝑣vitalic_v in the copy of Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT within eAsubscript𝑒𝐴e_{A}italic_e start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT (in other words, eAβ€²=eAβˆ’{vA}subscriptsuperscript𝑒′𝐴subscript𝑒𝐴subscript𝑣𝐴e^{\prime}_{A}=e_{A}-\{v_{A}\}italic_e start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = italic_e start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT - { italic_v start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT } induces a copy of Gβˆ—βˆ—superscript𝐺absentG^{**}italic_G start_POSTSUPERSCRIPT βˆ— βˆ— end_POSTSUPERSCRIPT). By pigeonhole, there exists a set eβ€²βŠ†Ssuperscript𝑒′𝑆e^{\prime}\subseteq Sitalic_e start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT βŠ† italic_S of size rβˆ’1π‘Ÿ1r-1italic_r - 1 and vertices v1,v2,…,vRβˆ‰Ssubscript𝑣1subscript𝑣2…subscript𝑣𝑅𝑆v_{1},v_{2},\ldots,v_{R}\not\in Sitalic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT βˆ‰ italic_S such that ei=eβ€²βˆͺ{vi}subscript𝑒𝑖superscript𝑒′subscript𝑣𝑖e_{i}=e^{\prime}\cup\{v_{i}\}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT βˆͺ { italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } is an edge of Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT for all i∈[R]𝑖delimited-[]𝑅i\in[R]italic_i ∈ [ italic_R ] and visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT plays the role of v𝑣vitalic_v in the copy of Gβˆ—superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT in eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (in other words, eβ€²superscript𝑒′e^{\prime}italic_e start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT induces a copy of Gβˆ—βˆ—superscript𝐺absentG^{**}italic_G start_POSTSUPERSCRIPT βˆ— βˆ— end_POSTSUPERSCRIPT). Since R>r!π‘…π‘ŸR>r!italic_R > italic_r !, we can find vertices, say v1,v2subscript𝑣1subscript𝑣2v_{1},v_{2}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, such that the copies of Gβˆ—βˆ—superscript𝐺absentG^{**}italic_G start_POSTSUPERSCRIPT βˆ— βˆ— end_POSTSUPERSCRIPT within eβ€²superscript𝑒′e^{\prime}italic_e start_POSTSUPERSCRIPT β€² end_POSTSUPERSCRIPT for both v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are identical. This copy of Gβˆ—βˆ—superscript𝐺absentG^{**}italic_G start_POSTSUPERSCRIPT βˆ— βˆ— end_POSTSUPERSCRIPT together with v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is a copy of G+superscript𝐺G^{+}italic_G start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT. We conclude HβŠ‡G+βŠ‡Gsuperset-of-or-equals𝐻superscript𝐺superset-of-or-equals𝐺H\supseteq G^{+}\supseteq Gitalic_H βŠ‡ italic_G start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT βŠ‡ italic_G, a contradiction. ∎

5 Appendix

Proposition A.

(ErdΕ‘s-Frankl-RΓΆdl) For any N,Rβ‰₯3𝑁𝑅3N,R\geq 3italic_N , italic_R β‰₯ 3 such that Nβ‰₯Rβ‰₯log⁑N𝑁𝑅𝑁N\geq R\geq\log Nitalic_N β‰₯ italic_R β‰₯ roman_log italic_N, there exists a linear triangle-free N𝑁Nitalic_N-vertex R𝑅Ritalic_R-uniform hypergraph H𝐻Hitalic_H with

|E⁒(H)|β‰₯N2R8⁒logR⁑N.𝐸𝐻superscript𝑁2superscript𝑅8subscript𝑅𝑁|E(H)|\geq\frac{N^{2}}{R^{8\sqrt{\log_{R}N}}}.| italic_E ( italic_H ) | β‰₯ divide start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUPERSCRIPT 8 square-root start_ARG roman_log start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT italic_N end_ARG end_POSTSUPERSCRIPT end_ARG .
Proof.

The construction is based on the construction of BehrendΒ [3] of a dense subset of {1,2,…,n}12…𝑛\{1,2,\dots,n\}{ 1 , 2 , … , italic_n } with no three-term arithmetic progression. For completeness, we describe this construction here, which is slightly better than the construction of ErdΕ‘s, Frankl and RΓΆdlΒ [6]. Let A𝐴Aitalic_A be the set of positive integer points on the sphere of radius rπ‘Ÿritalic_r in ℝdsuperscriptℝ𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. For any choice of positive integers x1,x2,…,xdβˆ’4≀r/dsubscriptπ‘₯1subscriptπ‘₯2…subscriptπ‘₯𝑑4π‘Ÿπ‘‘x_{1},x_{2},\dots,x_{d-4}\leq r/\sqrt{d}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_d - 4 end_POSTSUBSCRIPT ≀ italic_r / square-root start_ARG italic_d end_ARG, there exist positive integers xdβˆ’3,xdβˆ’2,xdβˆ’1,xdsubscriptπ‘₯𝑑3subscriptπ‘₯𝑑2subscriptπ‘₯𝑑1subscriptπ‘₯𝑑x_{d-3},x_{d-2},x_{d-1},x_{d}italic_x start_POSTSUBSCRIPT italic_d - 3 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_d - 2 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT such that x12+x22+β‹―+xd2=r2superscriptsubscriptπ‘₯12superscriptsubscriptπ‘₯22β‹―superscriptsubscriptπ‘₯𝑑2superscriptπ‘Ÿ2x_{1}^{2}+x_{2}^{2}+\dots+x_{d}^{2}=r^{2}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + β‹― + italic_x start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT by Lagrange’s four squares theorem. Therefore

|A|β‰₯(rd)dβˆ’4.𝐴superscriptπ‘Ÿπ‘‘π‘‘4|A|\geq\Bigl{(}\frac{r}{\sqrt{d}}\Bigr{)}^{d-4}.| italic_A | β‰₯ ( divide start_ARG italic_r end_ARG start_ARG square-root start_ARG italic_d end_ARG end_ARG ) start_POSTSUPERSCRIPT italic_d - 4 end_POSTSUPERSCRIPT .

Let Xi=[i⁒r]dsubscript𝑋𝑖superscriptdelimited-[]π‘–π‘Ÿπ‘‘X_{i}=[ir]^{d}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ italic_i italic_r ] start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. Then define an R𝑅Ritalic_R-uniform R𝑅Ritalic_R-partite hypergraph H𝐻Hitalic_H where V⁒(H)𝑉𝐻V(H)italic_V ( italic_H ) consists of X1βˆͺX2βˆͺβ‹―βˆͺXRsubscript𝑋1subscript𝑋2β‹―subscript𝑋𝑅X_{1}\cup X_{2}\cup\dots\cup X_{R}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT βˆͺ italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT βˆͺ β‹― βˆͺ italic_X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT and let E⁒(H)={x,x+a,x+2⁒a,…,x+(Rβˆ’1)⁒a}𝐸𝐻π‘₯π‘₯π‘Žπ‘₯2π‘Žβ€¦π‘₯𝑅1π‘ŽE(H)=\{x,x+a,x+2a,\dots,x+(R-1)a\}italic_E ( italic_H ) = { italic_x , italic_x + italic_a , italic_x + 2 italic_a , … , italic_x + ( italic_R - 1 ) italic_a } where a∈Aπ‘Žπ΄a\in Aitalic_a ∈ italic_A and x∈X1π‘₯subscript𝑋1x\in X_{1}italic_x ∈ italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Then

|V⁒(H)|=N≀Rd+1⁒rdΒ andΒ |E⁒(H)|=|A|⁒|X1|β‰₯r2⁒dβˆ’4ddβˆ’4.formulae-sequence𝑉𝐻𝑁superscript𝑅𝑑1superscriptπ‘Ÿπ‘‘Β and 𝐸𝐻𝐴subscript𝑋1superscriptπ‘Ÿ2𝑑4superscript𝑑𝑑4|V(H)|=N\leq R^{d+1}r^{d}\quad\quad\mbox{ and }\quad\quad|E(H)|=|A||X_{1}|\geq% \frac{r^{2d-4}}{\sqrt{d}^{d-4}}.| italic_V ( italic_H ) | = italic_N ≀ italic_R start_POSTSUPERSCRIPT italic_d + 1 end_POSTSUPERSCRIPT italic_r start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and | italic_E ( italic_H ) | = | italic_A | | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | β‰₯ divide start_ARG italic_r start_POSTSUPERSCRIPT 2 italic_d - 4 end_POSTSUPERSCRIPT end_ARG start_ARG square-root start_ARG italic_d end_ARG start_POSTSUPERSCRIPT italic_d - 4 end_POSTSUPERSCRIPT end_ARG .

Put d=⌊logR⁑NβŒ‹<log⁑N≀R𝑑subscript𝑅𝑁𝑁𝑅d=\lfloor\sqrt{\log_{R}N}\rfloor<\log N\leq Ritalic_d = ⌊ square-root start_ARG roman_log start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT italic_N end_ARG βŒ‹ < roman_log italic_N ≀ italic_R and r=Rdπ‘Ÿsuperscript𝑅𝑑r=R^{d}italic_r = italic_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. Then r4≀R4⁒dsuperscriptπ‘Ÿ4superscript𝑅4𝑑r^{4}\leq R^{4d}italic_r start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ≀ italic_R start_POSTSUPERSCRIPT 4 italic_d end_POSTSUPERSCRIPT and dd<Rdsuperscript𝑑𝑑superscript𝑅𝑑d^{d}<R^{d}italic_d start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT < italic_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and hence

|E⁒(H)|β‰₯N2R2⁒d+2⁒r4⁒ddβˆ’42β‰₯N2R8⁒d>N2R8⁒logR⁑N.𝐸𝐻superscript𝑁2superscript𝑅2𝑑2superscriptπ‘Ÿ4superscript𝑑𝑑42superscript𝑁2superscript𝑅8𝑑superscript𝑁2superscript𝑅8subscript𝑅𝑁|E(H)|\geq\frac{N^{2}}{R^{2d+2}r^{4}d^{\frac{d-4}{2}}}\geq\frac{N^{2}}{R^{8d}}% >\frac{N^{2}}{R^{8\sqrt{\log_{R}N}}}.| italic_E ( italic_H ) | β‰₯ divide start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUPERSCRIPT 2 italic_d + 2 end_POSTSUPERSCRIPT italic_r start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT divide start_ARG italic_d - 4 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT end_ARG β‰₯ divide start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUPERSCRIPT 8 italic_d end_POSTSUPERSCRIPT end_ARG > divide start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUPERSCRIPT 8 square-root start_ARG roman_log start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT italic_N end_ARG end_POSTSUPERSCRIPT end_ARG .

This establishes (i). If e={x,x+a,x+2⁒a,…,x+(Rβˆ’1)⁒a}𝑒π‘₯π‘₯π‘Žπ‘₯2π‘Žβ€¦π‘₯𝑅1π‘Že=\{x,x+a,x+2a,\dots,x+(R-1)a\}italic_e = { italic_x , italic_x + italic_a , italic_x + 2 italic_a , … , italic_x + ( italic_R - 1 ) italic_a } and f={y,y+b,y+2⁒b,…,y+(Rβˆ’1)⁒b}𝑓𝑦𝑦𝑏𝑦2𝑏…𝑦𝑅1𝑏f=\{y,y+b,y+2b,\dots,y+(R-1)b\}italic_f = { italic_y , italic_y + italic_b , italic_y + 2 italic_b , … , italic_y + ( italic_R - 1 ) italic_b } intersect in two vertices of H𝐻Hitalic_H, say x+i⁒a=y+i⁒bπ‘₯π‘–π‘Žπ‘¦π‘–π‘x+ia=y+ibitalic_x + italic_i italic_a = italic_y + italic_i italic_b and x+j⁒a=y+j⁒bπ‘₯π‘—π‘Žπ‘¦π‘—π‘x+ja=y+jbitalic_x + italic_j italic_a = italic_y + italic_j italic_b, then x=yπ‘₯𝑦x=yitalic_x = italic_y and a=bπ‘Žπ‘a=bitalic_a = italic_b, establishing (ii). If e,f𝑒𝑓e,fitalic_e , italic_f and g={z,z+c,z+2⁒c,…,z+(Rβˆ’1)⁒c}𝑔𝑧𝑧𝑐𝑧2𝑐…𝑧𝑅1𝑐g=\{z,z+c,z+2c,\dots,z+(R-1)c\}italic_g = { italic_z , italic_z + italic_c , italic_z + 2 italic_c , … , italic_z + ( italic_R - 1 ) italic_c } have |e∩f|=|f∩g|=|g∩e|=1𝑒𝑓𝑓𝑔𝑔𝑒1|e\cap f|=|f\cap g|=|g\cap e|=1| italic_e ∩ italic_f | = | italic_f ∩ italic_g | = | italic_g ∩ italic_e | = 1, then we may assume x+i⁒a=y+i⁒bπ‘₯π‘–π‘Žπ‘¦π‘–π‘x+ia=y+ibitalic_x + italic_i italic_a = italic_y + italic_i italic_b and y+j⁒b=z+j⁒c𝑦𝑗𝑏𝑧𝑗𝑐y+jb=z+jcitalic_y + italic_j italic_b = italic_z + italic_j italic_c and z+k⁒c=x+k⁒aπ‘§π‘˜π‘π‘₯π‘˜π‘Žz+kc=x+kaitalic_z + italic_k italic_c = italic_x + italic_k italic_a for some distinct i,j,k∈{0,1,2,…,Rβˆ’1}π‘–π‘—π‘˜012…𝑅1i,j,k\in\{0,1,2,\dots,R-1\}italic_i , italic_j , italic_k ∈ { 0 , 1 , 2 , … , italic_R - 1 } and a,b,c∈Aπ‘Žπ‘π‘π΄a,b,c\in Aitalic_a , italic_b , italic_c ∈ italic_A. This implies i⁒(bβˆ’a)+j⁒(cβˆ’b)+k⁒(aβˆ’c)=0π‘–π‘π‘Žπ‘—π‘π‘π‘˜π‘Žπ‘0i(b-a)+j(c-b)+k(a-c)=0italic_i ( italic_b - italic_a ) + italic_j ( italic_c - italic_b ) + italic_k ( italic_a - italic_c ) = 0 which means (kβˆ’i)⁒a+(iβˆ’j)⁒b+(jβˆ’k)⁒c=0π‘˜π‘–π‘Žπ‘–π‘—π‘π‘—π‘˜π‘0(k-i)a+(i-j)b+(j-k)c=0( italic_k - italic_i ) italic_a + ( italic_i - italic_j ) italic_b + ( italic_j - italic_k ) italic_c = 0. Since the sphere is strictly convex, a,b,cπ‘Žπ‘π‘a,b,citalic_a , italic_b , italic_c cannot all lie in a line, and hence we conclude two of i,j,kπ‘–π‘—π‘˜i,j,kitalic_i , italic_j , italic_k are identical, a contradiction. This proves (iii). ∎

For the next proposition, we need some definitions. A cycle of length two in a hypergraph is a set of two edges that share at least two vertices. A cycle of length β„“>2β„“2\ell>2roman_β„“ > 2 is a collection of β„“β„“\ellroman_β„“ distinct vertices v1,v2,…,vβ„“subscript𝑣1subscript𝑣2…subscript𝑣ℓv_{1},v_{2},\ldots,v_{\ell}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT and β„“β„“\ellroman_β„“ distinct edges e1,…,eβ„“subscript𝑒1…subscript𝑒ℓe_{1},\ldots,e_{\ell}italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_e start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT where ei∩ei+1={vi+1}subscript𝑒𝑖subscript𝑒𝑖1subscript𝑣𝑖1e_{i}\cap e_{i+1}=\{v_{i+1}\}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_e start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = { italic_v start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT } (indices modulo β„“β„“\ellroman_β„“) and ei∩ej=βˆ…subscript𝑒𝑖subscript𝑒𝑗e_{i}\cap e_{j}=\emptysetitalic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = βˆ… otherwise. So an β„“β„“\ellroman_β„“-cycle in an rπ‘Ÿritalic_r-uniform hypergraph (β„“>2β„“2\ell>2roman_β„“ > 2) has β„“β„“\ellroman_β„“ edges and ℓ⁒(rβˆ’1)β„“π‘Ÿ1\ell(r-1)roman_β„“ ( italic_r - 1 ) vertices (these are often called loose cycles). Say that a hypergraph H𝐻Hitalic_H has girth g𝑔gitalic_g if the length of the shortest cycle in H𝐻Hitalic_H is g𝑔gitalic_g.

Proposition B.

Fix rβ‰₯2π‘Ÿ2r\geq 2italic_r β‰₯ 2 and Ξ΄=1/5⁒r2𝛿15superscriptπ‘Ÿ2\delta=1/5r^{2}italic_Ξ΄ = 1 / 5 italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. For t𝑑titalic_t sufficiently large, there exists a t𝑑titalic_t-vertex rπ‘Ÿritalic_r-uniform hypergraph Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT with girth at least r+2π‘Ÿ2r+2italic_r + 2 such that for every s𝑠sitalic_s-subset S𝑆Sitalic_S with t1βˆ’Ξ΄<s<tsuperscript𝑑1𝛿𝑠𝑑t^{1-\delta}<s<titalic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT < italic_s < italic_t, the number of edges with exactly one vertex outside S𝑆Sitalic_S is at least

110⁒(srβˆ’1)⁒(tβˆ’s)⁒t1βˆ’r+12⁒r.110binomialπ‘ π‘Ÿ1𝑑𝑠superscript𝑑1π‘Ÿ12π‘Ÿ\frac{1}{10}{s\choose r-1}(t-s)t^{1-r+\frac{1}{2r}}.divide start_ARG 1 end_ARG start_ARG 10 end_ARG ( binomial start_ARG italic_s end_ARG start_ARG italic_r - 1 end_ARG ) ( italic_t - italic_s ) italic_t start_POSTSUPERSCRIPT 1 - italic_r + divide start_ARG 1 end_ARG start_ARG 2 italic_r end_ARG end_POSTSUPERSCRIPT . (6)
Proof.

Consider the binomial random rπ‘Ÿritalic_r-uniform hypergraph H∼H(r)⁒(t,p)similar-to𝐻superscriptπ»π‘Ÿπ‘‘π‘H\sim H^{(r)}(t,p)italic_H ∼ italic_H start_POSTSUPERSCRIPT ( italic_r ) end_POSTSUPERSCRIPT ( italic_t , italic_p ) with t𝑑titalic_t vertices where each edge appears independently with probability p=t1βˆ’r+12⁒r𝑝superscript𝑑1π‘Ÿ12π‘Ÿp=t^{1-r+\frac{1}{2r}}italic_p = italic_t start_POSTSUPERSCRIPT 1 - italic_r + divide start_ARG 1 end_ARG start_ARG 2 italic_r end_ARG end_POSTSUPERSCRIPT. For each 2≀ℓ≀r+12β„“π‘Ÿ12\leq\ell\leq r+12 ≀ roman_β„“ ≀ italic_r + 1, Let Cβ„“subscript𝐢ℓC_{\ell}italic_C start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT denote the cycle of length β„“β„“\ellroman_β„“ (this is unique except for β„“=2β„“2\ell=2roman_β„“ = 2) and let ℬℓsubscriptℬℓ\mathcal{B_{\ell}}caligraphic_B start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT denote a maximal collection of edge-disjoint copies of Cβ„“subscript𝐢ℓC_{\ell}italic_C start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT in H𝐻Hitalic_H. Form Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT by starting with H𝐻Hitalic_H and deleting all β„“β„“\ellroman_β„“ edges from every copy of Cβ„“subscript𝐢ℓC_{\ell}italic_C start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT in ℬℓsubscriptℬℓ\mathcal{B_{\ell}}caligraphic_B start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT for all 2≀ℓ≀r+12β„“π‘Ÿ12\leq\ell\leq r+12 ≀ roman_β„“ ≀ italic_r + 1. Then, by the maximality of ℬℓsubscriptℬℓ\mathcal{B_{\ell}}caligraphic_B start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT, the remaining hypergraph Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT has girth at least r+2π‘Ÿ2r+2italic_r + 2. We will now show that with high probability Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT has the required property.

Pick SβŠ‚V⁒(Fβˆ—)𝑆𝑉superscript𝐹S\subset V(F^{*})italic_S βŠ‚ italic_V ( italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT ) of size s𝑠sitalic_s where t1βˆ’Ξ΄<s<tsuperscript𝑑1𝛿𝑠𝑑t^{1-\delta}<s<titalic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT < italic_s < italic_t. Call an edge in H𝐻Hitalic_H with exactly one vertex outside S𝑆Sitalic_S an S𝑆Sitalic_S-edge. Let X=XS𝑋subscript𝑋𝑆X=X_{S}italic_X = italic_X start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT be the number of S𝑆Sitalic_S-edges, let Yβ„“=YS,β„“subscriptπ‘Œβ„“subscriptπ‘Œπ‘†β„“Y_{\ell}=Y_{S,\ell}italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT = italic_Y start_POSTSUBSCRIPT italic_S , roman_β„“ end_POSTSUBSCRIPT be the number of copies of Cβ„“subscript𝐢ℓC_{\ell}italic_C start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT that contain at least one S𝑆Sitalic_S-edge and let Zβ„“=ZS,β„“subscript𝑍ℓsubscript𝑍𝑆ℓZ_{\ell}=Z_{S,\ell}italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT = italic_Z start_POSTSUBSCRIPT italic_S , roman_β„“ end_POSTSUBSCRIPT be the maximal number of pairwise edge-disjoint copies of Cβ„“subscript𝐢ℓC_{\ell}italic_C start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT, each containing at least one S𝑆Sitalic_S-edge. Obviously, Zℓ≀Yβ„“subscript𝑍ℓsubscriptπ‘Œβ„“Z_{\ell}\leq Y_{\ell}italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ≀ italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT. Define the event

Aβ„“=AS,β„“={X>10⁒r⁒ℓ⁒Zβ„“}.subscript𝐴ℓsubscript𝐴𝑆ℓ𝑋10π‘Ÿβ„“subscript𝑍ℓA_{\ell}=A_{S,\ell}=\{X>10r\ell\,Z_{\ell}\}.italic_A start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT = italic_A start_POSTSUBSCRIPT italic_S , roman_β„“ end_POSTSUBSCRIPT = { italic_X > 10 italic_r roman_β„“ italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT } .

We note that if AS,β„“subscript𝐴𝑆ℓA_{S,\ell}italic_A start_POSTSUBSCRIPT italic_S , roman_β„“ end_POSTSUBSCRIPT holds for every appropriate S𝑆Sitalic_S, and 2≀ℓ≀r+12β„“π‘Ÿ12\leq\ell\leq r+12 ≀ roman_β„“ ≀ italic_r + 1, then the number of S𝑆Sitalic_S-edges in Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is at least

Xβˆ’βˆ‘β„“=2r+1ℓ⁒Zβ„“β‰₯|X|βˆ’βˆ‘β„“=2r+1|X|10⁒r>(0.9)⁒|X|.𝑋superscriptsubscriptβ„“2π‘Ÿ1β„“subscript𝑍ℓ𝑋superscriptsubscriptβ„“2π‘Ÿ1𝑋10π‘Ÿ0.9𝑋X-\sum_{\ell=2}^{r+1}\ell\,Z_{\ell}\geq|X|-\sum_{\ell=2}^{r+1}\frac{|X|}{10r}>% (0.9)|X|.italic_X - βˆ‘ start_POSTSUBSCRIPT roman_β„“ = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT roman_β„“ italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT β‰₯ | italic_X | - βˆ‘ start_POSTSUBSCRIPT roman_β„“ = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT divide start_ARG | italic_X | end_ARG start_ARG 10 italic_r end_ARG > ( 0.9 ) | italic_X | .

Moreover, 𝔼⁒(X)=(srβˆ’1)⁒(tβˆ’s)⁒p𝔼𝑋binomialπ‘ π‘Ÿ1𝑑𝑠𝑝{\mathbb{E}}(X)={s\choose r-1}(t-s)pblackboard_E ( italic_X ) = ( binomial start_ARG italic_s end_ARG start_ARG italic_r - 1 end_ARG ) ( italic_t - italic_s ) italic_p, so if it is also the case that X>𝔼⁒(X)/2𝑋𝔼𝑋2X>{\mathbb{E}}(X)/2italic_X > blackboard_E ( italic_X ) / 2, then the number of S𝑆Sitalic_S-edges in Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT is at least (0.4)⁒𝔼⁒(X)0.4𝔼𝑋(0.4){\mathbb{E}}(X)( 0.4 ) blackboard_E ( italic_X ) and S𝑆Sitalic_S satisfies (6).

We see that

𝔼⁒(Yβ„“)<(srβˆ’1)⁒(tβˆ’s)⁒tℓ⁒(rβˆ’1)βˆ’r⁒pβ„“.𝔼subscriptπ‘Œβ„“binomialπ‘ π‘Ÿ1𝑑𝑠superscriptπ‘‘β„“π‘Ÿ1π‘Ÿsuperscript𝑝ℓ{\mathbb{E}}(Y_{\ell})<{s\choose r-1}(t-s)t^{\ell(r-1)-r}p^{\ell}.blackboard_E ( italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) < ( binomial start_ARG italic_s end_ARG start_ARG italic_r - 1 end_ARG ) ( italic_t - italic_s ) italic_t start_POSTSUPERSCRIPT roman_β„“ ( italic_r - 1 ) - italic_r end_POSTSUPERSCRIPT italic_p start_POSTSUPERSCRIPT roman_β„“ end_POSTSUPERSCRIPT .

As p=t1βˆ’r+12⁒r𝑝superscript𝑑1π‘Ÿ12π‘Ÿp=t^{1-r+\frac{1}{2r}}italic_p = italic_t start_POSTSUPERSCRIPT 1 - italic_r + divide start_ARG 1 end_ARG start_ARG 2 italic_r end_ARG end_POSTSUPERSCRIPT and ℓ≀r+1β„“π‘Ÿ1\ell\leq r+1roman_β„“ ≀ italic_r + 1, we have pℓ⁒tℓ⁒(rβˆ’1)βˆ’rβ‰ͺpmuch-less-thansuperscript𝑝ℓsuperscriptπ‘‘β„“π‘Ÿ1π‘Ÿπ‘p^{\ell}t^{\ell(r-1)-r}\ll pitalic_p start_POSTSUPERSCRIPT roman_β„“ end_POSTSUPERSCRIPT italic_t start_POSTSUPERSCRIPT roman_β„“ ( italic_r - 1 ) - italic_r end_POSTSUPERSCRIPT β‰ͺ italic_p. Therefore 𝔼⁒(Yβ„“)β‰ͺ𝔼⁒(X)much-less-than𝔼subscriptπ‘Œβ„“π”Όπ‘‹{\mathbb{E}}(Y_{\ell})\ll{\mathbb{E}}(X)blackboard_E ( italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) β‰ͺ blackboard_E ( italic_X ). Now

ℙ⁒(Aβ„“Β―)=ℙ⁒(X≀10⁒r⁒ℓ⁒Zβ„“)≀ℙ⁒(X≀𝔼⁒(X)2)+ℙ⁒(Zβ„“β‰₯𝔼⁒(X)20⁒r⁒ℓ).β„™Β―subscript𝐴ℓℙ𝑋10π‘Ÿβ„“subscript𝑍ℓℙ𝑋𝔼𝑋2β„™subscript𝑍ℓ𝔼𝑋20π‘Ÿβ„“{\mathbb{P}}(\overline{A_{\ell}})={\mathbb{P}}(X\leq 10r\ell\,Z_{\ell})\leq{% \mathbb{P}}\left(X\leq\frac{{\mathbb{E}}(X)}{2}\right)+{\mathbb{P}}\left(Z_{% \ell}\geq\frac{{\mathbb{E}}(X)}{20r\ell}\right).blackboard_P ( overΒ― start_ARG italic_A start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT end_ARG ) = blackboard_P ( italic_X ≀ 10 italic_r roman_β„“ italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) ≀ blackboard_P ( italic_X ≀ divide start_ARG blackboard_E ( italic_X ) end_ARG start_ARG 2 end_ARG ) + blackboard_P ( italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT β‰₯ divide start_ARG blackboard_E ( italic_X ) end_ARG start_ARG 20 italic_r roman_β„“ end_ARG ) .

KrivelevichΒ [11, Claim 1] proved that in this setup, for any constant c>0𝑐0c>0italic_c > 0,

ℙ⁒(Zβ„“β‰₯c⁒𝔼⁒(Yβ„“))<eβˆ’c⁒(log⁑cβˆ’1)⁒𝔼⁒(Yβ„“).β„™subscript𝑍ℓ𝑐𝔼subscriptπ‘Œβ„“superscript𝑒𝑐𝑐1𝔼subscriptπ‘Œβ„“{\mathbb{P}}(Z_{\ell}\geq c\,{\mathbb{E}}(Y_{\ell}))<e^{-c\,(\log c-1){\mathbb% {E}}(Y_{\ell})}.blackboard_P ( italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT β‰₯ italic_c blackboard_E ( italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) ) < italic_e start_POSTSUPERSCRIPT - italic_c ( roman_log italic_c - 1 ) blackboard_E ( italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT .

Using this and 𝔼⁒(Yβ„“)β‰ͺ𝔼⁒(X)much-less-than𝔼subscriptπ‘Œβ„“π”Όπ‘‹{\mathbb{E}}(Y_{\ell})\ll{\mathbb{E}}(X)blackboard_E ( italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) β‰ͺ blackboard_E ( italic_X ) we have

ℙ⁒(Zβ„“β‰₯𝔼⁒(X)20⁒r⁒ℓ)=ℙ⁒(Zβ„“β‰₯𝔼⁒(X)20⁒r⁒ℓ⁒𝔼⁒(Yβ„“)⁒𝔼⁒(Yβ„“))<eβˆ’π”Όβ’(X)20⁒r⁒ℓ⁒(log⁑(𝔼⁒(X)20⁒r⁒ℓ⁒𝔼⁒(Yβ„“))βˆ’1)<eβˆ’π”Όβ’(X).β„™subscript𝑍ℓ𝔼𝑋20π‘Ÿβ„“β„™subscript𝑍ℓ𝔼𝑋20π‘Ÿβ„“π”Όsubscriptπ‘Œβ„“π”Όsubscriptπ‘Œβ„“superscript𝑒𝔼𝑋20π‘Ÿβ„“π”Όπ‘‹20π‘Ÿβ„“π”Όsubscriptπ‘Œβ„“1superscript𝑒𝔼𝑋{\mathbb{P}}\left(Z_{\ell}\geq\frac{{\mathbb{E}}(X)}{20r\ell}\right)={\mathbb{% P}}\left(Z_{\ell}\geq\frac{{\mathbb{E}}(X)}{20r\ell\,{\mathbb{E}}(Y_{\ell})}{% \mathbb{E}}(Y_{\ell})\right)<e^{\frac{-{\mathbb{E}}(X)}{20r\ell}(\log(\frac{{% \mathbb{E}}(X)}{20r\ell\,{\mathbb{E}}(Y_{\ell})})-1)}<e^{-{\mathbb{E}}(X)}.blackboard_P ( italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT β‰₯ divide start_ARG blackboard_E ( italic_X ) end_ARG start_ARG 20 italic_r roman_β„“ end_ARG ) = blackboard_P ( italic_Z start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT β‰₯ divide start_ARG blackboard_E ( italic_X ) end_ARG start_ARG 20 italic_r roman_β„“ blackboard_E ( italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) end_ARG blackboard_E ( italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) ) < italic_e start_POSTSUPERSCRIPT divide start_ARG - blackboard_E ( italic_X ) end_ARG start_ARG 20 italic_r roman_β„“ end_ARG ( roman_log ( divide start_ARG blackboard_E ( italic_X ) end_ARG start_ARG 20 italic_r roman_β„“ blackboard_E ( italic_Y start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT ) end_ARG ) - 1 ) end_POSTSUPERSCRIPT < italic_e start_POSTSUPERSCRIPT - blackboard_E ( italic_X ) end_POSTSUPERSCRIPT .

The standard Chernoff bound gives ℙ⁒(X≀𝔼⁒(X)/2)<eβˆ’π”Όβ’(X)/8ℙ𝑋𝔼𝑋2superscript𝑒𝔼𝑋8{\mathbb{P}}(X\leq{\mathbb{E}}(X)/2)<e^{-{\mathbb{E}}(X)/8}blackboard_P ( italic_X ≀ blackboard_E ( italic_X ) / 2 ) < italic_e start_POSTSUPERSCRIPT - blackboard_E ( italic_X ) / 8 end_POSTSUPERSCRIPT so altogether we obtain ℙ⁒(Aβ„“Β―)<eβˆ’π”Όβ’(X)/9β„™Β―subscript𝐴ℓsuperscript𝑒𝔼𝑋9{\mathbb{P}}(\overline{A_{\ell}})<e^{-{\mathbb{E}}(X)/9}blackboard_P ( overΒ― start_ARG italic_A start_POSTSUBSCRIPT roman_β„“ end_POSTSUBSCRIPT end_ARG ) < italic_e start_POSTSUPERSCRIPT - blackboard_E ( italic_X ) / 9 end_POSTSUPERSCRIPT. Using the union bound, the probability that there exists an S𝑆Sitalic_S that fails (6) is at most

βˆ‘s=t1βˆ’Ξ΄tβˆ’1(ts)⁒eβˆ’(srβˆ’1)⁒(tβˆ’s)⁒p/9<elog⁑t+(tβˆ’s)⁒log⁑tβˆ’(srβˆ’1)⁒(tβˆ’s)⁒p/9.superscriptsubscript𝑠superscript𝑑1𝛿𝑑1binomial𝑑𝑠superscript𝑒binomialπ‘ π‘Ÿ1𝑑𝑠𝑝9superscript𝑒𝑑𝑑𝑠𝑑binomialπ‘ π‘Ÿ1𝑑𝑠𝑝9\sum_{s=t^{1-\delta}}^{t-1}{t\choose s}e^{-{s\choose r-1}(t-s)p/9}<e^{\log t+(% t-s)\log t-{s\choose r-1}(t-s)p/9}.βˆ‘ start_POSTSUBSCRIPT italic_s = italic_t start_POSTSUPERSCRIPT 1 - italic_Ξ΄ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ( binomial start_ARG italic_t end_ARG start_ARG italic_s end_ARG ) italic_e start_POSTSUPERSCRIPT - ( binomial start_ARG italic_s end_ARG start_ARG italic_r - 1 end_ARG ) ( italic_t - italic_s ) italic_p / 9 end_POSTSUPERSCRIPT < italic_e start_POSTSUPERSCRIPT roman_log italic_t + ( italic_t - italic_s ) roman_log italic_t - ( binomial start_ARG italic_s end_ARG start_ARG italic_r - 1 end_ARG ) ( italic_t - italic_s ) italic_p / 9 end_POSTSUPERSCRIPT .

The power of t𝑑titalic_t in srβˆ’1⁒psuperscriptπ‘ π‘Ÿ1𝑝s^{r-1}pitalic_s start_POSTSUPERSCRIPT italic_r - 1 end_POSTSUPERSCRIPT italic_p is at least 1βˆ’r+1/2⁒r+(1βˆ’Ξ΄)⁒(rβˆ’1)>01π‘Ÿ12π‘Ÿ1π›Ώπ‘Ÿ101-r+1/2r+(1-\delta)(r-1)>01 - italic_r + 1 / 2 italic_r + ( 1 - italic_Ξ΄ ) ( italic_r - 1 ) > 0 as Ξ΄<1/2⁒r2𝛿12superscriptπ‘Ÿ2\delta<1/2r^{2}italic_Ξ΄ < 1 / 2 italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and hence the quantity above vanishes for large t𝑑titalic_t. We conclude that (6) holds in Fβˆ—superscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT βˆ— end_POSTSUPERSCRIPT with high probability. ∎

References

  • [1] M. Ajtai, J. KomlΓ³s and E. SzemerΓ©di, A note on Ramsey numbers, J. Comb. Theory (Series A), 29 (1980), 354–360.
  • [2] J. Balogh, C. Chen, H. Luo, On the maximum size of F𝐹Fitalic_F-free induced subgraphs in Ktsubscript𝐾𝑑K_{t}italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT-free graphs, preprint
  • [3] F. A. Behrend, On the sets of integers which contain no three in arithmetic progression. Proc. Nat. Acad. Sci., 23 (1946), 331–332.
  • [4] T. Bohman and P. Keevash, Dynamic concentration of the triangle-free process, Random Struct. Alg. 58 (2021), 221–293.
  • [5] D. Conlon, S. Mattheus, D. Mubayi, J. A. VerstraΓ«te, Ramsey numbers and the Zarankiewicz problem, Bull. Lond. Math. Soc. 56, 6 (2024), 2014–2023.
  • [6] P. ErdΕ‘s, P. Frankl, V. RΓΆdl, The asymptotic number of graphs not containing a fixed subgraph and a problem for hypergraphs having no exponent, Graphs and Combinatorics 2, (1986), 113–121.
  • [7] P. ErdΕ‘s, C. A. Rogers, The construction of certain graphs, Canad. J. Math 14 (1962), 702–707.
  • [8] G. Fiz Pontiveros, S. Griffiths and R. Morris The triangle-free process and the Ramsey numbers, Mem. Amer. Math. Soc., 263 (2020), 125pp.
  • [9] O. Janzer, B. Sudakov, Improved bounds for the ErdΕ‘s-Rogers (s,s+2)𝑠𝑠2(s,s+2)( italic_s , italic_s + 2 )-problem, Preprint (2023) https://arxiv.longhoe.net/pdf/2307.05441.pdf.
  • [10] J. H. Kim, The Ramsey Number R⁒(3,t)𝑅3𝑑R(3,t)italic_R ( 3 , italic_t ) has order of magnitude t2/log⁑tsuperscript𝑑2𝑑t^{2}/\log titalic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / roman_log italic_t, Random Structures and Algorithms 7 (1995), 173–207.
  • [11] M. Krivelevich, Bounding Ramsey numbers through large deviation inequalities, Random Structures Algorithms 7 (1995), 145–155.
  • [12] Y. Li, W. Zang, The Independence Number of Graphs with a Forbidden Cycle and Ramsey Numbers, Journal of Combinatorial Optimization, Springer, vol. 7(4), pages 353–359.
  • [13] A. Lubotzky, R. Phillips, P. Sarnak, Ramanujan graphs. Combinatorica 8 (1988), no. 3, 261–277.
  • [14] S. Mattheus, J. A. VerstraΓ«te, The asymptotics of r⁒(4,t)π‘Ÿ4𝑑r(4,t)italic_r ( 4 , italic_t ), Annals of Math. 199 (2024), no. 2 919–941
  • [15] D. Mubayi, J. A. Verstraete, A note on pseudorandom Ramsey graphs, J. Eur. Math. Soc. (JEMS) 26 (2024), no. 1, 153-161.
  • [16] D. Mubayi, J. A. VerstraΓ«te, On the order of ErdΕ‘s-Rogers functions, https://arxiv.longhoe.net/abs/2401.02548
  • [17] J. Shearer, On the independence number of sparse graphs, Random Structures and Algorithms 7 (1995), 269–271.
  • [18] J. Shearer, A note on the independence number of triangle-free graphs, Discrete Mathematics 46 no 1. (1983) 83–87.
  • [19] J. Spencer, TurΓ‘n’s theorem for kπ‘˜kitalic_k-graphs, Discrete Mathematics 2 no. 2, (1972), 183–186.
  • [20] B. Sudakov, A note on odd cycle-complete graph Ramsey numbers. Electron. J. Combin. 9 (2002), no. 1, Note 1, 4 pp.