-
Insect Diversity Estimation in Polarimetric Lidar
Authors:
Dolores Bernenko,
Meng Li,
Hampus MÃ¥nefjord,
Samuel Jansson,
Anna Runemark,
Carsten Kirkeby,
Mikkel Brydegaard
Abstract:
Identification of insects in flight is a particular challenge for ecologists in several settings with no other method able to count and classify insects at the pace of entomological lidar. Thus, it can play a unique role as a non-intrusive diagnostic tool to assess insect biodiversity, inform planning, and evaluate mitigation efforts aimed at tackling declines in insect abundance and diversity. Wh…
▽ More
Identification of insects in flight is a particular challenge for ecologists in several settings with no other method able to count and classify insects at the pace of entomological lidar. Thus, it can play a unique role as a non-intrusive diagnostic tool to assess insect biodiversity, inform planning, and evaluate mitigation efforts aimed at tackling declines in insect abundance and diversity. While species richness of co-existing insects could reach tens of thousands, to date, photonic sensors and lidars can differentiate roughly one hundred signal types. This taxonomic specificity or number of discernible signal types is currently limited by instrumentation and algorithm sophistication. In this study we report 32,533 observations of wild flying insects along a 500-meter transect. We report the benefits of lidar polarization bands for differentiating species and compare the performance of two unsupervised clustering algorithms, namely Hierarchical Cluster Analysis and Gaussian Mixture Model. We demonstrate that polarimetric properties could be partially predicted even with unpolarized light, thus polarimetric lidar bands provide only a minor improvement in specificity. Finally, we use physical properties of the clustered observation, such as wing beat frequency, daily activity patterns, and spatial distribution, to establish a lower bound for the number of species represented by the differentiated signal types.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Exploring 3D community inconsistency in human chromosome contact networks
Authors:
Dolores Bernenko,
Sang Hoon Lee,
Ludvig Lizana
Abstract:
Researchers developed chromosome capture methods such as Hi-C to better understand DNA's 3D folding in nuclei. The Hi-C method captures contact frequencies between DNA segment pairs across the genome. When analyzing Hi-C data sets, it is common to group these pairs using standard bioinformatics methods (e.g., PCA). Other approaches handle Hi-C data as weighted networks, where connected node repres…
▽ More
Researchers developed chromosome capture methods such as Hi-C to better understand DNA's 3D folding in nuclei. The Hi-C method captures contact frequencies between DNA segment pairs across the genome. When analyzing Hi-C data sets, it is common to group these pairs using standard bioinformatics methods (e.g., PCA). Other approaches handle Hi-C data as weighted networks, where connected node represent DNA segments in 3D proximity. In this representation, one can leverage community detection techniques developed in complex network theory to group nodes into mesoscale communities containing similar connection patterns. While there are several successful attempts to analyze Hi-C data in this way, it is common to report and study the most typical community structure. But in reality, there are often several valid candidates. Therefore, depending on algorithm design, different community detection methods focusing on slightly different connectivity features may have differing views on the ideal node grou**s. In fact, even the same community detection method may yield different results if using a stochastic algorithm. This ambiguity is fundamental to community detection and shared by most complex networks whenever interactions span all scales in the network. This is known as community inconsistency. This paper explores this inconsistency of 3D communities in Hi-C data for all human chromosomes. We base our analysis on two inconsistency metrics, one local and one global, and quantify the network scales where the community separation is most variable. For example, we find that TADs are less reliable than A/B compartments and that nodes with highly variable node-community memberships are associated with open chromatin. Overall, our study provides a helpful framework for data-driven researchers and increases awareness of some inherent challenges when clustering Hi-C data into 3D communities.
△ Less
Submitted 28 February, 2023;
originally announced February 2023.
-
Map** robust multiscale communities in chromosome contact networks
Authors:
Anton Holmgren,
Dolores Bernenko,
Ludvig Lizana
Abstract:
To better understand DNA's 3D folding in cell nuclei, researchers developed chromosome capture methods such as Hi-C that measure the contact frequencies between all DNA segment pairs across the genome. As Hi-C data sets often are massive, it is common to use bioinformatics methods to group DNA segments into 3D regions with correlated contact patterns, such as Topologically Associated Domains (TADs…
▽ More
To better understand DNA's 3D folding in cell nuclei, researchers developed chromosome capture methods such as Hi-C that measure the contact frequencies between all DNA segment pairs across the genome. As Hi-C data sets often are massive, it is common to use bioinformatics methods to group DNA segments into 3D regions with correlated contact patterns, such as Topologically Associated Domains (TADs) and A/B compartments. Recently, another research direction emerged that treats the Hi-C data as a network of 3D contacts. In this representation, one can use community detection algorithms from complex network theory that group nodes into tightly connected mesoscale communities. However, because Hi-C networks are so densely connected, several node partitions may represent feasible solutions to the community detection problem but are indistinguishable unless including other data. Because this limitation is a fundamental property of the network, this problem persists regardless of the community-finding or data-clustering method. To help remedy this problem, we developed a method that charts the solution landscape of network partitions in Hi-C data from human cells. Our approach allows us to scan seamlessly through the scales of the network and determine regimes where we can expect reliable community structures. We find that some scales are more robust than others and that strong clusters may differ significantly. Our work highlights that finding a robust community structure hinges on thoughtful algorithm design or method cross-evaluation.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.