Search | arXiv e-print repository

arXiv:2406.19678 [pdf, other]

doi 10.31256/HSMR2024.60

UltraGelBot: Autonomous Gel Dispenser for Robotic Ultrasound

Authors: Deepak Raina, Ziming Zhao, Richard Voyles, Juan Wachs, Subir K. Saha, S. H. Chandrashekhara

Abstract: Telerobotic and Autonomous Robotic Ultrasound Systems (RUS) help reduce operator dependency in free-hand ultrasound examinations. However, the state-of-the-art RUSs still rely on a human operator to apply the ultrasound gel. The lack of standardization in this process often leads to poor imaging of the scanned region, caused by air gaps between the probe and the human body. In this paper, we developed an end-of-arm tool for RUS, referred to as UltraGelBot. This bot can autonomously detect and dispense the gel. It uses a deep learning model to detect the gel from images acquired using an on-board camera. A motorized mechanism is also developed, which uses this feedback to dispense the gel. Experiments on a phantom revealed that UltraGelBot increases the acquired image quality by $18.6\%$ and reduces the procedure time by $37.2\%$.

Submitted 28 June, 2024; originally announced June 2024.

Comments: 2024 16th Hamlyn Symposium on Medical Robotics (HSMR)

arXiv:2405.00540 [pdf, other]

Heat, Health, and Habitats: Analyzing the Intersecting Risks of Climate and Demographic Shifts in Austrian Districts

Authors: Hannah Schuster, Axel Polleres, Amin Anjomshoaa, Johannes Wachs

Abstract: The impact of hot weather on health outcomes of a population is mediated by a variety of factors, including its age profile and local green infrastructure. The combination of warming due to climate change and demographic aging suggests that heat-related health outcomes will deteriorate in the coming decades. Here, we measure the relationship between weekly all-cause mortality and heat days in Austrian districts using a panel dataset covering $2015-2022$. An additional day reaching $30$ degrees is associated with a $2.4\%$ increase in mortality per $1000$ inhabitants during summer. This association is roughly doubled in districts with a two standard deviation above average share of the population over $65$. Using forecasts of hot days (RCP) and demographics in $2050$, we observe that districts will have elderly populations and hot days $2-5$ standard deviations above the current mean in just $25$ years. This predicts a drastic increase in heat-related mortality. At the same time, district green scores, measured using $10\times 10$ meter resolution satellite images of residential areas, significantly moderate the relationship between heat and mortality. Thus, although local policies likely cannot reverse warming or demographic trends, they can take measures to mediate the health consequences of these growing risks, which are highly heterogeneous across regions, even in Austria.

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2404.18833 [pdf, other]

The dynamics of leadership and success in software development teams

Authors: Lorenzo Betti, Luca Gallo, Johannes Wachs, Federico Battiston

Abstract: From science to industry, teamwork plays a crucial role in knowledge production and innovation. Most studies consider teams as static groups of individuals, thereby failing to capture how the micro-dynamics of collaborative processes and organizational changes determine team success. Here, we leverage fine-grained temporal data on software development teams to gain insights into the dynamics of online collaborative projects. Our analysis reveals an uneven workload distribution in teams, with stronger heterogeneity correlated with higher success, and the early emergence of a lead developer carrying out the majority of work. Moreover, we find that a sizeable fraction of projects experience a change of lead developer, with such a transition being more likely in projects led by inexperienced users. Finally, we show that leadership change is associated with faster success growth, in particular for the least successful projects. Our work contributes to a deeper understanding of the link between team evolution and success in collaborative processes.

Submitted 29 April, 2024; originally announced April 2024.

arXiv:2310.07392 [pdf, other]

doi 10.1109/ISMR57123.2023.10130193

Deep Kernel and Image Quality Estimators for Optimizing Robotic Ultrasound Controller using Bayesian Optimization

Authors: Deepak Raina, SH Chandrashekhara, Richard Voyles, Juan Wachs, Subir Kumar Saha

Abstract: Ultrasound is a commonly used medical imaging modality that requires expert sonographers to manually maneuver the ultrasound probe based on the acquired image. Autonomous Robotic Ultrasound (A-RUS) is an appealing alternative to this manual procedure in order to reduce sonographers' workload. The key challenge to A-RUS is optimizing the ultrasound image quality for the region of interest across different patients. This requires knowledge of anatomy, recognition of error sources and precise probe position, orientation and pressure. Sample efficiency is important while optimizing these parameters associated with the robotized probe controller. Bayesian Optimization (BO), a sample-efficient optimization framework, has recently been applied to optimize the 2D motion of the probe. Nevertheless, further work is needed to improve the sample efficiency for high-dimensional control of the probe. We aim to overcome this problem by using a neural network to learn a low-dimensional kernel in BO, termed Deep Kernel (DK). The neural network of DK is trained using probe and image data acquired during the procedure. Two image quality estimators are proposed that use a deep convolutional neural network and provide real-time feedback to the BO. We validated our framework using these two feedback functions on three urinary bladder phantoms. We obtained over 50% increase in sample efficiency for 6D control of the robotized probe. Furthermore, our results indicate that this performance enhancement in BO is independent of the specific training dataset, demonstrating inter-patient adaptability.

Submitted 11 October, 2023; originally announced October 2023.

Comments: Accepted in IEEE International Symposium on Medical Robotics (ISMR) 2023

Journal ref: IEEE International Symposium on Medical Robotics (ISMR) 2023

arXiv:2310.03406 [pdf, other]

doi 10.1109/CASE56687.2023.10260479

RUSOpt: Robotic UltraSound Probe Normalization with Bayesian Optimization for In-plane and Out-plane Scanning

Authors: Deepak Raina, Abhishek Mathur, Richard M. Voyles, Juan Wachs, SH Chandrashekhara, Subir Kumar Saha

Abstract: One of the significant challenges faced by autonomous robotic ultrasound systems is acquiring high-quality images across different patients. The proper orientation of the robotized probe plays a crucial role in governing the quality of ultrasound images. To address this challenge, we propose a sample-efficient method to automatically adjust the orientation of the ultrasound probe normal to the point of contact on the scanning surface, thereby improving the acoustic coupling of the probe and resulting image quality. Our method utilizes Bayesian Optimization (BO) based search on the scanning surface to efficiently search for the normalized probe orientation. We formulate a novel objective function for BO that leverages the contact force measurements and underlying mechanics to identify the normal. We further incorporate a regularization scheme in BO to handle the noisy objective function. The performance of the proposed strategy has been assessed through experiments on urinary bladder phantoms. These phantoms included planar, tilted, and rough surfaces, and were examined using both linear and convex probes with varying search space limits. Further, simulation-based studies have been carried out using 3D human mesh models. The results demonstrate that the mean ($\pm$SD) absolute angular error averaged over all phantoms and 3D models is $\boldsymbol{2.4\pm0.7^\circ}$ and $\boldsymbol{2.1\pm1.3^\circ}$, respectively.

Submitted 5 October, 2023; originally announced October 2023.

Comments: Accepted in IEEE International Conference on Automation Science and Engineering (CASE) 2023

Journal ref: IEEE International Conference on Automation Science and Engineering (CASE) 2023

arXiv:2307.07367 [pdf, other]

Are Large Language Models a Threat to Digital Public Goods? Evidence from Activity on Stack Overflow

Authors: Maria del Rio-Chanona, Nadzeya Laurentsyeva, Johannes Wachs

Abstract: Large language models like ChatGPT efficiently provide users with information about various topics, presenting a potential substitute for searching the web and asking people for help online. But since users interact privately with the model, these models may drastically reduce the amount of publicly available human-generated data and knowledge resources. This substitution can present a significant problem in securing training data for future models. In this work, we investigate how the release of ChatGPT changed human-generated open data on the web by analyzing the activity on Stack Overflow, the leading online Q&A platform for computer programming. We find that relative to its Russian and Chinese counterparts, where access to ChatGPT is limited, and to similar forums for mathematics, where ChatGPT is less capable, activity on Stack Overflow significantly decreased. A difference-in-differences model estimates a 16% decrease in weekly posts on Stack Overflow. This effect increases in magnitude over time, and is larger for posts related to the most widely used programming languages. Posts made after ChatGPT receive voting scores similar to those made before, suggesting that ChatGPT is not merely displacing duplicate or low-quality content. These results suggest that more users are adopting large language models to answer questions and they are better substitutes for Stack Overflow for languages for which they have more training data. Using models like ChatGPT may be more efficient for solving certain programming problems, but its widespread adoption and the resulting shift away from public exchange on the web will limit the open data people and models can learn from in the future.

Submitted 14 July, 2023; originally announced July 2023.

arXiv:2307.02442 [pdf, other]

doi 10.1109/ICRA48891.2023.10161542

Robotic Sonographer: Autonomous Robotic Ultrasound using Domain Expertise in Bayesian Optimization

Authors: Deepak Raina, SH Chandrashekhara, Richard Voyles, Juan Wachs, Subir Kumar Saha

Abstract: Ultrasound is a vital imaging modality utilized for a variety of diagnostic and interventional procedures. However, an expert sonographer is required to make accurate maneuvers of the probe over the human body while making sense of the ultrasound images for diagnostic purposes. This procedure requires a substantial amount of training and up to a few years of experience. In this paper, we propose an autonomous robotic ultrasound system that uses Bayesian Optimization (BO) in combination with domain expertise to predict and effectively scan the regions where diagnostic quality ultrasound images can be acquired. The quality map, which is a distribution of image quality in a scanning region, is estimated using a Gaussian process in BO. This relies on a prior quality map modeled using an expert's demonstration of the high-quality probing maneuvers. The ultrasound image quality feedback is provided to BO, which is estimated using a deep convolutional neural network model. This model was previously trained on a database of images labelled for diagnostic quality by expert radiologists. Experiments on three different urinary bladder phantoms validated that the proposed autonomous ultrasound system can acquire ultrasound images for diagnostic purposes with a probing position and force accuracy of 98.7% and 97.8%, respectively.

Submitted 5 July, 2023; originally announced July 2023.

Comments: Accepted in IEEE International Conference on Robotics and Automation (ICRA) 2023

arXiv:2307.02250 [pdf, other]

Stress-testing Road Networks and Access to Medical Care

Authors: Hannah Schuster, Axel Polleres, Johannes Wachs

Abstract: This research studies how populations depend on road networks for access to health care during crises or natural disasters. So far, most researchers have studied the accessibility of the whole network or the cost of network disruptions in general, rather than as a function of the accessibility of specific priority destinations like hospitals. Even short delays in accessing healthcare can have significant adverse consequences. We carry out a comprehensive stress test of the entire Austrian road network from this perspective. We simplify the whole network into one consisting of what we call accessibility corridors, deleting single corridors to evaluate the change in accessibility of populations to healthcare. The data created by our stress test was used to generate an importance ranking of the corridors. The findings suggest that certain road segments and corridors are orders of magnitude more important in terms of access to hospitals than the typical one. Our method also highlights vulnerable municipalities and hospitals that may experience demand surges as populations are cut off from their usual nearest hospitals. Even though the skewed importance of some corridors highlights vulnerabilities, they provide policymakers with a clear agenda.

Submitted 5 July, 2023; originally announced July 2023.

arXiv:2306.15684 [pdf, other]

Understanding (Ir)rational Herding Online

Authors: Henry K. Dambanemuya, Johannes Wachs, Emőke-Ágnes Horvát

Abstract: Investigations of social influence in collective decision-making have become possible due to recent technologies and platforms that record interactions in far larger groups than could be studied before. Herding and its impact on decision-making are critical areas of practical interest and research study. However, despite theoretical work suggesting that it matters whether individuals choose who to imitate based on cues such as experience or whether they herd at random, there is little empirical analysis of this distinction. To demonstrate the distinction between what the literature calls "rational" and "irrational" herding, we use data on tens of thousands of loans from a well-established online peer-to-peer (p2p) lending platform. First, we employ an empirical measure of memory in complex systems to measure herding in lending. Then, we illustrate a network-based approach to visualize herding. Finally, we model the impact of herding on collective outcomes. Our study reveals that loan performance is not solely determined by whether the lenders engage in herding or not. Instead, the interplay between herding and the imitated lenders' prior success on the platform predicts loan outcomes. In short, herds led by expert lenders tend to pick loans that do not default. We discuss the implications of this under-explored aspect of herding for platform designers, borrowers, and lenders. Our study advances collective intelligence theories based on a case of high-stakes group decision-making online.

Submitted 22 June, 2023; originally announced June 2023.

ACM Class: J.4

arXiv:2210.13535 [pdf, other]

Human-centered XAI for Burn Depth Characterization

Authors: Maxwell J. Jacobson, Daniela Chanci Arrubla, Maria Romeo Tricas, Gayle Gordillo, Yexiang Xue, Chandan Sen, Juan Wachs

Abstract: Approximately 1.25 million people in the United States are treated each year for burn injuries. Precise burn injury classification is an important aspect of the medical AI field. In this work, we propose an explainable human-in-the-loop framework for improving burn ultrasound classification models. Our framework leverages an explanation system based on the LIME classification explainer to corroborate and integrate a burn expert's knowledge -- suggesting new features and ensuring the validity of the model. Using this framework, we discover that B-mode ultrasound classifiers can be enhanced by supplying textural features. More specifically, we confirm that texture features based on the Gray Level Co-occurrence Matrix (GLCM) of ultrasound frames can increase the accuracy of transfer learned burn depth classifiers. We test our hypothesis on real data from porcine subjects. We show improvements in the accuracy of burn depth classification -- from ~88% to ~94% -- once modified according to our framework.

Submitted 2 January, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

arXiv:2209.01041 [pdf, other]

Digital Traces of Brain Drain: Developers during the Russian Invasion of Ukraine

Authors: Johannes Wachs

Abstract: The Russian invasion of Ukraine has caused large scale destruction, significant loss of life, and the displacement of millions of people. Besides those fleeing direct conflict in Ukraine, many individuals in Russia are also thought to have moved to third countries. In particular the exodus of skilled human capital, sometimes called brain drain, out of Russia may have a significant effect on the course of the war and the Russian economy in the long run. Yet quantifying brain drain, especially during crisis situations is generally difficult. This hinders our ability to understand its drivers and to anticipate its consequences. To address this gap, I draw on and extend a large scale dataset of the locations of highly active software developers collected in February 2021, one year before the invasion. Revisiting those developers that had been located in Russia in 2021, I confirm an ongoing exodus of developers from Russia in snapshots taken in June and November 2022. By November 11.1% of Russian developers list a new country, compared with 2.8% of developers from comparable countries in the region but not directly involved in the conflict. 13.2% of Russian developers have obscured their location (vs. 2.4% in the comparison set). Developers leaving Russia were significantly more active and central in the collaboration network than those who remain. This suggests that many of the most important developers have already left Russia. In some receiving countries the number of arrivals is significant: I estimate an increase in the number of local software developers of 42% in Armenia, 60% in Cyprus and 94% in Georgia.

Submitted 26 January, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

arXiv:2207.07436 [pdf, other]

Specialization in Criminal Careers

Authors: Georg Heiler, Tuan Pham, Jan Korbel, Johannes Wachs, Stefan Thurner

Abstract: We use a comprehensive longitudinal dataset on criminal acts over five years in a European country to study specialization in criminal careers. We cluster crime categories by their relative co-occurrence within criminal careers, deriving a natural, data-based taxonomy of criminal specialization. Defining specialists as active criminals who stay within one category of offending behavior, we study their socio-demographic attributes, geographic range, and positions in their collaboration networks, relative to their generalist counterparts. In comparison to generalists, specialists tend to be older, more likely to be female, operate within a smaller geographic range, and collaborate in smaller, more tightly-knit local networks. We observe that specialists are more intensely embedded in criminal networks and find evidence that specialization indeed reflects division of labor and organization.

Submitted 15 July, 2022; originally announced July 2022.

arXiv:2205.04268 [pdf, other]

Modeling Interconnected Social and Technical Risks in Open Source Software Ecosystems

Authors: William Schueller, Johannes Wachs

Abstract: Open source software ecosystems consist of thousands of interdependent libraries, which users can combine to great effect. Recent work has pointed out two kinds of risks in these systems: that technical problems like bugs and vulnerabilities can spread through dependency links, and that relatively few developers are responsible for maintaining even the most widely used libraries. However, a more holistic diagnosis of systemic risk in software ecosystems should consider how these social and technical sources of risk interact and amplify one another. Motivated by the observation that the same individuals maintain several libraries within dependency networks, we present a methodological framework to measure risk in software ecosystems as a function of both dependencies and developers. In our models, a library's chance of failure increases as its developers leave and as its upstream dependencies fail. We apply our method to data from the Rust ecosystem, highlighting several systemically important libraries that are overlooked when only considering technical dependencies. We compare potential interventions, seeking better ways to deploy limited developer resources with a view to improving overall ecosystem health and software supply chain resilience.

Submitted 10 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

arXiv:2205.03597 [pdf, other]

doi 10.1038/s41597-022-01819-z

Evolving Collaboration, Dependencies, and Use in the Rust Open Source Software Ecosystem

Authors: William Schueller, Johannes Wachs, Vito D. P. Servedio, Stefan Thurner, Vittorio Loreto

Abstract: Open-source software (OSS) is widely spread in industry, research, and government. OSS represents an effective development model because it harnesses the decentralized efforts of many developers in a way that scales. As OSS developers work independently on interdependent modules, they create a larger cohesive whole in the form of an ecosystem, leaving traces of their contributions and collaborations. Data harvested from these traces enable the study of large-scale decentralized collaborative work. We present curated data on the activity of tens of thousands of developers in the Rust ecosystem and the evolving dependencies between their libraries. The data covers seven years of developer contributions to Rust libraries and can be used to reconstruct the ecosystem's development history, such as growing developer collaboration networks or dependency networks. These are complemented by statistics on downloads and popularity, tracking the dynamics of use and success over time. Altogether the data give a comprehensive view of several dimensions of the ecosystem.

Submitted 23 November, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

Journal ref: Scientific Data 9, 703 (2022)

arXiv:2204.06905 [pdf, other]

Making Markets for Information Security: The Role of Online Platforms in Bug Bounty Programs

Authors: Johannes Wachs

Abstract: Security is an essential cornerstone of functioning digital marketplaces and communities. If users doubt that data shared online will remain secure, they will withdraw from platforms. Even when firms take these risks seriously, security expertise is expensive and vulnerabilities are diverse in nature. Increasingly, firms and governments are turning to bug bounty programs (BBPs) to crowdsource their cybersecurity, in which they pay individuals for reporting vulnerabilities in their systems. And while the use of BBPs has grown significantly in recent years, research on the actors in this market and their incentives remains limited. Using the lens of transaction cost economics, this paper examines the incentives of firms and researchers (sometimes called hackers) participating in BBPs. We study the crucial role that centralized platforms that organize BBPs play in this emerging market. We carry out an analysis of the HackerOne BBP platform, using a novel dataset on over 14,000 researchers reporting over 125,000 public vulnerabilities to over 500 firms from 2014 to the end of 2021. We outline how platforms like HackerOne make a market for information security vulnerabilities by reducing information asymmetries and their associated transaction costs.

Submitted 14 April, 2022; originally announced April 2022.

arXiv:2201.11880 [pdf, other]

Computational Approaches to the Study of Corruption

Authors: Isabela Villamil, János Kertész, Johannes Wachs

Abstract: Studying corruption presents unique challenges. Recent work in the spirit of computational social science exploits newly available data and methods to give a fresh perspective on this important topic. In this chapter we highlight some of these works, describing how they provide insights into classic social scientific questions about the structure and dynamics of corruption in society from micro to macro scales. We argue that corruption is fruitfully understood as a collective action problem that happens between embedded people and organizations. Computational methods like network science and agent-based modeling can give insights into such situations. We also present various (big) data sources that have been exploited to study corruption. We conclude by highlighting work in adjacent fields, for instance on the problems of collusion, tax evasion, organized crime, and the darkweb, and promising avenues for future work.

Submitted 2 February, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

Comments: 14 pages, 3 figures

arXiv:2109.03976 [pdf, other]

doi 10.1109/TRO.2022.3182487

Active Multi-Object Exploration and Recognition via Tactile Whiskers

Authors: Chenxi Xiao, Shujia Xu, Wenzhuo Wu, Juan Wachs

Abstract: Robotic exploration under uncertain environments is challenging when optical information is not available. In this paper, we propose an autonomous solution of exploring an unknown task space based on tactile sensing alone. We first designed a whisker sensor based on MEMS barometer devices. This sensor can acquire contact information by interacting with the environment non-intrusively. This sensor is accompanied by a planning technique to generate exploration trajectories by using mere tactile perception. This technique relies on a hybrid policy for tactile exploration, which includes a proactive informative path planner for object searching, and a reactive Hopf oscillator for contour tracing. Results indicate that the hybrid exploration policy can increase the efficiency of object discovery. Last, scene understanding was facilitated by segmenting objects and classification. A classifier was developed to recognize the object categories based on the geometric features collected by the whisker sensor. Such an approach demonstrates the whisker sensor, together with the tactile intelligence, can provide sufficiently discriminative features to distinguish objects.

Submitted 2 July, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

Comments: Copyright 20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

arXiv:2107.03200 [pdf, other]

doi 10.1016/j.techfore.2022.121478

The Geography of Open Source Software: Evidence from GitHub

Authors: Johannes Wachs, Mariusz Nitecki, William Schueller, Axel Polleres

Abstract: Open Source Software (OSS) plays an important role in the digital economy. Yet although software production is amenable to remote collaboration and its outputs are easily shared across distances, software development seems to cluster geographically in places such as Silicon Valley, London, or Berlin. And while recent work indicates that OSS activity creates positive externalities which accrue locally through knowledge spillovers and information effects, up-to-date data on the geographic distribution of active open source developers is limited. This presents a significant blindspot for policymakers, who tend to promote OSS at the national level as a cost-saving tool for public sector institutions. We address this gap by geolocating more than half a million active contributors to GitHub in early 2021 at various spatial scales. Compared to results from 2010, we find a significant increase in the share of developers based in Asia, Latin America and Eastern Europe, suggesting a more even spread of OSS developers globally. Within countries, however, we find significant concentration in regions, exceeding the concentration of workers in high-tech fields. Social and economic development indicators predict at most half of regional variation in OSS activity in the EU, suggesting that clusters of OSS have idiosyncratic roots. We argue that policymakers seeking to foster OSS should focus locally rather than nationally, using the tools of cluster policy to support networks of OSS developers.

Submitted 12 October, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

Journal ref: Technological Forecasting and Social Change (2022)

arXiv:2103.17054 [pdf, other]

doi 10.1109/MSR52588.2021.00053

Mining DEV for social and technical insights about software development

Authors: Maria Papoutsoglou, Johannes Wachs, Georgia M. Kapitsaki

Abstract: Software developers are social creatures: they communicate, collaborate, and promote their work in a variety of channels. Twitter, GitHub, Stack Overflow, and other platforms offer developers opportunities to network and exchange ideas. Researchers analyze content on these sites to learn about trends and topics in software engineering. However, insight mined from the text of Stack Overflow questions or GitHub issues is highly focused on detailed and technical aspects of software development. In this paper, we present a relatively new online community for software developers called DEV. On DEV users write long-form posts about their experiences, preferences, and working life in software, zooming out from specific issues and files to reflect on broader topics. About 50,000 users have posted over 140,000 articles related to software development. In this work, we describe the content of posts on DEV using a topic model, showing that developers discuss a rich variety and mixture of social and technical aspects of software development. We show that developers use DEV to promote themselves and their work: 83% link their profiles to their GitHub profiles and 56% to their Twitter profiles. 14% of users pin specific GitHub repos in their profiles. We argue that DEV is emerging as an important hub for software developers, and a valuable source of insight for researchers to complement data from platforms like GitHub and Stack Overflow.

Submitted 16 May, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

Comments: To appear in the Proceedings of the 18th International Conference on Mining Software Repositories (MSR 2021)

arXiv:2103.01296 [pdf, other]

Learning Multimodal Contact-Rich Skills from Demonstrations Without Reward Engineering

Authors: Mythra V. Balakuntala, Upinder Kaur, Xin Ma, Juan Wachs, Richard M. Voyles

Abstract: Everyday contact-rich tasks, such as peeling, cleaning, and writing, demand multimodal perception for effective and precise task execution. However, these present a novel challenge to robots as they lack the ability to combine these multimodal stimuli for performing contact-rich tasks. Learning-based methods have attempted to model multi-modal contact-rich tasks, but they often require extensive training examples and task-specific reward functions, which limits their practicality and scope. Hence, we propose a generalizable model-free learning-from-demonstration framework for robots to learn contact-rich skills without explicit reward engineering. We present a novel multi-modal sensor data representation which improves the learning performance for contact-rich skills. We performed training and experiments using the real-life Sawyer robot for three everyday contact-rich skills: cleaning, writing, and peeling. Notably, the framework achieves a success rate of 100% for the peeling and writing skills, and 80% for the cleaning skill. Hence, this skill learning framework can be extended for learning other physical manipulation skills.

Submitted 14 April, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

Comments: Accepted at IEEE ICRA 2021. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2101.05044 [pdf, other]

Publishing patterns reflect political polarization in news media

Authors: Nick Hagar, Johannes Wachs, Emőke-Ágnes Horvát

Abstract: Digital news outlets rely on a variety of outside contributors, from freelance journalists, to political commentators, to executives and politicians. These external dependencies create a network among news outlets, traced along the contributors they share. Using connections between outlets, we demonstrate how contributors' publishing trajectories tend to align with outlet political leanings. We also show how polarized clustering of outlets translates to differences in the topics of news covered and the style and tone of articles published. In addition, we demonstrate how contributors who cross partisan divides tend to focus on less explicitly political topics. This work addresses an important gap in the media polarization literature, by highlighting how structural factors on the production side of news media create an ecosystem shaped by political leanings, independent of the priorities of any one person or organization.

Submitted 14 January, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

arXiv:2101.02683 [pdf, other]

doi 10.1016/j.techfore.2021.120747

Does Crowdfunding Really Foster Innovation? Evidence from the Board Game Industry

Authors: Johannes Wachs, Balazs Vedres

Abstract: Crowdfunding offers inventors and entrepreneurs alternative access to resources with which they can develop and realize their ideas. Besides helping to secure capital, crowdfunding also connects creators with engaged early supporters who provide public feedback. But does this process foster truly innovative outcomes? Does the proliferation of crowdfunding in an industry make it more innovative overall? Prior studies investigating the link between crowdfunding and innovation do not compare traditional and crowdfunded products and so while claims that crowdfunding supports innovation are theoretically sound, they lack empirical backing. We address this gap using a unique dataset of board games, an industry with significant crowdfunding activity in recent years. Each game is described by how it combines fundamental mechanisms such as dice-rolling, negotiation, and resource-management, from which we develop quantitative measures of innovation in game design. Using these measures to compare games, we find that crowdfunded games tend to be more distinctive from previous games than their traditionally published counterparts. They are also significantly more likely to implement novel combinations of mechanisms. Crowdfunded games are not just transient experiments: subsequent games imitate their novel ideas. These results hold in regression models controlling for game and designer-level confounders.
Our findings demonstrate that the innovative potential of crowdfunding goes beyond individual products to entire industries, as new ideas spill over to traditionally funded products.

Submitted 10 May, 2021; v1 submitted 7 January, 2021; originally announced January 2021.

Journal ref: Technological Forecasting and Social Change, Volume 168, 2021

arXiv:2012.00781 [pdf, other]

Pose-based Sign Language Recognition using GCN and BERT

Authors: Anirudh Tunga, Sai Vidyaranya Nuthalapati, Juan Wachs

Abstract: Sign language recognition (SLR) plays a crucial role in bridging the communication gap between the hearing and vocally impaired community and the rest of the society. Word-level sign language recognition (WSLR) is the first important step towards understanding and interpreting sign language. However, recognizing signs from videos is a challenging task as the meaning of a word depends on a combination of subtle body motions, hand configurations, and other movements. Recent pose-based architectures for WSLR either model both the spatial and temporal dependencies among the poses in different frames simultaneously or only model the temporal information without fully utilizing the spatial information. We tackle the problem of WSLR using a novel pose-based approach, which captures spatial and temporal information separately and performs late fusion. Our proposed architecture explicitly captures the spatial interactions in the video using a Graph Convolutional Network (GCN). The temporal dependencies between the frames are captured using Bidirectional Encoder Representations from Transformers (BERT). Experimental results on WLASL, a standard word-level sign language recognition dataset, show that our model significantly outperforms the state-of-the-art on pose-based methods, improving prediction accuracy by up to 5%.

Submitted 1 December, 2020; originally announced December 2020.

arXiv:2011.15100 [pdf]

From the DESK (Dexterous Surgical Skill) to the Battlefield -- A Robotics Exploratory Study

Authors: Glebys T. Gonzalez, Upinder Kaur, Masudur Rahma, Vishnunandan Venkatesh, Natalia Sanchez, Gregory Hager, Yexiang Xue, Richard Voyles, Juan Wachs

Abstract: Short response time is critical for future military medical operations in austere settings or remote areas. Such effective patient care at the point of injury can greatly benefit from the integration of semi-autonomous robotic systems. To achieve autonomy, robots would require massive libraries of maneuvers. While this is possible in controlled settings, obtaining surgical data in austere settings can be difficult. Hence, in this paper, we present the Dexterous Surgical Skill (DESK) database for knowledge transfer between robots. The peg transfer task was selected as it is one of 6 main tasks of laparoscopic training. Also, we provide an ML framework to evaluate novel transfer learning methodologies on this database. The collected DESK dataset comprises a set of surgical robotic skills using the four robotic platforms: Taurus II, simulated Taurus II, YuMi, and the da Vinci Research Kit. Then, we explored two different learning scenarios: no-transfer and domain-transfer. In the no-transfer scenario, the training and testing data were obtained from the same domain; whereas in the domain-transfer scenario, the training data is a blend of simulated and real robot data that is tested on a real robot. Using simulation data enhances the performance of the real robot where limited or no real data is available. The transfer model showed an accuracy of 81% for the YuMi robot when the ratio of real-to-simulated data was 22%-78%. For Taurus II and da Vinci robots, the model showed an accuracy of 97.5% and 93% respectively, training only with simulation data.
Results indicate that simulation can be used to augment training data to enhance the performance of models in real scenarios. This shows the potential for future use of surgical data from the operating room in deployable surgical robots in remote areas.

Submitted 30 November, 2020; originally announced November 2020.

Comments: First 3 authors share equal contribution

Journal ref: Published in MHSRS 2020

arXiv:2010.04025 [pdf, other]

From Asking to Answering: Getting More Involved on Stack Overflow

Authors: Timur Bachschi, Aniko Hannak, Florian Lemmerich, Johannes Wachs

Abstract: Online knowledge platforms such as Stack Overflow and Wikipedia rely on a large and diverse contributor community. Despite efforts to facilitate onboarding of new users, relatively few users become core contributors, suggesting the existence of barriers or hurdles that hinder full involvement in the community. This paper investigates such issues on Stack Overflow, a widely popular question and answer community for computer programming. We document evidence of a "leaky pipeline", specifically that there are many active users on the platform who never post an answer. Using this as a starting point, we investigate potential factors that can be linked to the transition of new contributors from asking questions to posting answers. We find a user's individual features, such as their tenure, gender, and geographic location, as well as features of the subcommunity in which they are most active, such as its size and the prevalence of negative social feedback, have a significant relationship with their likelihood to post answers. By measuring and modeling these relationships our paper presents a first look at the challenges and obstacles to user promotion along the pipeline of contributions in online communities.

Submitted 8 October, 2020; originally announced October 2020.

arXiv:2009.10947 [pdf, other]

Pose Imitation Constraints for Collaborative Robots

Authors: Glebys Gonzalez, Juan Wachs

Abstract: Achieving human-like motion in robots has been a fundamental goal in many areas of robotics research. Inverse kinematic (IK) solvers have been explored as a solution to provide kinematic structures with anthropomorphic movements. In particular, numeric solvers based on geometry, such as FABRIK, have shown potential for producing human-like motion at a low computational cost. Nevertheless, these methods have shown limitations when solving for robot kinematic constraints. This work proposes a framework inspired by FABRIK for human pose imitation in real-time. The goal is to mitigate the problems of the original algorithm while retaining the resulting humanlike fluidity and low cost. We first propose a human constraint model for pose imitation. Then, we present a pose imitation algorithm (PIC), and its soft version (PICs) that can successfully imitate human poses using the proposed constraint system. PIC was tested on two collaborative robots (Baxter and YuMi). Fifty human demonstrations were collected for a bi-manual assembly and an incision task. Then, two performance metrics were obtained for both robots: pose accuracy with respect to the human and the percentage of environment occlusion/obstruction. The performance of PIC and PICs was compared against the numerical solver baseline (FABRIK). The proposed algorithms achieve a higher pose accuracy than FABRIK for both tasks (25%-FABRIK, 53%-PICs, 58%-PICs). In addition, PIC and its soft version achieve a lower percentage of occlusion during incision (10%-FABRIK, 4%-PICs, 9%-PICs).
These results indicate that the PIC method can reproduce human poses and achieve key desired effects of human imitation.

Submitted 12 October, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

Comments: 9 pages, 8 figures, 3 tables

arXiv:2008.12364 [pdf]

doi 10.1038/s42254-020-0238-9

Complexity science approach to economic crime

Authors: János Kertész, Johannes Wachs

Abstract: In this comment we discuss how complexity science and network science are particularly useful for identifying and describing the hidden traces of economic misbehaviour such as fraud and corruption.

Submitted 27 August, 2020; originally announced August 2020.

Comments: Preprint of published comment in Nat. Rev. Phys

Journal ref: Nature Review Physics 2020

arXiv:2006.02371 [pdf, other]

doi 10.1109/ICSE43902.2021.00058

How Gamification Affects Software Developers: Cautionary Evidence from a Natural Experiment on GitHub

Authors: Lukas Moldon, Markus Strohmaier, Johannes Wachs

Abstract: We examine how the behavior of software developers changes in response to removing gamification elements from GitHub, an online platform for collaborative programming and software development. We find that the unannounced removal of daily activity streak counters from the user interface (from user profile pages) was followed by significant changes in behavior. Long-running streaks of activity were abandoned and became less common. Weekend activity decreased and days in which developers made a single contribution became less common. Synchronization of streaking behavior in the platform's social network also decreased, suggesting that gamification is a powerful channel for social influence. Focusing on a set of software developers that were publicly pursuing a goal to make contributions for 100 days in a row, we find that some of these developers abandon this quest following the removal of the public streak counter. Our findings provide evidence for the significant impact of gamification on the behavior of developers on large collaborative programming and software development platforms. They urge caution: gamification can steer the behavior of software developers in unexpected and unwanted directions.

Submitted 10 May, 2021; v1 submitted 3 June, 2020; originally announced June 2020.

Comments: To appear in the proceedings of the 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)

arXiv:2004.02809 [pdf, other]

DAISI: Database for AI Surgical Instruction

Authors: Edgar Rojas-Muñoz, Kyle Couperus, Juan Wachs

Abstract: Telementoring surgeons as they perform surgery can be essential in the treatment of patients when in situ expertise is not available. Nonetheless, expert mentors are often unavailable to provide trainees with real-time medical guidance. When mentors are unavailable, a fallback autonomous mechanism should provide medical practitioners with the required guidance. However, AI/autonomous mentoring in medicine has been limited by the availability of generalizable prediction models, and surgical procedures datasets to train those models with. This work presents the initial steps towards the development of an intelligent artificial system for autonomous medical mentoring. Specifically, we present the first Database for AI Surgical Instruction (DAISI). DAISI leverages images and instructions to provide step-by-step demonstrations of how to perform procedures from various medical disciplines. The dataset was acquired from real surgical procedures and data from academic textbooks. We used DAISI to train an encoder-decoder neural network capable of predicting medical instructions given a current view of the surgery. Afterwards, the instructions predicted by the network were evaluated using cumulative BLEU scores and input from expert physicians. According to the BLEU scores, the predicted and ground truth instructions were as high as 67% similar. Additionally, expert physicians subjectively assessed the algorithm using a Likert scale, and considered that the predicted descriptions were related to the images.
This work provides a baseline for AI algorithms to assist in autonomous medical mentoring.

Submitted 22 March, 2020; originally announced April 2020.

Comments: 10 pages, 4 figures, to access database, see https://engineering.purdue.edu/starproj/_daisi

arXiv:2003.00856 [pdf, other]

doi 10.1109/WACV48630.2021.00087

Triangle-Net: Towards Robustness in Point Cloud Learning

Authors: Chenxi Xiao, Juan Wachs

Abstract: Three dimensional (3D) object recognition is becoming a key desired capability for many computer vision systems such as autonomous vehicles, service robots and surveillance drones to operate more effectively in unstructured environments. These real-time systems require effective classification methods that are robust to various sampling resolutions, noisy measurements, and unconstrained pose configurations. Previous research has shown that points' sparsity, rotation and positional inherent variance can lead to a significant drop in the performance of point cloud based classification techniques. However, neither of them is sufficiently robust to multifactorial variance and significant sparsity. In this regard, we propose a novel approach for 3D classification that can simultaneously achieve invariance towards rotation, positional shift, scaling, and is robust to point sparsity. To this end, we introduce a new feature that utilizes graph structure of point clouds, which can be learned end-to-end with our proposed neural network to acquire a robust latent representation of the 3D object. We show that such latent representations can significantly improve the performance of object classification and retrieval tasks when points are sparse. Further, we show that our approach outperforms PointNet and 3DmFV by 35.0% and 28.1% respectively in ModelNet 40 classification tasks using sparse point clouds of only 16 points under arbitrary SO(3) rotation.

Submitted 23 August, 2021; v1 submitted 27 February, 2020; originally announced March 2020.

Comments: WACV 2021

Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2021, pp. 826-835

arXiv:2001.09955 [pdf, other]

The Effects of Gender Signals and Performance in Online Product Reviews

Authors: Sandipan Sikdar, Rachneet Singh Sachdeva, Johannes Wachs, Florian Lemmerich, Markus Strohmaier

Abstract: This work quantifies the effects of signaling and performing gender on the success of reviews written on the popular Amazon shopping platform. Highly rated reviews play an important role in e-commerce since they are prominently displayed below products. Differences in how gender-signaling and gender-performing review authors are received can lead to important biases in what content and perspectives are represented among top reviews. To investigate this, we extract signals of author gender from user names, distinguishing reviews where the author's likely gender can be inferred. Using reviews authored by these gender-signaling authors, we train a deep-learning classifier to quantify the gendered writing style or gendered performance of reviews written by authors who do not send clear gender signals via their user name. We contrast the effects of gender signaling and performance on review success using matching experiments. While we find no general trend that gendered signals or performances influence overall review success, we find strong context-specific effects. For example, reviews in product categories such as Electronics or Computers are perceived as less helpful when authors signal that they are likely women, but are received as more helpful in categories such as Beauty or Clothing. In addition to these interesting findings, our work provides a general chain of tools for studying gender-specific effects across various social media platforms.

Submitted 28 January, 2020; v1 submitted 27 January, 2020; originally announced January 2020.

arXiv:1910.10140 [pdf, other]

Gesture Agreement Assessment Using Description Vectors

Authors: Naveen Madapana, Glebys Gonzalez, Juan Wachs

Abstract: Participatory design is a popular design technique that involves the end users in the early stages of the design process to obtain user-friendly gestural interfaces. Guessability studies followed by agreement analyses are often used to elicit and comprehend the preferences (or gestures/proposals) of the participants. Previous approaches to assessing agreement grouped the gestures into equivalence classes and ignored the integral properties that are shared between them. In this work, we represent the gestures using binary description vectors to allow them to be partially similar. In this context, we introduce a new metric referred to as soft agreement rate (SAR) to quantify the level of consensus between the participants. In addition, we performed computational experiments to study the behavior of our partial agreement formula and mathematically show that existing agreement metrics are a special case of our approach. Our methodology was evaluated through a gesture elicitation study conducted with a group of neurosurgeons. Nevertheless, our formulation can be applied to any other user-elicitation study. Results show that the level of agreement obtained by the SAR metric is 2.64 times higher than that of the existing metrics. In addition to the most agreed gesture, the SAR formulation also provides the most agreed descriptors, which can potentially help the designers to come up with a final gesture set.

Submitted 22 October, 2019; originally announced October 2019.

Comments: 5 pages and 3 figures

arXiv:1909.11414 [pdf, other]

doi 10.1038/s41467-021-21465-0

Inequality is rising where social network segregation interacts with urban topology

Authors: Gergő Tóth, Johannes Wachs, Riccardo Di Clemente, Ákos Jakobi, Bence Ságvári, János Kertész, Balázs Lengyel

Abstract: Social networks amplify inequalities due to fundamental mechanisms of social tie formation such as homophily and triadic closure. These forces sharpen social segregation reflected in network fragmentation. Yet, little is known about what structural factors facilitate fragmentation. In this paper we use big data from a widely-used online social network to demonstrate that there is a significant relationship between social network fragmentation and income inequality in cities and towns. We find that the organization of the physical urban space has a stronger relationship with fragmentation than unequal access to education, political segregation, or the presence of ethnic and religious minorities. Fragmentation of social networks is significantly higher in towns in which residential neighborhoods are divided by physical barriers such as rivers and railroads and are relatively distant from the center of town. Towns in which amenities are spatially concentrated are also typically more socially segregated. These relationships suggest how urban planning may be a useful point of intervention to mitigate inequalities in the long run.

Submitted 25 September, 2019; originally announced September 2019.

Journal ref: Nature Communications, 12, 1143 (2021)

arXiv:1909.08664 [pdf, other]

Corruption Risk in Contracting Markets: A Network Science Perspective

Authors: Johannes Wachs, Mihály Fazekas, János Kertész

Abstract: We use methods from network science to analyze corruption risk in a large administrative dataset of over 4 million public procurement contracts from European Union member states covering the years 2008-2016. By mapping procurement markets as bipartite networks of issuers and winners of contracts we can visualize and describe the distribution of corruption risk. We study the structure of these networks in each member state, identify their cores and find that highly centralized markets tend to have higher corruption risk. In all EU countries we analyze, corruption risk is significantly clustered. However, these risks are sometimes more prevalent in the core and sometimes in the periphery of the market, depending on the country. This suggests that the same level of corruption risk may have entirely different distributions. Our framework is both diagnostic and prescriptive: it roots out where corruption is likely to be prevalent in different markets and suggests that different anti-corruption policies are needed in different countries.

Submitted 18 September, 2019; originally announced September 2019.

arXiv:1906.08667 [pdf, other]

doi 10.1038/s41598-019-47198-1

A network approach to cartel detection in public auction markets

Authors: Johannes Wachs, János Kertész

Abstract: Competing firms can increase profits by setting prices collectively, imposing significant costs on consumers. Such groups of firms are known as cartels and because this behavior is illegal, their operations are secretive and difficult to detect. Cartels face a significant internal obstacle: members face short-run incentives to cheat. Here we present a network-based framework to detect potential cartels in bidding markets based on the idea that the chance a group of firms can overcome this obstacle and sustain cooperation depends on the patterns of its interactions. We create a network of firms based on their co-bidding behavior, detect interacting groups, and measure their cohesion and exclusivity, two group-level features of their collective behavior. Applied to a market for school milk, our method detects a known cartel and calculates that it has high cohesion and exclusivity. In a comprehensive set of nearly 150,000 public contracts awarded by the Republic of Georgia from 2011 to 2016, detected groups with high cohesion and exclusivity are significantly more likely to display traditional markers of cartel behavior. We replicate this relationship between group topology and the emergence of cooperation in a simulation model. Our method presents a scalable, unsupervised method to find groups of firms in bidding markets ideally positioned to form lasting cartels.

Submitted 20 June, 2019; originally announced June 2019.

Journal ref: Scientific Reports, 2019

arXiv:1905.04841 [pdf, other]

Extending Policy from One-Shot Learning through Coaching

Authors: Mythra V. Balakuntala, Vishnunandan L. N. Venkatesh, Jyothsna Padmakumar Bindu, Richard M. Voyles, Juan Wachs

Abstract: Humans generally teach their fellow collaborators to perform tasks through a small number of demonstrations. The learnt task is corrected or extended to meet specific task goals by means of coaching. Adopting a similar framework for teaching robots through demonstrations and coaching makes teaching tasks highly intuitive. Unlike traditional Learning from Demonstration (LfD) approaches which require multiple demonstrations, we present a one-shot learning from demonstration approach to learn tasks. The learnt task is corrected and generalized using two layers of evaluation/modification. First, the robot self-evaluates its performance and corrects the performance to be closer to the demonstrated task. Then, coaching is used as a means to extend the policy learnt to be adaptable to varying task goals. Both the self-evaluation and coaching are implemented using reinforcement learning (RL) methods. Coaching is achieved through human feedback on desired goal and action modification to generalize to specified task goals. The proposed approach is evaluated with a scooping task, by presenting a single demonstration. The self-evaluation framework aims to reduce the resistance to scooping in the media. To reduce the search space for RL, we bootstrap the search using least resistance path obtained using resistive force theory. Coaching is used to generalize the learnt task policy to transfer the desired quantity of material. Thus, the proposed method provides a framework for learning tasks from one demonstration and generalizing it using human feedback through coaching.

Submitted 12 May, 2019; originally announced May 2019.

arXiv:1903.00959 [pdf, other]

DESK: A Robotic Activity Dataset for Dexterous Surgical Skills Transfer to Medical Robots

Authors: Naveen Madapana, Md Masudur Rahman, Natalia Sanchez-Tamayo, Mythra V. Balakuntala, Glebys Gonzalez, Jyothsna Padmakumar Bindu, L. N. Vishnunandan Venkatesh, Xingguang Zhang, Juan Barragan Noguera, Thomas Low, Richard Voyles, Yexiang Xue, Juan Wachs

Abstract: Datasets are an essential component for training effective machine learning models. In particular, surgical robotic datasets have been key to many advances in semi-autonomous surgeries, skill assessment, and training. Simulated surgical environments can enhance the data collection process by making it faster, simpler and cheaper than real systems. In addition, combining data from multiple robotic domains can provide rich and diverse training data for transfer learning algorithms. In this paper, we present the DESK (Dexterous Surgical Skill) dataset. It comprises a set of surgical robotic skills collected during a surgical training task using three robotic platforms: the Taurus II robot, Taurus II simulated robot, and the YuMi robot. This dataset was used to test the idea of transferring knowledge across different domains (e.g. from Taurus to YuMi robot) for a surgical gesture classification task with seven gestures. We explored three different scenarios: 1) No transfer, 2) Transfer from simulated Taurus to real Taurus and 3) Transfer from Simulated Taurus to the YuMi robot. We conducted extensive experiments with three supervised learning models and provided baselines in each of these scenarios. Results show that using simulation data during training enhances the performance on the real robot where limited real data is available. In particular, we obtained an accuracy of 55% on the real Taurus data using a model that is trained only on the simulator data. Furthermore, we achieved an accuracy improvement of 34% when 3% of the real data is added into the training process.

Submitted 3 March, 2019; originally announced March 2019.

Comments: 8 pages, 5 figures, 4 tables, submitted to IROS 2019 conference

arXiv:1811.11539 [pdf, other]

doi 10.1007/s10664-019-09685-x

Gender Differences in Participation and Reward on Stack Overflow

Authors: Anna May, Johannes Wachs, Aniko Hannak

Abstract: Programming is a valuable skill in the labor market, making the underrepresentation of women in computing an increasingly important issue. Online question and answer platforms serve a dual purpose in this field: they form a body of knowledge useful as a reference and learning tool, and they provide opportunities for individuals to demonstrate credible, verifiable expertise. Issues such as male-oriented site design or overrepresentation of men among the site's elite may therefore compound the issue of women's underrepresentation in IT. In this paper we audit the differences in behavior and outcomes between men and women on Stack Overflow, the most popular of these Q&A sites. We observe significant differences in how men and women participate in the platform and how successful they are. For example, the average woman has roughly half of the reputation points, the primary measure of success on the site, of the average man. Using an Oaxaca-Blinder decomposition, an econometric technique commonly applied to analyze differences in wages between groups, we find that most of the gap in success between men and women can be explained by differences in their activity on the site and differences in how these activities are rewarded. Specifically, 1) men give more answers than women and 2) are rewarded more for their answers on average, even when controlling for possible confounders such as tenure or buy-in to the site. Women ask more questions and gain more reward per question. We conclude with a hypothetical redesign of the site's scoring system based on these behavioral differences, cutting the reputation gap in half.

Submitted 28 November, 2018; originally announced November 2018.

Journal ref: Empirical Software Engineering 2019

arXiv:1811.05058 [pdf]

Electrophysiological indicators of gesture perception

Authors: Maria E. Cabrera, Keisha Novak, Dan Foti, Richard Voyles, Juan P. Wachs

Abstract: Background: While there has been abundant research concerning neurological responses to gesture generation, the time course of gesture processing is not well understood. Specifically, it is not clear if or how particular characteristics within the kinematic execution of gestures capture attention and aid in the classification of gestures with communicative intent. If indeed key features of gestures with perceptual saliency exist, such features could help form the basis of a compact representation of the gestures in memory. Methods: This study used a set of available gesture videos as stimuli. The timing for salient features of performed gestures was determined by isolating inflection points in the hands' motion trajectories. Participants passively viewed the gesture videos while continuous EEG data was collected. We focused on mu oscillations (10 Hz) and used linear regression to test for associations between the timing of mu oscillations and inflection points in motion trajectories. Results: Peaks in the EEG signals at central and occipital electrodes were used to isolate the salient events within each gesture. EEG power oscillations were detected 343 and 400 ms on average after inflection points at occipital and central electrodes, respectively. A regression model showed that inflection points in the motion trajectories strongly predicted subsequent mu oscillations (R^2=0.961, p<.01). Conclusion: The results suggest that coordinated activity in the visual and motor cortices is highly correlated with key motion components within gesture trajectories. These points may be associated with neural signatures used to encode gestures in memory for later identification and even recognition.

Submitted 12 November, 2018; originally announced November 2018.

Comments: 29 pages, 8 figures, 1 table

arXiv:1810.05485 [pdf, other]

doi 10.1098/rsos.182103

Social capital predicts corruption risk in towns

Authors: Johannes Wachs, Taha Yasseri, Balázs Lengyel, János Kertész

Abstract: Corruption is a social plague: gains accrue to small groups, while its costs are borne by everyone. Significant variation in its level between and within countries suggests a relationship between social structure and the prevalence of corruption, yet large-scale empirical studies thereof have been missing due to lack of data. In this paper we relate the structural characteristics of social capital of towns with corruption in their local governments. Using datasets from Hungary, we quantify corruption risk by suppressed competition and lack of transparency in the town's awarded public contracts. We characterize social capital using social network data from a popular online platform. Controlling for social, economic, and political factors, we find that settlements with fragmented social networks, indicating an excess of \textit{bonding social capital}, have higher corruption risk, and towns with more diverse external connectivity, suggesting a surplus of \textit{bridging social capital}, are less exposed to corruption. We interpret fragmentation as fostering in-group favoritism and conformity, which increase corruption, while diversity facilitates impartiality in public life and stifles corruption.

Submitted 12 October, 2018; originally announced October 2018.

Comments: Submitted

Journal ref: Royal Society Open Science, 2019

arXiv:1807.11096 [pdf]

Spiking Neural Networks for Early Prediction in Human Robot Collaboration

Authors: Tian Zhou, Juan P. Wachs

Abstract: This paper introduces the Turn-Taking Spiking Neural Network (TTSNet), which is a cognitive model to perform early turn-taking prediction about a human's or agent's intentions. The TTSNet framework relies on implicit and explicit multimodal communication cues (physical, neurological and physiological) to be able to predict when the turn-taking event will occur in a robust and unambiguous fashion. To test the theories proposed, the TTSNet framework was implemented on an assistant robotic nurse, which predicts the surgeon's turn-taking intentions and delivers surgical instruments accordingly. Experiments were conducted to evaluate TTSNet's performance in early turn-taking prediction. It was found to reach an F1 score of 0.683 given 10% of completed action, and an F1 score of 0.852 at 50% and 0.894 at 100% of the completed action. This performance exceeded that of multiple state-of-the-art algorithms, and surpassed human performance when limited partial observation is given (< 40%). Such early turn-taking prediction capability would allow robots to perform collaborative actions proactively, in order to facilitate collaboration and increase team efficiency.

Submitted 29 July, 2018; originally announced July 2018.

Comments: Under review for journal

arXiv:1804.05705 [pdf, other]

doi 10.1145/3201064.3201088

And Now for Something Completely Different: Visual Novelty in an Online Network of Designers

Authors: Johannes Wachs, Bálint Daróczy, Anikó Hannák, Katinka Páll, Christoph Riedl

Abstract: Novelty is a key ingredient of innovation but quantifying it is difficult. This is especially true for visual work like graphic design. Using designs shared on an online social network of professional digital designers, we measure visual novelty using statistical learning methods to compare an image's features with those of images that have been created before. We then relate social network position to the novelty of the designers' images. We find that on this professional platform, users with dense local networks tend to produce more novel but generally less successful images, with important exceptions. Namely, users making novel images while embedded in cohesive local networks are more successful.

Submitted 23 April, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

Comments: accepted to 10th International ACM Web Science Conference, 2018, May 27-30, Amsterdam, The Netherlands, 11 pages, 6 figures, 60 references

arXiv:1709.09276 [pdf, other]

Early Turn-taking Prediction with Spiking Neural Networks for Human Robot Collaboration

Authors: Tian Zhou, Juan P. Wachs

Abstract: Turn-taking is essential to the structure of human teamwork. Humans are typically aware of team members' intention to keep or relinquish their turn before a turn switch, where the responsibility of working on a shared task is shifted. Future co-robots are also expected to provide such competence. To that end, this paper proposes the Cognitive Turn-taking Model (CTTM), which leverages cognitive models (i.e., Spiking Neural Network) to achieve early turn-taking prediction. The CTTM framework can process multimodal human communication cues (both implicit and explicit) and predict human turn-taking intentions in an early stage. The proposed framework is tested on a simulated surgical procedure, where a robotic scrub nurse predicts the surgeon's turn-taking intention. It was found that the proposed CTTM framework outperforms the state-of-the-art turn-taking prediction algorithms by a large margin. It also outperforms humans when presented with partial observations of communication cues (i.e., less than 40% of full actions). This early prediction capability enables robots to initiate turn-taking actions at an early stage, which facilitates collaboration and increases overall efficiency.

Submitted 26 September, 2017; originally announced September 2017.

Comments: Submitted to IEEE International Conference on Robotics and Automation (ICRA) 2018

arXiv:1709.09269 [pdf, other]

Early Prediction for Physical Human Robot Collaboration in the Operating Room

Authors: Tian Zhou, Juan P. Wachs

Abstract: To enable a natural and fluent human robot collaboration flow, it is critical for a robot to comprehend its human peers' on-going actions, predict their behaviors in the near future, and plan its actions correspondingly. Specifically, the capability of making early predictions is important, so that the robot can foresee the precise timing of a turn-taking event and start motion planning and execution early enough to smooth the turn-taking transition. Such proactive behavior would reduce the human's waiting time, increase efficiency and enhance naturalness in collaborative tasks. To that end, this paper presents the design and implementation of an early turn-taking prediction algorithm, catered for physical human robot collaboration scenarios. Specifically, a Robotic Scrub Nurse (RSN) system which can comprehend the surgeon's multimodal communication cues and perform turn-taking prediction is presented. The developed algorithm was tested on a collected data set of simulated surgical procedures in a surgeon-nurse tandem. The proposed turn-taking prediction algorithm is found to be significantly superior to its algorithmic counterparts, and is more accurate than the human baseline when little partial input is given (less than 30% of full action). After observing more information, the algorithm can achieve performance comparable to humans, with an F1 score of 0.90.

Submitted 26 September, 2017; originally announced September 2017.

arXiv:1705.02972 [pdf, ps, other]

Why Do Men Get More Attention? Exploring Factors Behind Success in an Online Design Community

Authors: Johannes Wachs, Anikó Hannák, András Vörös, Bálint Daróczy

Abstract: Online platforms are an increasingly popular tool for people to produce, promote or sell their work. However, recent studies indicate that social disparities and biases present in the real world might transfer to online platforms and could be exacerbated by seemingly harmless design choices on the site (e.g., recommendation systems or publicly visible success measures). In this paper we analyze an exclusive online community of teams of design professionals called Dribbble and investigate apparent differences in outcomes by gender. Overall, we find that men produce more work, and are able to show it to a larger audience, thus receiving more likes. Some of this effect can be explained by the fact that women have different skills and design different images. Most importantly, however, women and men position themselves differently in the Dribbble community. Our investigation of users' position in the social network shows that women have more clustered and gender homophilous following relations, which leads them to have smaller and more closely knit social networks. Overall, our study demonstrates that looking behind the apparent patterns of gender inequalities in online markets with the help of social networks and product differentiation helps us to better understand gender differences in success and failure.

Submitted 8 May, 2017; originally announced May 2017.

Comments: in The International AAAI Conference on Web and Social Media (ICWSM2017), Montreal, May 2017

Journal ref: ICWSM 2017

arXiv:1704.05090 [pdf]

Communication Modalities for Supervised Teleoperation in Highly Dexterous Tasks - Does one size fit all?

Authors: Tian Zhou, Maria E. Cabrera, Juan P. Wachs

Abstract: This study tries to explain the connection between communication modalities and levels of supervision in teleoperation during a dexterous task, like surgery. This concept is applied to two surgical related tasks: incision and peg transfer. It was found that as the complexity of the task escalates, the combination linking human supervision with a more expressive modality shows better performance than other combinations of modalities and control. More specifically, in the peg transfer task, the combination of speech modality and action level supervision achieves shorter task completion time (77.1 ± 3.4 s) with fewer mistakes (0.20 ± 0.17 pegs dropped).

Submitted 17 April, 2017; originally announced April 2017.

Comments: Previously published online at 2nd Workshop on the Role of Human Sensorimotor Control in Surgical Robotics at 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany

arXiv:1701.05924 [pdf]

Coherency in One-Shot Gesture Recognition

Authors: Maria Cabrera, Richard Voyles, Juan Wachs

Abstract: Users' intentions may be expressed through spontaneous gestures, which may have been seen only a few times or never before. Recognizing such gestures involves one shot gesture learning. While most research has focused on the recognition of the gestures themselves, recently new approaches were proposed to deal with gesture perception and production as part of the same problem. The framework presented in this work focuses on learning the process that leads to gesture generation, rather than mining the gesture's associated features. This is achieved using kinematic, cognitive and biomechanic characteristics of human interaction. These factors enable the artificial production of realistic gesture samples originated from a single observation. The generated samples are then used as training sets for different state-of-the-art classifiers. Performance is obtained first by observing the machines' gesture recognition percentages. Then, performance is computed by the human recognition from gestures performed by robots. Based on these two scenarios, a composite new metric of coherency is proposed relating to the amount of agreement between these two conditions. Experimental results provide an average recognition performance of 89.2% for the trained classifiers and 92.5% for the participants. Coherency in recognition was determined at 93.6%. While this new metric is not directly comparable to raw accuracy or other pure performance-based standard metrics, it provides a quantifier for validating how realistic the machine generated samples are and how accurate the resulting mimicry is.

Submitted 20 January, 2017; originally announced January 2017.

Comments: This paper was submitted to an IEEE conference

arXiv:1701.05921 [pdf]

What makes a gesture a gesture? Neural signatures involved in gesture recognition

Authors: Maria Cabrera, Keisha Novak, Daniel Foti, Richard Voyles, Juan Wachs

Abstract: Previous work in the area of gesture production has made the assumption that machines can replicate "human-like" gestures by connecting a bounded set of salient points in the motion trajectory. Those inflection points were hypothesized to also display cognitive saliency. The purpose of this paper is to validate that claim using electroencephalography (EEG). That is, this paper attempts to find neural signatures of gestures (also referred to as placeholders) in human cognition, which facilitate the understanding, learning and repetition of gestures. Further, it is discussed whether there is a direct mapping between the placeholders and kinematic salient points in the gesture trajectories. These are expressed as relationships between inflection points in the gestures' trajectories with oscillatory mu rhythms (8-12 Hz) in the EEG. This is achieved by correlating fluctuations in mu power during gesture observation with salient motion points found for each gesture. Peaks in the EEG signal at central electrodes (motor cortex) and occipital electrodes (visual cortex) were used to isolate the salient events within each gesture. We found that a linear model predicting mu peaks from motion inflections fits the data well. Increases in EEG power were detected 380 and 500 ms after inflection points at occipital and central electrodes, respectively. These results suggest that coordinated activity in visual and motor cortices is sensitive to motion trajectories during gesture observation, and it is consistent with the proposal that inflection points operate as placeholders in gesture recognition.

Submitted 20 January, 2017; originally announced January 2017.

Comments: This work has been submitted to an IEEE conference and is awaiting a decision

Showing 1–48 of 48 results for author: Wachs, J