-
A national longitudinal dataset of skills taught in U.S. higher education curricula
Authors:
Alireza Javadian Sabet,
Sarah H. Bana,
Renzhe Yu,
Morgan R. Frank
Abstract:
Higher education plays a critical role in driving an innovative economy by equip** students with knowledge and skills demanded by the workforce. While researchers and practitioners have developed data systems to track detailed occupational skills, such as those established by the U.S. Department of Labor (DOL), much less effort has been made to document skill development in higher education at a…
▽ More
Higher education plays a critical role in driving an innovative economy by equip** students with knowledge and skills demanded by the workforce. While researchers and practitioners have developed data systems to track detailed occupational skills, such as those established by the U.S. Department of Labor (DOL), much less effort has been made to document skill development in higher education at a similar granularity. Here, we fill this gap by presenting a longitudinal dataset of skills inferred from over three million course syllabi taught at nearly three thousand U.S. higher education institutions. To construct this dataset, we apply natural language processing to extract from course descriptions detailed workplace activities (DWAs) used by the DOL to describe occupations. We then aggregate these DWAs to create skill profiles for institutions and academic majors. Our dataset offers a large-scale representation of college-educated workers and their role in the economy. To showcase the utility of this dataset, we use it to 1) compare the similarity of skills taught and skills in the workforce according to the US Bureau of Labor Statistics, 2) estimate gender differences in acquired skills based on enrollment data, 3) depict temporal trends in the skills taught in social science curricula, and 4) connect college majors' skill distinctiveness to salary differences of graduates. Overall, this dataset can enable new research on the source of skills in the context of workforce development and provide actionable insights for sha** the future of higher education to meet evolving labor demands especially in the face of new technologies.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Brief for the Canada House of Commons Study on the Implications of Artificial Intelligence Technologies for the Canadian Labor Force: Generative Artificial Intelligence Shatters Models of AI and Labor
Authors:
Morgan R. Frank
Abstract:
Exciting advances in generative artificial intelligence (AI) have sparked concern for jobs, education, productivity, and the future of work. As with past technologies, generative AI may not lead to mass unemployment. But, unlike past technologies, generative AI is creative, cognitive, and potentially ubiquitous which makes the usual assumptions of automation predictions ill-suited for today. Exist…
▽ More
Exciting advances in generative artificial intelligence (AI) have sparked concern for jobs, education, productivity, and the future of work. As with past technologies, generative AI may not lead to mass unemployment. But, unlike past technologies, generative AI is creative, cognitive, and potentially ubiquitous which makes the usual assumptions of automation predictions ill-suited for today. Existing projections suggest that generative AI will impact workers in occupations that were previously considered immune to automation. As AI's full set of capabilities and applications emerge, policy makers should promote workers' career adaptability. This goal requires improved data on job separations and unemployment by locality and job titles in order to identify early-indicators for the workers facing labor disruption. Further, prudent policy should incentivize education programs to accommodate learning with AI as a tool while preparing students for the demands of the future of work.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
The Resume Paradox: Greater Language Differences, Smaller Pay Gaps
Authors:
Joshua R. Minot,
Marc Maier,
Bradford Demarest,
Nicholas Cheney,
Christopher M. Danforth,
Peter Sheridan Dodds,
Morgan R. Frank
Abstract:
Over the past decade, the gender pay gap has remained steady with women earning 84 cents for every dollar earned by men on average. Many studies explain this gap through demand-side bias in the labor market represented through employers' job postings. However, few studies analyze potential bias from the worker supply-side. Here, we analyze the language in millions of US workers' resumes to investi…
▽ More
Over the past decade, the gender pay gap has remained steady with women earning 84 cents for every dollar earned by men on average. Many studies explain this gap through demand-side bias in the labor market represented through employers' job postings. However, few studies analyze potential bias from the worker supply-side. Here, we analyze the language in millions of US workers' resumes to investigate how differences in workers' self-representation by gender compare to differences in earnings. Across US occupations, language differences between male and female resumes correspond to 11% of the variation in gender pay gap. This suggests that females' resumes that are semantically similar to males' resumes may have greater wage parity. However, surprisingly, occupations with greater language differences between male and female resumes have lower gender pay gaps. A doubling of the language difference between female and male resumes results in an annual wage increase of $2,797 for the average female worker. This result holds with controls for gender-biases of resume text and we find that per-word bias poorly describes the variance in wage gap. The results demonstrate that textual data and self-representation are valuable factors for improving worker representations and understanding employment inequities.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
Art and the science of generative AI: A deeper dive
Authors:
Ziv Epstein,
Aaron Hertzmann,
Laura Herman,
Robert Mahari,
Morgan R. Frank,
Matthew Groh,
Hope Schroeder,
Amy Smith,
Memo Akten,
Jessica Fjeld,
Hany Farid,
Neil Leach,
Alex Pentland,
Olga Russakovsky
Abstract:
A new class of tools, colloquially called generative AI, can produce high-quality artistic media for visual arts, concept art, music, fiction, literature, video, and animation. The generative capabilities of these tools are likely to fundamentally alter the creative processes by which creators formulate ideas and put them into production. As creativity is reimagined, so too may be many sectors of…
▽ More
A new class of tools, colloquially called generative AI, can produce high-quality artistic media for visual arts, concept art, music, fiction, literature, video, and animation. The generative capabilities of these tools are likely to fundamentally alter the creative processes by which creators formulate ideas and put them into production. As creativity is reimagined, so too may be many sectors of society. Understanding the impact of generative AI - and making policy decisions around it - requires new interdisciplinary scientific inquiry into culture, economics, law, algorithms, and the interaction of technology and creativity. We argue that generative AI is not the harbinger of art's demise, but rather is a new medium with its own distinct affordances. In this vein, we consider the impacts of this new medium on creators across four themes: aesthetics and culture, legal questions of ownership and credit, the future of creative work, and impacts on the contemporary media ecosystem. Across these themes, we highlight key research questions and directions to inform policy and beneficial uses of the technology.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Exposure of occupations to technologies of the fourth industrial revolution
Authors:
Benjamin Meindl,
Morgan R. Frank,
Joana Mendonça
Abstract:
The fourth industrial revolution (4IR) is likely to have a substantial impact on the economy. Companies need to build up capabilities to implement new technologies, and automation may make some occupations obsolete. However, where, when, and how the change will happen remain to be determined. Robust empirical indicators of technological progress linked to occupations can help to illuminate this ch…
▽ More
The fourth industrial revolution (4IR) is likely to have a substantial impact on the economy. Companies need to build up capabilities to implement new technologies, and automation may make some occupations obsolete. However, where, when, and how the change will happen remain to be determined. Robust empirical indicators of technological progress linked to occupations can help to illuminate this change. With this aim, we provide such an indicator based on patent data. Using natural language processing, we calculate patent exposure scores for more than 900 occupations, which represent the technological progress related to them. To provide a lens on the impact of the 4IR, we differentiate between traditional and 4IR patent exposure. Our method differs from previous approaches in that it both accounts for the diversity of task-level patent exposures within an occupation and reflects work activities more accurately. We find that exposure to 4IR patents differs from traditional patent exposure. Manual tasks, and accordingly occupations such as construction and production, are exposed mainly to traditional (non-4IR) patents but have low exposure to 4IR patents. The analysis suggests that 4IR technologies may have a negative impact on job growth; this impact appears 10 to 20 years after patent filing. Further, we compared the 4IR exposure to other automation and AI exposure scores. Whereas many measures refer to theoretical automation potential, our patent-based indicator reflects actual technology diffusion. Our work not only allows analyses of the impact of 4IR technologies as a whole, but also provides exposure scores for more than 300 technology fields, such as AI and smart office technologies. Finally, the work provides a general map** of patents to tasks and occupations, which enables future researchers to construct individual exposure measures.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Industrial Topics in Urban Labor System
Authors:
Jaehyuk Park,
Morgan R. Frank,
Lijun Sun,
Hye** Youn
Abstract:
Categorization is an essential component for us to understand the world for ourselves and to communicate it collectively. It is therefore important to recognize that classification system are not necessarily static, especially for economic systems, and even more so in urban areas where most innovation takes place and is implemented. Out-of-date classification systems would potentially limit furthe…
▽ More
Categorization is an essential component for us to understand the world for ourselves and to communicate it collectively. It is therefore important to recognize that classification system are not necessarily static, especially for economic systems, and even more so in urban areas where most innovation takes place and is implemented. Out-of-date classification systems would potentially limit further understanding of the current economy because things constantly change. Here, we develop an occupation-based classification system for the US labor economy, called industrial topics, that satisfy adaptability and representability. By leveraging the distributions of occupations across the US urban areas, we identify industrial topics - clusters of occupations based on their co-existence pattern. Industrial topics indicate the mechanisms under the systematic allocation of different occupations. Considering the densely connected occupations as an industrial topic, our approach characterizes regional economies by their topical composition. Unlike the existing survey-based top-down approach, our method provides timely information about the underlying structure of the regional economy, which is critical for policymakers and business leaders, especially in our fast-changing economy.
△ Less
Submitted 17 September, 2020;
originally announced September 2020.
-
Generalized Word Shift Graphs: A Method for Visualizing and Explaining Pairwise Comparisons Between Texts
Authors:
Ryan J. Gallagher,
Morgan R. Frank,
Lewis Mitchell,
Aaron J. Schwartz,
Andrew J. Reagan,
Christopher M. Danforth,
Peter Sheridan Dodds
Abstract:
A common task in computational text analyses is to quantify how two corpora differ according to a measurement like word frequency, sentiment, or information content. However, collapsing the texts' rich stories into a single number is often conceptually perilous, and it is difficult to confidently interpret interesting or unexpected textual patterns without looming concerns about data artifacts or…
▽ More
A common task in computational text analyses is to quantify how two corpora differ according to a measurement like word frequency, sentiment, or information content. However, collapsing the texts' rich stories into a single number is often conceptually perilous, and it is difficult to confidently interpret interesting or unexpected textual patterns without looming concerns about data artifacts or measurement validity. To better capture fine-grained differences between texts, we introduce generalized word shift graphs, visualizations which yield a meaningful and interpretable summary of how individual words contribute to the variation between two texts for any measure that can be formulated as a weighted average. We show that this framework naturally encompasses many of the most commonly used approaches for comparing texts, including relative frequencies, dictionary scores, and entropy-based measures like the Kullback-Leibler and Jensen-Shannon divergences. Through several case studies, we demonstrate how generalized word shift graphs can be flexibly applied across domains for diagnostic investigation, hypothesis generation, and substantive interpretation. By providing a detailed lens into textual shifts between corpora, generalized word shift graphs help computational social scientists, digital humanists, and other text analysis practitioners fashion more robust scientific narratives.
△ Less
Submitted 5 August, 2020;
originally announced August 2020.
-
Allotaxonometry and rank-turbulence divergence: A universal instrument for comparing complex systems
Authors:
P. S. Dodds,
J. R. Minot,
M. V. Arnold,
T. Alshaabi,
J. L. Adams,
D. R. Dewhurst,
T. J. Gray,
M. R. Frank,
A. J. Reagan,
C. M. Danforth
Abstract:
Complex systems often comprise many kinds of components which vary over many orders of magnitude in size: Populations of cities in countries, individual and corporate wealth in economies, species abundance in ecologies, word frequency in natural language, and node degree in complex networks. Here, we introduce `allotaxonometry' along with `rank-turbulence divergence' (RTD), a tunable instrument fo…
▽ More
Complex systems often comprise many kinds of components which vary over many orders of magnitude in size: Populations of cities in countries, individual and corporate wealth in economies, species abundance in ecologies, word frequency in natural language, and node degree in complex networks. Here, we introduce `allotaxonometry' along with `rank-turbulence divergence' (RTD), a tunable instrument for comparing any two ranked lists of components. We analytically develop our rank-based divergence in a series of steps, and then establish a rank-based allotaxonograph which pairs a map-like histogram for rank-rank pairs with an ordered list of components according to divergence contribution. We explore the performance of rank-turbulence divergence, which we view as an instrument of `type calculus', for a series of distinct settings including: Language use on Twitter and in books, species abundance, baby name popularity, market capitalization, performance in sports, mortality causes, and job titles. We provide a series of supplementary flipbooks which demonstrate the tunability and storytelling power of rank-based allotaxonometry.
△ Less
Submitted 2 August, 2023; v1 submitted 22 February, 2020;
originally announced February 2020.
-
A common trajectory recapitulated by urban economies
Authors:
Inho Hong,
Morgan R. Frank,
Iyad Rahwan,
Woo-Sung Jung,
Hye** Youn
Abstract:
Is there a general economic pathway recapitulated by individual cities over and over? Identifying such evolution structure, if any, would inform models for the assessment, maintenance, and forecasting of urban sustainability and economic success as a quantitative baseline. This premise seems to contradict the existing body of empirical evidences for path-dependent growth sha** the unique history…
▽ More
Is there a general economic pathway recapitulated by individual cities over and over? Identifying such evolution structure, if any, would inform models for the assessment, maintenance, and forecasting of urban sustainability and economic success as a quantitative baseline. This premise seems to contradict the existing body of empirical evidences for path-dependent growth sha** the unique history of individual cities. And yet, recent empirical evidences and theoretical models have amounted to the universal patterns, mostly size-dependent, thereby expressing many of urban quantities as a set of simple scaling laws. Here, we provide a mathematical framework to integrate repeated cross-sectional data, each of which freezes in time dimension, into a frame of reference for longitudinal evolution of individual cities in time. Using data of over 100 millions employment in thousand business categories between 1998 and 2013, we decompose each city's evolution into a pre-factor and relative changes to eliminate national and global effects. In this way, we show the longitudinal dynamics of individual cities recapitulate the observed cross-sectional regularity. Larger cities are not only scaled-up versions of their smaller peers but also of their past. In addition, our model shows that both specialization and diversification are attributed to the distribution of industry's scaling exponents, resulting a critical population of 1.2 million at which a city makes an industrial transition into innovative economies.
△ Less
Submitted 18 October, 2018;
originally announced October 2018.
-
Small cities face greater impact from automation
Authors:
Morgan R. Frank,
Lijun Sun,
Manuel Cebrian,
Hye** Youn,
Iyad Rahwan
Abstract:
The city has proven to be the most successful form of human agglomeration and provides wide employment opportunities for its dwellers. As advances in robotics and artificial intelligence revive concerns about the impact of automation on jobs, a question looms: How will automation affect employment in cities? Here, we provide a comparative picture of the impact of automation across U.S. urban areas…
▽ More
The city has proven to be the most successful form of human agglomeration and provides wide employment opportunities for its dwellers. As advances in robotics and artificial intelligence revive concerns about the impact of automation on jobs, a question looms: How will automation affect employment in cities? Here, we provide a comparative picture of the impact of automation across U.S. urban areas. Small cities will undertake greater adjustments, such as worker displacement and job content substitutions. We demonstrate that large cities exhibit increased occupational and skill specialization due to increased abundance of managerial and technical professions. These occupations are not easily automatable, and, thus, reduce the potential impact of automation in large cities. Our results pass several robustness checks including potential errors in the estimation of occupational automation and sub-sampling of occupations. Our study provides the first empirical law connecting two societal forces: urban agglomeration and automation's impact on employment.
△ Less
Submitted 21 September, 2017; v1 submitted 16 May, 2017;
originally announced May 2017.
-
The Lexicocalorimeter: Gauging public health through caloric input and output on social media
Authors:
S. E. Alajajian,
J. R. Williams,
A. J. Reagan,
S. C. Alajajian,
M. R. Frank,
L. Mitchell,
J. Lahne,
C. M. Danforth,
P. S. Dodds
Abstract:
We propose and develop a Lexicocalorimeter: an online, interactive instrument for measuring the "caloric content" of social media and other large-scale texts. We do so by constructing extensive yet improvable tables of food and activity related phrases, and respectively assigning them with sourced estimates of caloric intake and expenditure. We show that for Twitter, our naive measures of "caloric…
▽ More
We propose and develop a Lexicocalorimeter: an online, interactive instrument for measuring the "caloric content" of social media and other large-scale texts. We do so by constructing extensive yet improvable tables of food and activity related phrases, and respectively assigning them with sourced estimates of caloric intake and expenditure. We show that for Twitter, our naive measures of "caloric input", "caloric output", and the ratio of these measures are all strong correlates with health and well-being measures for the contiguous United States. Our caloric balance measure in many cases outperforms both its constituent quantities, is tunable to specific health and well-being measures such as diabetes rates, has the capability of providing a real-time signal reflecting a population's health, and has the potential to be used alongside traditional survey data in the development of public policy and collective self-awareness. Because our Lexicocalorimeter is a linear superposition of principled phrase scores, we also show we can move beyond correlations to explore what people talk about in collective detail, and assist in the understanding and explanation of how population-scale conditions vary, a capacity unavailable to black-box type methods.
△ Less
Submitted 10 January, 2017; v1 submitted 17 July, 2015;
originally announced July 2015.
-
Reply to Garcia et al.: Common mistakes in measuring frequency dependent word characteristics
Authors:
P. S. Dodds,
E. M. Clark,
S. Desu,
M. R. Frank,
A. J. Reagan,
J. R. Williams,
L. Mitchell,
K. D. Harris,
I. M. Kloumann,
J. P. Bagrow,
K. Megerdoomian,
M. T. McMahon,
B. F. Tivnan,
C. M. Danforth
Abstract:
We demonstrate that the concerns expressed by Garcia et al. are misplaced, due to (1) a misreading of our findings in [1]; (2) a widespread failure to examine and present words in support of asserted summary quantities based on word usage frequencies; and (3) a range of misconceptions about word usage frequency, word rank, and expert-constructed word lists. In particular, we show that the English…
▽ More
We demonstrate that the concerns expressed by Garcia et al. are misplaced, due to (1) a misreading of our findings in [1]; (2) a widespread failure to examine and present words in support of asserted summary quantities based on word usage frequencies; and (3) a range of misconceptions about word usage frequency, word rank, and expert-constructed word lists. In particular, we show that the English component of our study compares well statistically with two related surveys, that no survey design influence is apparent, and that estimates of measurement error do not explain the positivity biases reported in our work and that of others. We further demonstrate that for the frequency dependence of positivity---of which we explored the nuances in great detail in [1]---Garcia et al. did not perform a reanalysis of our data---they instead carried out an analysis of a different, statistically improper data set and introduced a nonlinearity before performing linear regression.
△ Less
Submitted 28 May, 2015; v1 submitted 25 May, 2015;
originally announced May 2015.
-
Constructing a taxonomy of fine-grained human movement and activity motifs through social media
Authors:
Morgan R. Frank,
Jake Ryland Williams,
Lewis Mitchell,
James P. Bagrow,
Peter Sheridan Dodds,
Christopher M. Danforth
Abstract:
Profiting from the emergence of web-scale social data sets, numerous recent studies have systematically explored human mobility patterns over large populations and large time scales. Relatively little attention, however, has been paid to mobility and activity over smaller time-scales, such as a day. Here, we use Twitter to identify people's frequently visited locations along with their likely acti…
▽ More
Profiting from the emergence of web-scale social data sets, numerous recent studies have systematically explored human mobility patterns over large populations and large time scales. Relatively little attention, however, has been paid to mobility and activity over smaller time-scales, such as a day. Here, we use Twitter to identify people's frequently visited locations along with their likely activities as a function of time of day and day of week, capitalizing on both the content and geolocation of messages. We subsequently characterize people's transition pattern motifs and demonstrate that spatial information is encoded in word choice.
△ Less
Submitted 11 May, 2015; v1 submitted 28 September, 2014;
originally announced October 2014.
-
Human language reveals a universal positivity bias
Authors:
Peter Sheridan Dodds,
Eric M. Clark,
Suma Desu,
Morgan R. Frank,
Andrew J. Reagan,
Jake Ryland Williams,
Lewis Mitchell,
Kameron Decker Harris,
Isabel M. Kloumann,
James P. Bagrow,
Karine Megerdoomian,
Matthew T. McMahon,
Brian F. Tivnan,
Christopher M. Danforth
Abstract:
Using human evaluation of 100,000 words spread across 24 corpora in 10 languages diverse in origin and culture, we present evidence of a deep imprint of human sociality in language, observing that (1) the words of natural human language possess a universal positivity bias; (2) the estimated emotional content of words is consistent between languages under translation; and (3) this positivity bias i…
▽ More
Using human evaluation of 100,000 words spread across 24 corpora in 10 languages diverse in origin and culture, we present evidence of a deep imprint of human sociality in language, observing that (1) the words of natural human language possess a universal positivity bias; (2) the estimated emotional content of words is consistent between languages under translation; and (3) this positivity bias is strongly independent of frequency of word usage. Alongside these general regularities, we describe inter-language variations in the emotional spectrum of languages which allow us to rank corpora. We also show how our word evaluations can be used to construct physical-like instruments for both real-time and offline measurement of the emotional content of large-scale texts.
△ Less
Submitted 15 June, 2014;
originally announced June 2014.
-
Shadow networks: Discovering hidden nodes with models of information flow
Authors:
James P. Bagrow,
Suma Desu,
Morgan R. Frank,
Narine Manukyan,
Lewis Mitchell,
Andrew Reagan,
Eric E. Bloedorn,
Lashon B. Booker,
Luther K. Branting,
Michael J. Smith,
Brian F. Tivnan,
Christopher M. Danforth,
Peter S. Dodds,
Joshua C. Bongard
Abstract:
Complex, dynamic networks underlie many systems, and understanding these networks is the concern of a great span of important scientific and engineering problems. Quantitative description is crucial for this understanding yet, due to a range of measurement problems, many real network datasets are incomplete. Here we explore how accidentally missing or deliberately hidden nodes may be detected in n…
▽ More
Complex, dynamic networks underlie many systems, and understanding these networks is the concern of a great span of important scientific and engineering problems. Quantitative description is crucial for this understanding yet, due to a range of measurement problems, many real network datasets are incomplete. Here we explore how accidentally missing or deliberately hidden nodes may be detected in networks by the effect of their absence on predictions of the speed with which information flows through the network. We use Symbolic Regression (SR) to learn models relating information flow to network topology. These models show localized, systematic, and non-random discrepancies when applied to test networks with intentionally masked nodes, demonstrating the ability to detect the presence of missing nodes and where in the network those nodes are likely to reside.
△ Less
Submitted 20 December, 2013;
originally announced December 2013.
-
Standing Swells Surveyed Showing Surprisingly Stable Solutions for the Lorenz '96 Model
Authors:
Morgan R. Frank,
Lewis Mitchell,
Peter Sheridan Dodds,
Christopher M. Danforth
Abstract:
The Lorenz '96 model is an adjustable dimension system of ODEs exhibiting chaotic behavior representative of dynamics observed in the Earth's atmosphere. In the present study, we characterize statistical properties of the chaotic dynamics while varying the degrees of freedom and the forcing. Tuning the dimensionality of the system, we find regions of parameter space with surprising stability in th…
▽ More
The Lorenz '96 model is an adjustable dimension system of ODEs exhibiting chaotic behavior representative of dynamics observed in the Earth's atmosphere. In the present study, we characterize statistical properties of the chaotic dynamics while varying the degrees of freedom and the forcing. Tuning the dimensionality of the system, we find regions of parameter space with surprising stability in the form of standing waves traveling amongst the slow oscillators. The boundaries of these stable regions fluctuate regularly with the number of slow oscillators. These results demonstrate hidden order in the Lorenz '96 system, strengthening the evidence for its role as a hallmark representative of nonlinear dynamical behavior.
△ Less
Submitted 16 July, 2014; v1 submitted 20 December, 2013;
originally announced December 2013.
-
An Evolutionary Algorithm Approach to Link Prediction in Dynamic Social Networks
Authors:
Catherine A. Bliss,
Morgan R. Frank,
Christopher M. Danforth,
Peter Sheridan Dodds
Abstract:
Many real world, complex phenomena have underlying structures of evolving networks where nodes and links are added and removed over time. A central scientific challenge is the description and explanation of network dynamics, with a key test being the prediction of short and long term changes. For the problem of short-term link prediction, existing methods attempt to determine neighborhood metrics…
▽ More
Many real world, complex phenomena have underlying structures of evolving networks where nodes and links are added and removed over time. A central scientific challenge is the description and explanation of network dynamics, with a key test being the prediction of short and long term changes. For the problem of short-term link prediction, existing methods attempt to determine neighborhood metrics that correlate with the appearance of a link in the next observation period. Recent work has suggested that the incorporation of topological features and node attributes can improve link prediction. We provide an approach to predicting future links by applying the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) to optimize weights which are used in a linear combination of sixteen neighborhood and node similarity indices. We examine a large dynamic social network with over $10^6$ nodes (Twitter reciprocal reply networks), both as a test of our general method and as a problem of scientific interest in itself. Our method exhibits fast convergence and high levels of precision for the top twenty predicted links. Based on our findings, we suggest possible factors which may be driving the evolution of Twitter reciprocal reply networks.
△ Less
Submitted 13 August, 2014; v1 submitted 23 April, 2013;
originally announced April 2013.
-
Happiness and the Patterns of Life: A Study of Geolocated Tweets
Authors:
Morgan R. Frank,
Lewis Mitchell,
Peter S. Dodds,
Christopher M. Danforth
Abstract:
The patterns of life exhibited by large populations have been described and modeled both as a basic science exercise and for a range of applied goals such as reducing automotive congestion, improving disaster response, and even predicting the location of individuals. However, these studies previously had limited access to conversation content, rendering changes in expression as a function of movem…
▽ More
The patterns of life exhibited by large populations have been described and modeled both as a basic science exercise and for a range of applied goals such as reducing automotive congestion, improving disaster response, and even predicting the location of individuals. However, these studies previously had limited access to conversation content, rendering changes in expression as a function of movement invisible. In addition, they typically use the communication between a mobile phone and its nearest antenna tower to infer position, limiting the spatial resolution of the data to the geographical region serviced by each cellphone tower. We use a collection of 37 million geolocated tweets to characterize the movement patterns of 180,000 individuals, taking advantage of several orders of magnitude of increased spatial accuracy relative to previous work. Employing the recently developed sentiment analysis instrument known as the 'hedonometer', we characterize changes in word usage as a function of movement, and find that expressed happiness increases logarithmically with distance from an individual's average location.
△ Less
Submitted 12 September, 2013; v1 submitted 4 April, 2013;
originally announced April 2013.
-
The Geography of Happiness: Connecting Twitter sentiment and expression, demographics, and objective characteristics of place
Authors:
Lewis Mitchell,
Kameron Decker Harris,
Morgan R. Frank,
Peter Sheridan Dodds,
Christopher M. Danforth
Abstract:
We conduct a detailed investigation of correlations between real-time expressions of individuals made across the United States and a wide range of emotional, geographic, demographic, and health characteristics. We do so by combining (1) a massive, geo-tagged data set comprising over 80 million words generated over the course of several recent years on the social network service Twitter and (2) ann…
▽ More
We conduct a detailed investigation of correlations between real-time expressions of individuals made across the United States and a wide range of emotional, geographic, demographic, and health characteristics. We do so by combining (1) a massive, geo-tagged data set comprising over 80 million words generated over the course of several recent years on the social network service Twitter and (2) annually-surveyed characteristics of all 50 states and close to 400 urban populations. Among many results, we generate taxonomies of states and cities based on their similarities in word use; estimate the happiness levels of states and cities; correlate highly-resolved demographic characteristics with happiness levels; and connect word choice and message length with urban characteristics such as education levels and obesity rates. Our results show how social media may potentially be used to estimate real-time levels and changes in population-level measures such as obesity rates.
△ Less
Submitted 18 May, 2013; v1 submitted 13 February, 2013;
originally announced February 2013.
-
CP Violating Asymmetry in Stop Decay into Bottom and Chargino
Authors:
Helmut Eberl,
Sebastian M. R. Frank,
Walter Majerotto
Abstract:
In the MSSM with complex parameters, loop corrections to the decay of a stop into a bottom quark and a chargino can lead to a CP violating decay rate asymmetry. We calculate this asymmetry at full one-loop level and perform a detailed numerical study, analyzing the dependence on the parameters and complex phases involved. If the stop can decay into a gluino, the self-energy and the vertex correcti…
▽ More
In the MSSM with complex parameters, loop corrections to the decay of a stop into a bottom quark and a chargino can lead to a CP violating decay rate asymmetry. We calculate this asymmetry at full one-loop level and perform a detailed numerical study, analyzing the dependence on the parameters and complex phases involved. If the stop can decay into a gluino, the self-energy and the vertex correction dominate due to the strong coupling. It is shown that the vertex contribution is always suppressed. We therefore give a simple approximate formula for the asymmetry. We account for the constraints on the parameters coming from several experimental limits. Asymmetries up to 25 percent are obtained. We also comment on the feasibility of measuring this asymmetry at the LHC.
△ Less
Submitted 6 December, 2010; v1 submitted 23 December, 2009;
originally announced December 2009.
-
CP Violating Asymmetry in Stop Decay into Bottom and Chargino
Authors:
Sebastian M. R. Frank,
Helmut Eberl
Abstract:
In the MSSM with complex parameters, loop corrections to the decay of a stop into a bottom quark and a chargino can lead to a CP violating decay rate asymmetry.
We calculate this asymmetry at full one-loop level and perform a detailed numerical study, analyzing the dependence on the parameters and complex phases involved. In addition, we take the Yukawa couplings of the top and bottom quark ru…
▽ More
In the MSSM with complex parameters, loop corrections to the decay of a stop into a bottom quark and a chargino can lead to a CP violating decay rate asymmetry.
We calculate this asymmetry at full one-loop level and perform a detailed numerical study, analyzing the dependence on the parameters and complex phases involved. In addition, we take the Yukawa couplings of the top and bottom quark running. We account for the constraints on the parameters coming from several experimental limits.
Asymmetries of several percent are obtained. We also comment on the feasibility of measuring this asymmetry at the LHC.
△ Less
Submitted 8 October, 2009; v1 submitted 1 October, 2009;
originally announced October 2009.
-
CP Violating Asymmetries Induced by Supersymmetry
Authors:
Sebastian M. R. Frank
Abstract:
In the Minimal Supersymmetric Standard Model (MSSM) with complex parameters, one-loop corrections to the decay of a stop into a bottom-quark and a chargino can lead to a CP violating decay rate asymmetry. We perform a detailed numerical analysis of this asymmetry and also of the branching ratio of the decay, analyzing the dependence on the parameters and complex phases involved. In addition, we…
▽ More
In the Minimal Supersymmetric Standard Model (MSSM) with complex parameters, one-loop corrections to the decay of a stop into a bottom-quark and a chargino can lead to a CP violating decay rate asymmetry. We perform a detailed numerical analysis of this asymmetry and also of the branching ratio of the decay, analyzing the dependence on the parameters and complex phases involved. In addition, we take the Yukawa couplings of the top- and bottom-quark running. We account for the constraints on the parameters coming from the experimental limit of the electric dipole moment of the electron by calculating and thus checking it automatically along the way. We obtain as results that the asymmetry rises up to 24 percent, depending on the point in parameter space. The combined quantity asymmetry times branching ratio reaches up to 3.5 percent. We also comment on the feasibility of measuring this asymmetry at the Large Hadron Collider (LHC) at CERN. It will be possible to measure our decay rate asymmetry at LHC.
△ Less
Submitted 22 September, 2009;
originally announced September 2009.
-
$Q^2$ Independence of $QF_2/F_1$, Poincare Invariance and the Non-Conservation of Helicity
Authors:
Gerald A. Miller,
Michael R. Frank
Abstract:
A relativistic constituent quark model is found to reproduce the recent data regarding the ratio of proton form factors, $F_2(Q^2)/F_1(Q^2)$. We show that imposing Poincare invariance leads to substantial violation of the helicity conservation rule, as well as an analytic result that the ratio $F_2(Q^2)/F_1(Q^2)\sim 1/Q$ for intermediate values of $Q^2$.
A relativistic constituent quark model is found to reproduce the recent data regarding the ratio of proton form factors, $F_2(Q^2)/F_1(Q^2)$. We show that imposing Poincare invariance leads to substantial violation of the helicity conservation rule, as well as an analytic result that the ratio $F_2(Q^2)/F_1(Q^2)\sim 1/Q$ for intermediate values of $Q^2$.
△ Less
Submitted 29 January, 2002; v1 submitted 9 January, 2002;
originally announced January 2002.
-
Nucleon form factors and a nonpointlike diquark
Authors:
J. C. R. Bloch,
C. D. Roberts,
S. M. Schmidt,
A. Bender,
M. R. Frank
Abstract:
Nucleon form factors are calculated on q^2 in [0,3] GeV^2 using an Ansatz for the nucleon's Fadde'ev amplitude motivated by quark-diquark solutions of the relativistic Fadde'ev equation. Only the scalar diquark is retained, and it and the quark are confined. A good description of the data requires a nonpointlike diquark correlation with an electromagnetic radius of 0.8 r_pi. The composite, nonpo…
▽ More
Nucleon form factors are calculated on q^2 in [0,3] GeV^2 using an Ansatz for the nucleon's Fadde'ev amplitude motivated by quark-diquark solutions of the relativistic Fadde'ev equation. Only the scalar diquark is retained, and it and the quark are confined. A good description of the data requires a nonpointlike diquark correlation with an electromagnetic radius of 0.8 r_pi. The composite, nonpointlike nature of the diquark is crucial. It provides for diquark-breakup terms that are of greater importance than the diquark photon absorption contribution.
△ Less
Submitted 30 July, 1999;
originally announced July 1999.
-
The UA(1) problem and the role of correlated q qbar exchange in effective theories of QCD
Authors:
M. R. Frank,
T. Meissner
Abstract:
The combined absence of physical realizations of the UA(1) symmetry possessed by the classical QCD action in the chiral limit, and of an isoscalar Goldstone boson associated with its spontaneous breakdown, has been dubbed the UA(1) problem. A formal resolution of this problem proposed by 't Hooft relies on instantons to provide a mass to the would-be Goldstone boson (eta '). An alternate scheme…
▽ More
The combined absence of physical realizations of the UA(1) symmetry possessed by the classical QCD action in the chiral limit, and of an isoscalar Goldstone boson associated with its spontaneous breakdown, has been dubbed the UA(1) problem. A formal resolution of this problem proposed by 't Hooft relies on instantons to provide a mass to the would-be Goldstone boson (eta '). An alternate scheme for the generation of an eta ' mass proposed by Kogut and Susskind derives from quark annihilation into gluons and a strong infrared singularity in the gluon propagator associated with confinement. We demonstrate here how such diagrams are generated in quark based effective theories by including a certain class of diagrams which arise from correlated q qbar exchange and are of higher order 1/N_c. A low energy energy expansion of this corrections is of the form discussed by Witten, di Vecchia and Veneziano.
△ Less
Submitted 15 October, 1997; v1 submitted 7 March, 1997;
originally announced March 1997.
-
On the calculation of hadron form factors from Euclidean Dyson-Schwinger equations
Authors:
M. Burkardt,
M. R. Frank,
K. L. Mitchell
Abstract:
We apply Euclidean time methods to phenomenological Dyson-Schwinger models of hadrons. By performing a Fourier transform of the momentum space correlation function to Euclidean time and by taking the large Euclidean time limit, we project onto the lightest on-mass-shell hadron for given quantum numbers. The procedure, which actually resembles lattice gauge theory methods, allows the extraction o…
▽ More
We apply Euclidean time methods to phenomenological Dyson-Schwinger models of hadrons. By performing a Fourier transform of the momentum space correlation function to Euclidean time and by taking the large Euclidean time limit, we project onto the lightest on-mass-shell hadron for given quantum numbers. The procedure, which actually resembles lattice gauge theory methods, allows the extraction of moments of structure functions, moments of light-cone wave functions and form factors without `ad hoc' extrapolations to the on-mass-shell points. We demonstrate the practicality of the procedure with the example of the pion form factor.
△ Less
Submitted 6 November, 1996;
originally announced November 1996.
-
The hadron-quark transition with a lattice of nonlocal confining solitons
Authors:
Charles W. Johnson,
George Fai,
Michael R. Frank
Abstract:
We use a lattice of nonlocal confining solitons to describe nuclear matter in the Wigner-Seitz approximation. The average density is varied by changing the size of the Wigner-Seitz cell. At sufficiently large density quark energy bands develop. The intersection of the filled valence band with the next empty band at a few times standard nuclear density signals a transition from a color insulator…
▽ More
We use a lattice of nonlocal confining solitons to describe nuclear matter in the Wigner-Seitz approximation. The average density is varied by changing the size of the Wigner-Seitz cell. At sufficiently large density quark energy bands develop. The intersection of the filled valence band with the next empty band at a few times standard nuclear density signals a transition from a color insulator to a color conductor and is identified with the critical density for quark deconfinement.
△ Less
Submitted 24 June, 1996;
originally announced June 1996.
-
Low-energy QCD: Chiral coefficients and the quark-quark interaction
Authors:
M. R. Frank,
T. Meissner
Abstract:
A detailed investigation of the low-energy chiral expansion is presented within a model truncation of QCD. The truncation allows for a phenomenological description of the quark-quark interaction in a framework which maintains the global symmetries of QCD and permits a $1/N_c$ expansion. The model dependence of the chiral coefficients is tested for several forms of the quark-quark interaction by…
▽ More
A detailed investigation of the low-energy chiral expansion is presented within a model truncation of QCD. The truncation allows for a phenomenological description of the quark-quark interaction in a framework which maintains the global symmetries of QCD and permits a $1/N_c$ expansion. The model dependence of the chiral coefficients is tested for several forms of the quark-quark interaction by varying the form of the running coupling, $α(q^2)$, in the infrared region. The pattern in the coefficients that arises at tree level is consistent with large $N_c$ QCD, and is related to the model truncation.
△ Less
Submitted 14 November, 1995; v1 submitted 13 November, 1995;
originally announced November 1995.
-
The Role of Color Neutrality in Nuclear Physics--Modifications of Nucleonic Wave Functions
Authors:
M. R. Frank,
B. K. Jennings,
G. A. Miller
Abstract:
The influence of the nuclear medium upon the internal structure of a composite nucleon is examined. The interaction with the medium is assumed to depend on the relative distances between the quarks in the nucleon consistent with the notion of color neutrality, and to be proportional to the nucleon density. In the resulting description the nucleon in matter is a superposition of the ground state…
▽ More
The influence of the nuclear medium upon the internal structure of a composite nucleon is examined. The interaction with the medium is assumed to depend on the relative distances between the quarks in the nucleon consistent with the notion of color neutrality, and to be proportional to the nucleon density. In the resulting description the nucleon in matter is a superposition of the ground state (free nucleon) and radial excitations. The effects of the nuclear medium on the electromagnetic and weak nucleon form factors, and the nucleon structure function are computed using a light-front constituent quark model. Further experimental consequences are examined by considering the electromagnetic nuclear response functions. The effects of color neutrality supply small but significant corrections to predictions of observables.
△ Less
Submitted 19 September, 1995;
originally announced September 1995.
-
Model gluon propagator and pion and rho-meson observables
Authors:
M. R. Frank,
C. D. Roberts
Abstract:
A one parameter, model confined-gluon propagator is employed in a phenomenological application of the Dyson-Schwinger and Bethe-Salpeter equations to the calculation of a range of $π$- and $ρ$-meson observables. Good agreement is obtained with the data. The calculated quark propagator does not have a singularity on the real-$p^2$ axis. A mass formula for the pion, involving only the vacuum, dres…
▽ More
A one parameter, model confined-gluon propagator is employed in a phenomenological application of the Dyson-Schwinger and Bethe-Salpeter equations to the calculation of a range of $π$- and $ρ$-meson observables. Good agreement is obtained with the data. The calculated quark propagator does not have a singularity on the real-$p^2$ axis. A mass formula for the pion, involving only the vacuum, dressed quark propagator, is presented and shown to provide an accurate estimate of the mass obtained via a direct solution of the Bethe-Salpeter equation.
△ Less
Submitted 3 August, 1995;
originally announced August 1995.
-
Off-Shell Axial Anomaly via the γ^* π^0 -> γTransition
Authors:
M. R. Frank,
K. L. Mitchell,
C. D. Roberts,
P. C. Tandy
Abstract:
The $γ^* π^0 \rightarrow γ$ form factor, including the extension off the pion mass-shell, is obtained from a generalized impulse approximation within a QCD-based model field theory known to provide an excellent description of the pion charge form factor. This approach implements dressing of the vertex functions and propagators consistent with dynamical chiral symmetry breaking, gauge invariance,…
▽ More
The $γ^* π^0 \rightarrow γ$ form factor, including the extension off the pion mass-shell, is obtained from a generalized impulse approximation within a QCD-based model field theory known to provide an excellent description of the pion charge form factor. This approach implements dressing of the vertex functions and propagators consistent with dynamical chiral symmetry breaking, gauge invariance, quark confinement and perturbative QCD. Soft nonperturbative behavior, dictated by the axial anomaly, is found to evolve to the perturbative QCD limit only for \mbox{$Q^2 \geq 20~{\rm GeV}^2$}.
△ Less
Submitted 2 December, 1994;
originally announced December 1994.
-
Relativistic models for quasielastic $(e,e')$\ at large momentum transfers
Authors:
Hungchong Kim,
C. J. Horowitz,
M. R. Frank
Abstract:
Inclusive quasielastic response functions are calculated for electron scattering in a relativistic model including momentum dependent scalar and vector mean fields. The momentum dependence of the mean fields is taken from Dirac optical fits to proton nucleus scattering and is important in describing data at momentum transfers of 1 GeV/c or larger. Our simple model is applicable for quasielastic…
▽ More
Inclusive quasielastic response functions are calculated for electron scattering in a relativistic model including momentum dependent scalar and vector mean fields. The momentum dependence of the mean fields is taken from Dirac optical fits to proton nucleus scattering and is important in describing data at momentum transfers of 1 GeV/c or larger. Our simple model is applicable for quasielastic scattering over a large range of momentum transfers.
△ Less
Submitted 6 October, 1994;
originally announced October 1994.
-
Nonperturbative aspects of the quark-photon vertex
Authors:
Michael R. Frank
Abstract:
The electromagnetic interaction with quarks is investigated through a relativistic, electromagnetic gauge-invariant treatment. Gluon dressing of the quark-photon vertex and the quark self-energy functions is described by the inhomogeneous Bethe-Salpeter equation in the ladder approximation and the Schwinger-Dyson equation in the rainbow approximation respectively. Results for the calculation of…
▽ More
The electromagnetic interaction with quarks is investigated through a relativistic, electromagnetic gauge-invariant treatment. Gluon dressing of the quark-photon vertex and the quark self-energy functions is described by the inhomogeneous Bethe-Salpeter equation in the ladder approximation and the Schwinger-Dyson equation in the rainbow approximation respectively. Results for the calculation of the quark-photon vertex are presented in both the time-like and space-like regions of photon momentum squared, however emphasis is placed on the space-like region relevant to electron scattering. The treatment presented here simultaneously addresses the role of dynamically generated $q\bar{q}$ vector bound states and the approach to asymptotic behavior. The resulting description is therefore applicable over the entire range of momentum transfers available in electron scattering experiments. Input parameters are limited to the model gluon two-point function, which is chosen to reflect confinement and asymptotic freedom, and are largely constrained by the obtained bound-state spectrum.
△ Less
Submitted 8 March, 1994;
originally announced March 1994.
-
Gauge Invariance and the Electromagnetic Current of Composite Pions
Authors:
M. R. Frank,
P. C. Tandy
Abstract:
The Global Color-symmetry Model of QCD is extended to deal with a background electromagnetic field and the associated conserved current is identified for composite $\bar{q}q$ pion modes of the model. Although the analysis is limited to tree level in the bilocal fields that bosonize the model, the identified photon-pion vertex produces the charge form factor associated with ladder Bethe-Salpeter…
▽ More
The Global Color-symmetry Model of QCD is extended to deal with a background electromagnetic field and the associated conserved current is identified for composite $\bar{q}q$ pion modes of the model. Although the analysis is limited to tree level in the bilocal fields that bosonize the model, the identified photon-pion vertex produces the charge form factor associated with ladder Bethe-Salpeter pion amplitudes. A Ward-Takahashi identity for this vertex is derived in terms of the effective inverse propagator for the equivalent local pion field and the intrinsic ladder Bethe-Salpeter amplitudes. This identity is then used to illustrate gauge invariance by showing that identical vertex information is produced from the gauge change of the free action once proper account is taken of the gauge transformation properties of the bilocal pion fields. Comments are made on the location of the vector dominance mechanism in this treatment.
△ Less
Submitted 30 March, 1993;
originally announced March 1993.