The Lifespan of a Lie – Trust Issues – Medium
"The most famous psychology study of all time was a sham. Why can’t we escape the Stanford Prison Experiment?"
june 2018 by tsuomela
Frictionless Data: Making Research Data Quality Visible | International Journal of Digital Curation
"There is significant friction in the acquisition, sharing, and reuse of research data. It is estimated that eighty percent of data analysis is invested in the cleaning and mapping of data (Dasu and Johnson,2003). This friction hampers researchers not well versed in data preparation techniques from reusing an ever-increasing amount of data available within research data repositories. Frictionless Data is an ongoing project at Open Knowledge International focused on removing this friction. We are doing this by developing a set of tools, specifications, and best practices for describing, publishing, and validating data. The heart of this project is the “Data Package”, a containerization format for data based on existing practices for publishing open source software. This paper will report on current progress toward that goal."
research-data  data-curation  analysis  methods 
may 2018 by tsuomela
Too Much Information and the KWIC | SpringerLink
"This paper takes a media archaeology look at the development of the Keyword-in-Context (KWIC) display by Peter Luhn and how the KWIC helped automate ways of disseminating information about information. The paper takes the development of the KWIC as an example of the development of a knowledge technology that frames knowledge in a certain way. The KWIC and other information technologies transform knowledge into information that can be quantified and processed. Developments like the KWIC are the beginning of language engineering—a new way of conceiving of text as information to be manipulated. Finally, the paper proposes a way of reflecting on developments like the KWIC by replicating these early technologies. Replications can take the form of demonstration devices or knowledge things that expose the processes in our infrastructure."
digital-humanities  keywords  methods  history 
may 2018 by tsuomela
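The excerpt above suggests replicating early technologies like the KWIC. As a rough illustration only, here is a minimal concordance-style sketch in Python. It is not a reconstruction of Luhn's system; the function name `kwic`, the `width` parameter, and the whitespace tokenization are all assumptions made for this example.

```python
def kwic(text, keyword, width=3):
    """Return (left context, keyword token, right context) for each hit.

    Hypothetical helper for illustration: tokens are split on whitespace,
    and surrounding punctuation is ignored when matching the keyword.
    """
    tokens = text.split()
    hits = []
    for i, token in enumerate(tokens):
        # Compare case-insensitively, ignoring punctuation around the token.
        if token.strip('.,;:!?"\'()').lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, token, right))
    return hits


if __name__ == "__main__":
    sample = "The cat sat on the mat. The dog saw the cat."
    for left, key, right in kwic(sample, "cat", width=2):
        # Right-align the left context so the keywords line up in a column.
        print(f"{left:>12} | {key} | {right}")
```

Lining up the keyword in a central column is the display idea the name refers to; a fuller replication would also need a stop-word list and sorting by keyword.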
Levine, C.: Forms: Whole, Rhythm, Hierarchy, Network (Paperback and eBook) | Princeton University Press
"Forms offers a powerful new answer to one of the most pressing problems facing literary, critical, and cultural studies today—how to connect form to political, social, and historical context. Caroline Levine argues that forms organize not only works of art but also political life—and our attempts to know both art and politics. Inescapable and frequently troubling, forms shape every aspect of our experience. Yet, forms don't impose their order in any simple way. Multiple shapes, patterns, and arrangements, overlapping and colliding, generate complex and unpredictable social landscapes that challenge and unsettle conventional analytic models in literary and cultural studies. Borrowing the concept of "affordances" from design theory, this book investigates the specific ways that four major forms—wholes, rhythms, hierarchies, and networks—have structured culture, politics, and scholarly knowledge across periods, and it proposes exciting new ways of linking formalism to historicism and literature to politics. Levine rereads both formalist and antiformalist theorists, including Cleanth Brooks, Michel Foucault, Jacques Rancière, Mary Poovey, and Judith Butler, and she offers engaging accounts of a wide range of objects, from medieval convents and modern theme parks to Sophocles's Antigone and the television series The Wire. The result is a radically new way of thinking about form for the next generation and essential reading for scholars and students across the humanities who must wrestle with the problem of form and context."
book  publisher  forms  formal  literary-criticism  methods  social  analysis 
may 2018 by tsuomela
A quick rule of thumb is that when someone seems to be acting like a jerk, an economist will defend the behavior as being the essence of morality, but when someone seems to be doing something nice, an economist will raise the bar and argue that he’s not
"A quick rule of thumb is that when someone seems to be acting like a jerk, an economist will defend the behavior as being the essence of morality, but when someone seems to be doing something nice, an economist will raise the bar and argue that he’s not being nice at all."
economics  methods  ideology 
april 2018 by tsuomela
Topic Modeling in Python with NLTK and Gensim | DataScience+
"In this post, we will learn how to identify which topic is discussed in a document, called topic modeling. In particular, we will cover Latent Dirichlet Allocation (LDA): a widely used topic modelling technique. And we will apply LDA to convert set of research papers to a set of topics."
python  statistics  topic-modeling  digital-humanities  methods 
april 2018 by tsuomela
The Ethics of Technology: A Geometric Analysis of Five Moral Principles // Reviews // Notre Dame Philosophical Reviews // University of Notre Dame
"Martin Peterson, The Ethics of Technology: A Geometric Analysis of Five Moral Principles, Oxford University Press, 2017, 252pp., $74.00 (hbk), ISBN 9780190652265."
book  review  technology  ethics  methods  philosophy 
november 2017 by tsuomela
Ways to Compute Topics over Time, Part 1 · from data to scholarship
"This the first in a series of posts which constitute a “lit review” of sorts to document the range of methods scholars are using to compute the distribution of topics over time."
digital-humanities  topic-modeling  methods  tutorial  temporal 
june 2017 by tsuomela
Developing Data Products… by Brian Caffo et al. [PDF/iPad/Kindle]
"Developing Data Products in R Brian Caffo and Sean Kross This book introduces the topic of Developing Data Products in R. A data product is the ideal output of a Data Science experiment. This book is based on the Coursera Class "Developing Data Products" as part of the Data Science Specialization. Particular emphasis is paid to developing Shiny apps and interactive graphics. "
book  data-science  data  products  publishing  reproducible  research  methods 
november 2016 by tsuomela
Home | Qualitative Data Repository
"QDR selects, ingests, curates, archives, manages, durably preserves, and provides access to digital data used in qualitative and multi-method social inquiry.  The repository develops and publicizes common standards and methodologically informed practices for these activities, as well as for the reusing and citing of qualitative data.  Four beliefs underpin the repository's mission: data that can be shared and reused should be; evidence-based claims should be made transparently; teaching is enriched by the use of well-documented data; and rigorous social science requires common understandings of its research methods."
data  data-curation  repository  qualitative  methods  social-science  research-data 
september 2015 by tsuomela
Strategies for Social Inquiry | Cambridge University Press
"This new book series presents texts on a wide range of issues bearing upon the practice ofsocial inquiry. Strategies are construed broadly to embrace the full spectrum of approachesto analysis, as well as relevant issues in philosophy of social science."
publisher  books  series  social-science  methods  methodology 
september 2015 by tsuomela
"PRISONER is a framework for running ethical and reproducible social network experiments. PRISONER is actively under development, and has been released to help steer its design, and to improve the consideration of these concerns in the community."
social-media  methods  ethics  research  framework  open-source 
september 2015 by tsuomela
Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial. - PubMed - NCBI
"Many research designs require the assessment of inter-rater reliability (IRR) to demonstrate consistency among observational ratings provided by multiple coders. However, many studies use incorrect statistical procedures, fail to fully report the information necessary to interpret their results, or do not address how IRR affects the power of their subsequent analyses for hypothesis testing. This paper provides an overview of methodological issues related to the assessment of IRR with a focus on study design, selection of appropriate statistics, and the computation, interpretation, and reporting of some commonly-used IRR statistics. Computational examples include SPSS and R syntax for computing Cohen's kappa and intra-class correlations to assess IRR."
research  methods  reliability  reference 
june 2015 by tsuomela
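The tutorial bookmarked above gives its examples in SPSS and R. As a companion, here is a minimal Python sketch of Cohen's kappa for two raters assigning nominal categories, using the standard definition kappa = (p_o - p_e) / (1 - p_e). The function name `cohens_kappa` and the sample ratings are assumptions made for this illustration and do not come from the paper.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items (nominal data).

    Hypothetical helper for illustration; inputs are equal-length sequences
    of category labels, one label per rated item.
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("Both raters must rate the same non-empty set of items.")
    n = len(rater_a)

    # Observed agreement: share of items where the two raters agree.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: product of each rater's marginal proportions,
    # summed over categories.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    p_expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)

    if p_expected == 1.0:
        # Both raters used a single identical category; kappa is undefined.
        raise ValueError("Kappa is undefined when chance agreement is 1.")
    return (p_observed - p_expected) / (1 - p_expected)


if __name__ == "__main__":
    a = ["yes", "yes", "no", "no"]
    b = ["yes", "no", "no", "no"]
    print(cohens_kappa(a, b))
```

In the sample, the raters agree on 3 of 4 items (p_o = 0.75) and chance agreement is 0.5, so kappa is 0.5. This covers only the unweighted two-rater case; ordinal ratings or more than two coders call for other statistics, such as the intra-class correlations the abstract mentions.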
Leviathan And You: A Blog About Big Things: Kind of a Big Fake
"This is why I am not so sure about Brookman's assertion that "you let the data inform your views."  Professions of faith that the data tell their own story ignore the culturally specific choices that inform what counts as a datum in the first place.  Part of why LaCour was successful was because he was able to take advantage of uncritical beliefs that ignored how disciplinary knowledge is produced and authorized. But then again, I'm just making this up."
fraud  data  social-science  methods  bias 
may 2015 by tsuomela
DHQ: Digital Humanities Quarterly: Agent-Based Modeling and Historical Simulation
"This essay discusses agent-based modeling (ABM) and its potential as a technique for studying history, including literary history. How can a computer simulation tell us anything about the past? This essay has three distinct goals. The first is simply to introduce agent-based modeling as a computational practice to an audience of digital humanists, for whom it remains largely unfamiliar despite signs of increasing interest. Second, to introduce one possible application for social simulation by comparing it to conventional, print-based models of the history of book publishing. Third, and most importantly, I’ll sketch out a theory and preliminary method for incorporating social simulation into an on-going program of humanities research."
agent-based-model  history  methods  digital-humanities 
february 2015 by tsuomela
"Large-scale databases of human activity in social media have captured scientific and policy attention, producing a flood of research and discussion. This paper considers methodological and conceptual challenges for this emergent field, with special attention to the validity and representativeness of social media big data analyses. Persistent issues include the over-emphasis of a single platform, Twitter, sampling biases arising from selection by hashtags, and vague and unrepresentative sampling frames. The socio-cultural complexity of user behavior aimed at algorithmic invisibility (such as subtweeting, mock-retweeting, use of “screen captures” for text, etc.) further complicate interpretation of big data social media. Other challenges include accounting for field effects, i.e. broadly consequential events that do not diffuse only through the network under study but affect the whole society. The application of network methods from other fields to the study of human social activity may not always be appropriate. The paper concludes with a call to action on practical steps to improve our analytic capacity in this promising, rapidly-growing field."
conference  social-media  methods  methodology 
december 2014 by tsuomela
The Historian's Macroscope: Big Digital History
"Welcome to Big Digital History: Exploring Big Data through a Historian’s Macroscope, a co-written manuscript by Shawn Graham, Ian Milligan, and Scott Weingart. Over the next few months, and into April 2014, we’re going to be writing this book in public."
online  big-data  history  distant-reading  methods  digital-humanities  book 
october 2014 by tsuomela
Scientific method: Statistical errors : Nature News & Comment
"P values, the 'gold standard' of statistical validity, are not as reliable as many scientists assume."
statistics  research  replication  methods  significance 
february 2014 by tsuomela
