SNSの分析——欧米の動向

「The aim of this study is to document, characterize, and quantify early public discourse and conversation of the #MeToo movement from Twitter data in the United States. We focus on posts with public first-person revelations of sexual assault/abuse and early life experiences of such events.」

「本研究の目的は米国のツイッターの投稿データかた#MeToo運動の初期の言論を記録し、特徴づけ、定量化することである。性的暴行・虐待の告発（revelations）や、そうした人生経験を打ち明けた投稿に注目をする」（山口訳）」

「Methods:
We purchased full tweets and associated metadata from the Twitter Premium application programming interface between October 14 and 21, 2017 (ie, the first week of the movement). We examined the content of novel English language tweets with the phrase “MeToo” from within the United States (N=11,935). We used machine learning methods, least absolute shrinkage and selection operator regression, and support vector machine models to summarize and classify the content of individual tweets with revelations of sexual assault and abuse and early life experiences of sexual assault and abuse.」

「方法：
2017年10月24日から21日（すなわち運動の最初の週）のすべてのツイートと関連メタデータをTwitter Premiumアプリケーションプログラミングインターフェースから購入した。米国内の"Me too"というフレーズを含むツイートの内容を調査した（N=11,395）。ラッソ回帰法および教師あり機械学習モデルを用いて、性的暴行・虐待の暴露と性的暴行・虐待の初期生活体験を持つ個々のツイートの内容を要約・分類した」（山口訳）

◇Modrek, S., & Chakalov, B. (2019). The# MeToo movement in the United States: text analysis of early twitter conversations. Journal of medical Internet research, 21(9), e13837.

「The aims of this study focused on using big data, machine learning, content analysis, and SNA to identify and characterize Twitter messages related to the COVID-19 Liberate Protests and their relevant online user groups and communities. To carry out these aims, the study was conducted in 2 distinct phases: (1) data collection and processing; and (2) data analysis using unsupervised machine learning approaches in combination with manual annotation of Twitter topics and user-generated messages, which we describe below (see Figure A1 for summary of methods). All data collection and analyses were done in the computer programming languages Python and R.」

「本研究の目的は、ビッグデータ、機械学習、コンテンツ分析、SNAを用いて、COVID-19 Liberate Protestsとその関連するユーザーグループとコミュニティに関連するTwitterメッセージを識別し、特徴付けることである。これらの目的を達成するため、本研究は2つのフェーズに分かれて行われた。1）データ収集と処理、（2）教師なし機械学習アプローチとTwitterトピックおよびユーザー生成メッセージの手動アノテーションを組み合わせたデータ分析、である。データ収集と分析はすべて、コンピュータ・プログラミング言語であるPythonとRで行った。」

「Tweets were accessed through the public streaming application programming interface (API) from Twitter. We used virtual machines deployed on Amazon Web Services (AWS) cloud-based computing services to collect tweets first filtered for general COVID-19-related keywords including: “covid19”, “corona”, “coronavirus”, “coronavid19” as used and validated in prior COVID-19 Twitter studies [20,21]. From this initial corpus of general COVID-19 tweets, we then filtered the dataset for the keyword “Liberate” (including tweets that contained #liberate), as it is widely used to describe the Liberate movement in the context of both pro-liberate and con-liberate user-generated conversations and sentiment. Data collection was set for the period between April 1st, 2020 to April 20th, 2020, however the vast majority of the tweets were generated between April 17th - 20th when widespread media coverage, Liberate protests, and national debate about the Liberate movement was at its peak. In fact, from April 1st – 16th we only detected 45 tweets that contained the term “Liberate”. As seen in Fig. 1, the Liberate discussion on Twitter occurred after Trump voiced his support of the Liberate movement, which shows a spike in tweets with the word “Liberate” shortly after he tweeted. After data collection and filtering, we collected 34,672 tweets in total that included both a COVID-19 general term and the term “Liberate”. Each tweet contained the text content of the tweet and additional metadata such as user information, time stamp, media (e.g. images, videos), and any associated hyperlinks.」

「ツイートは、Twitterの公開ストリーミングアプリケーションプログラミングインターフェース（API）を通じてアクセスした。Amazon Web Services（AWS）のクラウドコンピューティングサービス上に配置された仮想マシンを使用し、まずCOVID-19関連の一般的なキーワードでフィルタリングしたツイートを収集した。COVID-19のTwitterにおける分析の先行研究[20,21]で使用・検証された「covid19」、「corona」、「coronavid19」などを含む一般的なキーワードでまずフィルタリングされたツイートを収集した。この一般的なCOVID-19ツイートの初期コーパスから、「Liberate」というキーワード（#liberateを含むツイート）でデータセットをフィルタリングした。このキーワードは、pro-liberate派とcon-liberate反対派の両方のユーザーの生成した会話と感情の文脈でLiberate Movementを説明するために広く使用されているためである。データ収集期間は2020年4月1日～2020年4月20日であるが、ツイートの大部分は4月17日～20日に生成され、メディアによる報道、Liberate protests,、Liberate movementに関する国内議論がピークに達した。実際、4月1日から16日の間に「Liberate」という言葉を含むツイートは45件しか検出されない。図1に見られるように、Twitter上でのLiberate debateはトランプ氏がLiberate Movementを支持すると発言した後に発生しており、トランプ氏がツイートした直後に「Liberate」という単語を含むツイートが急増していることが分かる。データ収集とフィルタリングの結果、COVID-19の一般用語と「Liberate」という単語が両方含まれるツイートが合計34,672件収集された。各ツイートには、ツイートのテキストコンテンツと、ユーザー情報、タイムスタンプ、メディア（画像や動画など）、関連するハイパーリンクなどの追加メタデータが含まれる。」

「We utilized the Biterm Topic Model (BTM) that identifies patterns in short texts that has been used in prior studies [22,23] examining a wide variety of topics, including self-reporting of COVID-19-related symptoms on Twitter and other public health issues of concern [20].」

「我々は、Twitter上のCOVID-19関連の症状の自己報告や懸念される他の公衆衛生問題[20]など、様々なトピックを調査する先行研究[22,23]で使用されている短いテキストのパターンを特定するBiterm Topic Model（BTM）を利用した。」

◇Haupt, M. R., Jinich-Diamant, A., Li, J., Nali, M., & Mackey, T. K. (2021). Characterizing twitter user topics and communication network dynamics of the “Liberate” movement during COVID-19 using unsupervised machine learning and social network analysis. Online Social Networks and Media, 21, 100114.

「As social movements relying on the weak ties found in social networks have spread around the world, researchers have taken several approaches to understanding how networks function in such instances as the Arab Spring. While social scientists have primarily relied on survey or content analysis methodology, network scientists have used social network analysis. This research combines content analysis with the automated techniques of network analysis to determine the roles played by those using Twitter to communicate during the Turkish Gezi Park uprising. Based on a network analysis of nearly 2.4 million tweets and a content analysis of a subset of 5126 of those tweets, we found that information sharing was by far the most common use of the tweets and retweets, while tweets that indicated leadership of the movement constituted a small percentage of the overall number of tweets. 」

「社会的ネットワークという弱い絆に依拠した社会運動が世界中に広がる中、「アラブの春」のような事例においてネットワークがどのように機能しているかを理解するために、研究者はいくつかのアプローチをとってきた。社会科学者が主に調査やコンテンツ分析の手法を用いるのに対して、ネットワーク科学者はソーシャルネットワーク分析を用いている。本研究では、内容分析とネットワーク分析の自動化技術を組み合わせ、トルコのゲジ公園暴動でTwitterを使ってコミュニケーションした人々が果たした役割を明らかにする。約240万件のツイートのネットワーク分析と、そのうちの5126件のツイートのサブセットの内容分析から、ツイートやリツイートの用途は情報共有が圧倒的に多く、運動のリーダーシップを示すツイートは全体の数に対してごく少数であることが明らかになった。」

◇Ogan, C., & Varol, O. (2017). What is gained and what is left to be done when content analysis is added to network analysis in the study of a social movement: Twitter use during Gezi Park. Information, Communication & Society, 20(8), 1220-1238.

投稿日：2022/07/17

修正日：2022/07/17

目次

SNSの分析——欧米の動向

コメントを残すコメントをキャンセル

目次

コメントを残す コメントをキャンセル

コメントを残すコメントをキャンセル