Shirui (Oli) Zhou

Logo

Data Scientist Intern@ World Bank | Data Analyst Intern @ Ashoka | Operation Assistant @ Matters Lab | Research Assistant @ Intellisia Institude | Data Science for Public Policy @ Georgetown Georgetown

A human really love sea 🏝 Skateboarding 🛹 Guitar 🎸 and photography 📹

Random Thougths in English

Random Thoughts in Mandarin

View My LinkedIn Profile

Data Science Blog

View My GitHub Profile

Selected Projects in Data Science


Forecasting Cryptocurrency Prices Using Machine Learning: An Analysis of Reddit Discussions

code webiste

Quantifying the Complex Relationship between Lyrics, Chord Progression, and Emotion Stimulation Examine the Song Features

- Utilizes Bag of Words and Word2Vec methods, combined with KNN/hierarchical clustering, to identify songs with similar lyrics. Incorporates LDA and sentiment analysis for deeper lyrical understanding. Explores relationships between numerical (tempo, chord progression, key) and textual features using supervised machine learning.
- Trianing machine to understand the connection between elements in music, improving recommendation accuracy using machine leanring techniques and textual features

paper code webiste blog


Build a song similarity calculator to improve the recommendation system

- Features an interface for a song similarity calculator that evaluates both lyrical and numerical attributes of songs. Utilizes cosine similarity for comparison, displaying results through an overlapping radar graph for visual analysis.

whitepaper code application


Network Analysis of Twitter to Identify Opinion Leader, Emotional Cascade and Community Structure

- Deployed K-core decomposition to examine the community structure, applied NLP including Name Entity Recognition, Sentimental Analysis and Topic Modelling on tweets to investigate the emotion cascade

paper code slides


How Consensus can be Involved through Innovating Voting Mechanism? Using Polis Platform as an Example

- Using PCA and UMAP to visualize the participants’ stance on a 2-dimensional map, uses Kmeans to cluster and classify group A and B, and uses centroid coords calculation to get the distance between two groups.

paper code slides


Predicting Attitudes towards UBI in EU using Supervised Machine Learning Techniques

- Built and trained Logistic Regression, DecisionTree, SVM, RandomForest, XGBoost and GBDT to identify 5 primary indicators and 35 secondary indicators the key influential factor on the attitude of EU citizen towards UBI

paper code slides


Using DID to Evaluate the Impact of Intensive Case Management Services

- Designed DID to construct quasi-experiment setting for causal inference to quantify the impacts

paper code


Can Remittances Compensate for Parental Absence? Evaluated by the Psychological Well-being and Education Outcome

- Developed a multivariate model in STATA incorporating Difference-in-Difference and Propensity Score Matching

paper code


George Floyd protests’ impact on the election result

- Employed fixed effects regression trating county’s features as time-invariant variable to examine the influence of protest on election result

paper code


Review of a Dynamic Model of Housing Demand: Estimation and Policy Implications

- Investigates the problem of household housing demand facing income and house price shocks. Two stage method by Bajari et al. (2013) is used and discrete state dynamic programming and fmin research are numerical methods applied

paper code



Selected Articles on Policy Suggestion


Congressional Policy towards U.S. Asian Policy

- Analysis and Recommendation on the vote for Cohen Amendment to Burma Sanctions
- Recommendation Regarding the Vote on Bonior’s Motion to Recommit and Bill of H.R.4444
- Recommendation Regarding the Vote on Korea Sanctions and Policy Enhancement Act (H.R. 757)
- Recommendation Regarding the Vote on final Senate passage of HR.1314 (TPA)
- Recommendation Regarding the Vote on final Senate passage of S.1169
- Recommendation on the Vote on the Currency Exchange Rate Oversight Reform Act of 2011 (S.1619)
- Vote on Dole:Smith Amendment and McCain:Kerry Amendment

Decentralized Web as Public Sphere

paper

- Your taste, your deepest fear and your dream are, in a sense, shaped by the allocation of your attention, which are disproportionately allocated to the commercial advertisement, where capital intentionally directs. This can be a silent penetration process, as depicted in the The Society of the Spectacle by Guy Debord, where the consumer culture and commodity fetishism would only affect the individual thinking and decision-making in a subconscious way.

Protocolizing the Governance of Public Goods: Using Social Media as an Example

paper

- By applying Elinor Ostrom’s Institutional Analysis and Development (IAD) framework to common-pool resource problems (Ostrom, 2005), this study aims to explore the possibilities and limitations of protocolization in addressing the governance challenges of social media platforms. In doing so, the paper delves into the paradox of programmability and irreproducibility, examining the potential risks of systematizing and institutionalizing bias (Crawford, 2016), while considering alternative approaches that reimagine the relationship between technology and democracy (Winner, 1986).

What if we have an Digital Agora?

paper

- Reimagine Social Platform with Tokenization of Attention Economy, Bridging-based Algorithm and Quadratic Voting

Strategies for Government Relations in US and EU to VP for Governmental Relations at Facebook

paper

- Although Facebook need to face the judging of “big tobacco moment” in technology corporations, it has several opportunists as one of the most powerful platforms. This paper concludes on to realize the profit maximization, it need to cooperate with government and serve as a safeguard for public interest, which can be achieved by grass-root movement, build coalition with government officials, and reframe its public image.

Strategies for Green Party to Robert Habeck, Vice Chancellor, Federal Minister for Economic Affairs and Climate Action, Co-leader of Green Party

paper

- The division of opinion over multiple issues in the current traffic light coalition and the raising of Afd, as well as the pervasiveness of anti-globalization, populism and polarization could impose further difficulty for Greens to maintain its position in the coalition. Considering the current global economic recession and democracy deconstruction, Greens should be prepared to forgo some universal basic values and adopt more realistic foreign policies, and advocate for strengthening national power, with the emphasis on its representativeness of future and hope.

Strategies to Constrain the Cigarette and Vaping Industries to the President of the United States, Joseph Biden

paper

- This memorandum suggests using COVID-19 as opportunity window, reframing the debate as the movements to greater ethnic equity and realization of national value, building advocacy coalition and discrediting the front group as three- pronged political strategies to constraint the cigarettes and vaping strategies.

Page template forked from evanca