当前位置:天才代写 > Python代写,python代做代考-价格便宜,0时差服务 > Python project代写 Python code代写

Python project代写 Python code代写

2021-09-08 15:51 星期三 所属: Python代写,python代做代考-价格便宜,0时差服务 浏览:678

Python project代写

Python project代写

Python project代写 Note: Here is a sample tweet file that you can work with: “trump_20200530.csv”. This dataset contains President Trump’s tweets

Analyze Data

Note: Here is a sample tweet file that you can work with: “trump_20200530.csv”. This dataset contains President Trump’s tweets from the moment he took office on January 20, 2017 to May 30, 2020. Download the data from Canvas. You are free to down sample this data to keep it within 1000 tweets.

Q1. Asking questions Python project代写

What are some compelling questions that you can ask with this dataset (list at least 2 questions)? Enter your responses and rationale behind choosing those questions in markdown cells. Hint: You had collected pandemictweets.csv file using certain keywords and had provided a rationale behind the choice of the keywords. This can help you think through some questions.

Q2. Inspect & Data Cleaning Python project代写

2a). Inspect: Write code to inspect the data. What do you observe? Along with the code, write your observation in markdown cell.

2b). Clean: Write code to clean the data. Along with code, you need to write the rationale behind the cleaning process, i.e. what are you observing after first level of cleaning, what is still messy and needs additional cleaning, how are you deciding to do it, etc. At this stage, your cleaning should at least comprise 1). Common data cleaning steps and 2). Dealing with messy Twitter data. Python project代写

2c). Tokenize: Write code to tokenize your entire dataset. Use at least 3 different types of tokenizers. Display results from all the tokenizers in a pandas dataframe so that you can visually compare them.

2d). Pick the best tokenizer. Which one do you think works best for your data and why?

Python project代写
Python project代写

Q3. Analyze Data for Sentiment

3a). Sentiment analysis: Pick at least 3 different ways of conducting sentiment analysis. Write code to loop through your data and find sentiment for each tweet, using each of these methods.

3b). Quantitatively comparing methods: Is there a way to do pairwise comparison of the methods that you picked? You need to report at least one pairwise comparison between the methods. Better if you are able to report all pairwise comparisons. Python project代写

In you pairwise comparison, compute a quantitative measure to show what proportion of tweets match or does not match between two measures. You can also include the rationale behind the choice of your measure.

3c). Qualitative + Quantitative comparison: Write code to randomly pick 25 tweets. By hand, mark each of their sentiments. Now pick two sentiment analysis methods of your choice to automatically find sentiments of these 25 tweets. Considering that your hand-label is the absolute ground truth, write code to figure out which sentiment analysis method works better? It is possible that both methods that you picked are equally good. Provide rationale in markdown cell for your response. Hint: the more the analysis method’s output matches with your hand-label, the better it is.

Q4. Analyze Data with LIWC

Pick at least two dimensions from LIWC that you would want to investigate with respect to your data. Write code to find what proportion of each of the two dimensions that you picked are present in your data. Motivate your choice of dimension with a research question. For example, if you are curious to know how many angry tweets are there, you can pick the “angry” dimension in LIWC. Another neat trick here would be make your code modular via python functions, so that later you can reuse this function. Python project代写

Q5. Analyze Data over time Python project代写

How does the polarity of your corpus change over time? Answer this question by showing plots. What do you observe from the plot?

Question 6:

i)Pick an online site of your choice where you are eager to try out web-scraping skills to fetch data. These can be review websites, like Amazon, etsy, online donation platforms like Kickstarter or gofundme, etc. Fetch at least 10 web pages and associated text data using beautiful soup scraping technique. You are free to try other scraping techniques. Here are weare only concerned about the text data. For example, this can be reviews for a product page on Amazon, or project description in case of Kickstarter. Python project代写

Save the scraped text data in a file and submit it to be graded for the bonus question.

Also, place the text data in a pandas dataframe. Choose appropriate index for the data. Display the pandas dataframe in your notebook.

ii)Clean the collected data and conduct sentiment analysis to show polarity of each of these10 scraped data points.

Python project代写
Python project代写

其他代写:代写CS C++代写 java代写 matlab代写 web代写 app代写 作业代写 物理代写 数学代写 考试助攻 paper代写 r代写 金融经济统计代写 python代写

合作平台:essay代写 论文代写 写手招聘 英国留学生代写

 

天才代写-代写联系方式