Data Science using Python

Rithic Hariharan
2 min readAug 9, 2020

--

Programming language Python is very simple to learn and a super readable one. The scope of Python is spreading everywhere and it will be the future including from Machine Learning to automating simple things, It can be used anywhere in the world.

Now coming to the hottest area called Data Science. Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from many structural and unstructured data. It is more related to data mining, ML and big data.

Here I am going to discuss big data analysis using 2 of Python’s most prominent libraries .

  1. Pandas
  2. Matplotlib

Pandas:- One of the most important libraries that are used in big data analysis. This was invented by Wes McKinney. He built it upon the NumPy module using python and C. This library is popular among most of the data scientists out there.

Matplotlib:- A python library used to visualise the data sets. This python library is also based on NumPy. This library provides an object-oriented API for embedding plots into applications using general-purpose GUI toolkits like Tkinter, wxPython, Qt, or GTK+.

My recent experiments on DataScience

Recently I made a hands on with Covid Cases dataset and visualized it as a graph.

The data for the number of covid cases in India till 27th July 2020 was collected from kaggle and I’ve used these datas using Python programming language. in the form of a .csv file.

I created a python program which used the data which was in the form of .csv file, the data was of Covid virus cases in India. The dataset was downloaded from kaggle. The datas I got was Till 27th of July 2020.

Steps to extract the data and to visualise the graph:-

  1. Import both the packages Pandas and Matplotlib

2.Using Pandas read the .csv file downloaded by the link given above

3.Filter the required data using pandas

4.Create a list of the datas extracted

5.Plot the graph using Matplotlib

6.End the programme

This was the output of the graph that I made(X-axis and Y-axis are just for my test works not to consider)The data is of Tamil Nadu

Data set :- https://www.kaggle.com/imdevskp/covid19-corona-virus-india- dataset?select=complete.csv

Github Link:-https://github.com/gr8rithic/Covid_Virus_Cases_Graph

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

Rithic Hariharan
Rithic Hariharan

Written by Rithic Hariharan

Tech Enthusiast Blogger Writer Aviation Enthusiast Student

No responses yet

Write a response