- 152
- 7 153 694
Shashank Kalanithi
Приєднався 1 жов 2011
Contact me on Instagram @shashankanalytics
or
Twitter @kalamari95
I help people break into the world of Analytics. Let's learn together!
or
Twitter @kalamari95
I help people break into the world of Analytics. Let's learn together!
Using Code and GPT-3 to Learn Faster
Thanks to ProjectPro.io for their support: bit.ly/3YZvAzE
Today we'll write some simple code to link Notion to GPT-3 to summarize articles in Notion so that we can quickly determine if an article is worth our time to read.
My FREE Courses:
🐍 Python 🐍. : ua-cam.com/video/sZDgJKI8DAM/v-deo.html
⌭ SQL ⌭ : ua-cam.com/video/gwp3dJUsy5g/v-deo.html
📊 Tableau📊 : ua-cam.com/video/Gl2lg-TtRJo/v-deo.html
📈 Statistics 📈 : ua-cam.com/video/wwsizzg6UjU/v-deo.html
🤖 Machine Learning 🤖 : ua-cam.com/video/KLjTAcH7Ikk/v-deo.html
Patreon: www.patreon.com/shashankkalanithi
MX Master 3 amzn.to/3sTroBW
LG 35in Curved Monitor: amzn.to/39pPzR3
USB-C Hub: amzn.to/31Ip8Sl
MacBook Pro Retina 16 Inch: amzn.to/2PSwZde
Twitter: kalamari95
LinkedIn: www.linkedin.com/in/shashankkalanithi/
Today we'll write some simple code to link Notion to GPT-3 to summarize articles in Notion so that we can quickly determine if an article is worth our time to read.
My FREE Courses:
🐍 Python 🐍. : ua-cam.com/video/sZDgJKI8DAM/v-deo.html
⌭ SQL ⌭ : ua-cam.com/video/gwp3dJUsy5g/v-deo.html
📊 Tableau📊 : ua-cam.com/video/Gl2lg-TtRJo/v-deo.html
📈 Statistics 📈 : ua-cam.com/video/wwsizzg6UjU/v-deo.html
🤖 Machine Learning 🤖 : ua-cam.com/video/KLjTAcH7Ikk/v-deo.html
Patreon: www.patreon.com/shashankkalanithi
MX Master 3 amzn.to/3sTroBW
LG 35in Curved Monitor: amzn.to/39pPzR3
USB-C Hub: amzn.to/31Ip8Sl
MacBook Pro Retina 16 Inch: amzn.to/2PSwZde
Twitter: kalamari95
LinkedIn: www.linkedin.com/in/shashankkalanithi/
Переглядів: 8 569
Відео
How Data Science ACTUALLY Works
Переглядів 89 тис.Рік тому
Check out Deepnote for the easiest way to practice your data science skills: deepnote.com/? Dataset used in this video: www.kaggle.com/datasets/thedevastator/airlines-traffic-passenger-statistics Ever wanted to know how a real Data Science team operates? Join me, me, and me, as we take you through a rushed Data Science ask from management. Can we get the work in on time? 00:00 - Introduction 00...
Does Instagram think you live in an influential city? | A deep dive into web data
Переглядів 5 тис.Рік тому
Request this and many other datasets @: brightdata.grsm.io/shashank-datasets What city is the most influential according to Instagram? We'll use a massive 40GB dataset to try and figure this out! 00:00 - Introduction 00:23 - Analysis Start 02:44 - Requirements Gathering 10:43 - Data Cleaning Algorithm 19:32 - Clean Our 40 GB Dataset 27:21 - Creating our ML Algorithm 36:19 - Plot our data 39:10 ...
How I start Data Science Projects | What to do when you're stuck
Переглядів 16 тис.Рік тому
Check out Part 2 here: ua-cam.com/video/lpF5SSgczeE/v-deo.html Check out BrightData here: brightdata.grsm.io/shashank What do you do when you're feeling stuck with a Data Science project? In this video I work with REAL Instagram data and show you how you can start your analyses for any Data Science projects you might be working on. My FREE Courses: 🐍 Python 🐍. : ua-cam.com/video/sZDgJKI8DAM/v-d...
Is the Meta Data Engineering Certificate any good? (as a Data Engineer)
Переглядів 49 тис.Рік тому
Discounted Coursera Plus: imp.i384100.net/7mPer3 Affiliate Link for Coursera: imp.i384100.net/rndvGQ 00:00 - Introduction 01:23 - How I became a data engineer 01:38 - My opinion of the Meta Data Engineer Certification 02:46 - My opinion of the course content 03:02 - Core skills to become a data engineer 08:06 - Final opinion on the cert 08:53 - Will this get you a job 10:37 - Pricing 11:34 - Ge...
A Data Crash Course | 100+ Key Data Concepts
Переглядів 11 тис.Рік тому
CHECK OUT PROJECTPRO.io: bit.ly/3A4IRwd The world of data is HUGE! This is a whirlwind introduction to that world, from where you can branch out into what you find most interesting. 00:00 - Introduction 01:44 - Data Engineering 13:12 - Data Science 29:04: Data Analysis My FREE Courses: 🐍 Python 🐍. : ua-cam.com/video/sZDgJKI8DAM/v-deo.html ⌭ SQL ⌭ : ua-cam.com/video/gwp3dJUsy5g/v-deo.html 📊 Tabl...
Why do Data Engineers Exist?
Переглядів 6 тис.Рік тому
I interview the authors of Fundamentals of Data Engineering: Joe Reis and Matt Housley about what a Data Engineer is, why they exist, and what the future might hold for the industry. CHECK OUT THE BOOK HERE: amzn.to/3QvxbsW My FREE Courses: 🐍 Python 🐍. : ua-cam.com/video/sZDgJKI8DAM/v-deo.html ⌭ SQL ⌭ : ua-cam.com/video/gwp3dJUsy5g/v-deo.html 📊 Tableau📊 : ua-cam.com/video/Gl2lg-TtRJo/v-deo.html...
R for Data Scientists and Data Analysts
Переглядів 12 тис.Рік тому
R for Data Scientists and Data Analysts
Day in the Life of a Data Analyst: Stakeholder ➡️ Co-Worker ➡️ SQL ➡️ Python ➡️ Visualizations
Переглядів 74 тис.Рік тому
Day in the Life of a Data Analyst: Stakeholder ➡️ Co-Worker ➡️ SQL ➡️ Python ➡️ Visualizations
What I think about Data Bootcamps and why I'm launching one
Переглядів 9 тис.2 роки тому
What I think about Data Bootcamps and why I'm launching one
What's the difference in Data Professions?
Переглядів 6 тис.2 роки тому
What's the difference in Data Professions?
Meeting the Author - Storytelling with Data
Переглядів 7 тис.2 роки тому
Meeting the Author - Storytelling with Data
Scraping LinkedIn for the BEST DATA ANALYST Degrees
Переглядів 17 тис.2 роки тому
Scraping LinkedIn for the BEST DATA ANALYST Degrees
Storytelling with Data - an (unofficial) overview - Part 1
Переглядів 25 тис.2 роки тому
Storytelling with Data - an (unofficial) overview - Part 1
how I CRUSH Data Analyst Technical Interviews
Переглядів 39 тис.2 роки тому
how I CRUSH Data Analyst Technical Interviews
How to Start Freelancing as a Data Scientist / Analyst (part-time)
Переглядів 217 тис.2 роки тому
How to Start Freelancing as a Data Scientist / Analyst (part-time)
What's the best day to invest in the stock market (VOO)
Переглядів 9 тис.2 роки тому
What's the best day to invest in the stock market (VOO)
The GOOGLE Guide to Exploratory Data Analysis (w/ code)
Переглядів 18 тис.2 роки тому
The GOOGLE Guide to Exploratory Data Analysis (w/ code)
Beginner's Excel Tutorial - Data Science/Analysis and Beyond
Переглядів 88 тис.2 роки тому
Beginner's Excel Tutorial - Data Science/Analysis and Beyond
I recorded myself for 8 hours (as a data analyst)
Переглядів 20 тис.2 роки тому
I recorded myself for 8 hours (as a data analyst)
How a Data Analysts Solves an SQL Problem (comment a better solution below)
Переглядів 11 тис.2 роки тому
How a Data Analysts Solves an SQL Problem (comment a better solution below)
How to Become a Data Analyst (Updated for 2022)
Переглядів 255 тис.2 роки тому
How to Become a Data Analyst (Updated for 2022)
Machine Learning for Beginners - Data Scientists and Analysts
Переглядів 15 тис.2 роки тому
Machine Learning for Beginners - Data Scientists and Analysts
Exploratory Data Analysis (comment your best insight on the data)
Переглядів 72 тис.2 роки тому
Exploratory Data Analysis (comment your best insight on the data)
Get a Data Science or Data Analyst Role (feat. Nick Singh)
Переглядів 13 тис.2 роки тому
Get a Data Science or Data Analyst Role (feat. Nick Singh)
Practical Statistics for Data Scientists - Chapter 3 - Statistical Experiments Significance Testing
Переглядів 17 тис.2 роки тому
Practical Statistics for Data Scientists - Chapter 3 - Statistical Experiments Significance Testing
What is a DATA ANALYST | by a data analyst
Переглядів 8 тис.2 роки тому
What is a DATA ANALYST | by a data analyst
Practical Statistics for Data Scientists - Chapter 2 - Data and Sampling Distributions
Переглядів 25 тис.2 роки тому
Practical Statistics for Data Scientists - Chapter 2 - Data and Sampling Distributions
Which skill u need to have to work on data scientists/ Analysts
Sire, you did great but you need to speak slowly & patiently. I'm sure cheesecakes taste better than our own words ;)
Great overview Shashank! I miss your videos. You were one of the first youtubers I watched in the data science field about a year ago and helped motivate me to get my first data analyst role last month. Hope all is well and you come back soon
The first vid was great. The second and third video you lost me. A bunch of copy pasta code and feel to much information was stuffed inside a video. Even as a refresher it’s too much.
Can tha minimal_ds kernel be updated 😭🤣🤣
ahm ahm euhm hmm auh ahm ahm ehm.. Loved your video otherwise, very informal for someone working with excel atm.
Thank you so much for the video
Just stunning! Could you make video about a book "story telling with data" by Cole Nassbaumer Knaflic
Excellent content, it’s helpful
Hey Shashank, great video, it was really helpful. Can you please share the notion link.
I've been looking for this tutorial for a whole week and here i found it !! This vid are recommended for anyone of you that looking for melting the data using pandas!!! I Love it!! Keep it up !
Fake American accent😂😂
Column manipulation starts at 1:20:00
Very Good Video Shashank, now i learned little bit about what actually data analytics is. thanks you.
24:18 erroe { "name": "ImportError", "message": "Missing optional dependency 'openpyxl'. Use pip or conda to install openpyxl.", "stack": "--------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call last) File c:\\python3.12\\Lib\\site-packages\\pandas\\compat\\_optional.py:135, in import_optional_dependency(name, extra, errors, min_version) 134 try: --> 135 module = importlib.import_module(name) 136 except ImportError: File c:\\python3.12\\Lib\\importlib\\__init__.py:90, in import_module(name, package) 89 level += 1 ---> 90 return _bootstrap._gcd_import(name[level:], package, level) File <frozen importlib._bootstrap>:1387, in _gcd_import(name, package, level) File <frozen importlib._bootstrap>:1360, in _find_and_load(name, import_) File <frozen importlib._bootstrap>:1324, in _find_and_load_unlocked(name, import_) ModuleNotFoundError: No module named 'openpyxl' During handling of the above exception, another exception occurred: ImportError Traceback (most recent call last) Cell In[19], line 1 ----> 1 dataset = pd.read_excel(pwd + \"\\\\Data - Survey Monkey Output Edited.xlsx\") 2 dataset File c:\\python3.12\\Lib\\site-packages\\pandas\\io\\excel\\_base.py:495, in read_excel(io, sheet_name, header, names, index_col, usecols, dtype, engine, converters, true_values, false_values, skiprows, nrows, na_values, keep_default_na, na_filter, verbose, parse_dates, date_parser, date_format, thousands, decimal, comment, skipfooter, storage_options, dtype_backend, engine_kwargs) 493 if not isinstance(io, ExcelFile): 494 should_close = True --> 495 io = ExcelFile( 496 io, 497 storage_options=storage_options, 498 engine=engine, 499 engine_kwargs=engine_kwargs, 500 ) 501 elif engine and engine != io.engine: 502 raise ValueError( 503 \"Engine should not be specified when passing \" 504 \"an ExcelFile - ExcelFile already has the engine set\" 505 ) File c:\\python3.12\\Lib\\site-packages\\pandas\\io\\excel\\_base.py:1567, in ExcelFile.__init__(self, path_or_buffer, engine, storage_options, engine_kwargs) 1564 self.engine = engine 1565 self.storage_options = storage_options -> 1567 self._reader = self._engines[engine]( 1568 self._io, 1569 storage_options=storage_options, 1570 engine_kwargs=engine_kwargs, 1571 ) File c:\\python3.12\\Lib\\site-packages\\pandas\\io\\excel\\_openpyxl.py:552, in OpenpyxlReader.__init__(self, filepath_or_buffer, storage_options, engine_kwargs) 534 @doc(storage_options=_shared_docs[\"storage_options\"]) 535 def __init__( 536 self, (...) 539 engine_kwargs: dict | None = None, 540 ) -> None: 541 \"\"\" 542 Reader using openpyxl engine. 543 (...) 550 Arbitrary keyword arguments passed to excel engine. 551 \"\"\" --> 552 import_optional_dependency(\"openpyxl\") 553 super().__init__( 554 filepath_or_buffer, 555 storage_options=storage_options, 556 engine_kwargs=engine_kwargs, 557 ) File c:\\python3.12\\Lib\\site-packages\\pandas\\compat\\_optional.py:138, in import_optional_dependency(name, extra, errors, min_version) 136 except ImportError: 137 if errors == \"raise\": --> 138 raise ImportError(msg) 139 return None 141 # Handle submodules: if we have submodule, grab parent module from sys.modules ImportError: Missing optional dependency 'openpyxl'. Use pip or conda to install openpyxl." }
Great video! Thank you Shashank. I like your pace and the way you describe how to progress through the tasks. Just btw, my prof for my MSc used to speak way too slowly, so I used to listen to his recordings at 1.25 to 1.50 speed lol (no need to speed you up, its perfect!)
Pro tip guys: use "&" instead of concatenate in excel, does the same thing
I didn't get it when he said "as a data analyst u will spend alot of time trying to get the data to a format u can use" could someone break it down for me + if there is any advice about how to develop this skill
fix your posture
omg I actually done something like this before on Jupyter. I'm a chip designer but we have terabytes of data so we end up doing rudimentary histograms, averaging, filters, organization by different design families. I had to do this stuff twice, one in unix and once in excel because my boss doesn't understand python and wanted to see all the formulas in excel. That was a real freaking pain using excel formulas. This is an eye opener lol. I always assumed data analyst are doing some crazy advanced mathematics and statistics lol
what is this on? pycharm? vs?
24:34
What certifications did u mean??
Great video Shashank! Working on improving your filler phrases "ums" and "uhs" will further improve your viewer retention! Big love!
At 2:12:10, I got some of those unix_time values as negative. What could be the reason? Help
I am stuck at 2:06:24 computer_date = pd.to_datetime(netflix["date_added"]) it's giving me an error called ValueError: time data " August 4, 2017" doesn't match format "%B %d, %Y", at position 218. Please help me.
passing a format='mixed' to the to_datetime method helped.
I wish schools taught these. Not those lame ass basic irrelevant ones.
Thankyou for this. This was Gold.❤
Concatenate is there as a backward compatability because that was the first function they created. Concat is the newer one which can take arrays as arguments. Concatenate cannot accept arrays as arguments, I mean it can but it would spill it over the adjacent cells, as opposed to the newer Concat which would concatenate the strings as expected. In Excel, you can opt to use Power Query for the transformations.
More like this please, i like that it comes from a textbook and also simplified
It is so generous of you to show a real work flow here. A "what DA do daily" vlog isn't help ppl to know nothing, I don't want to see what your lunch is. Your logic is so clear and your skills definitly are sophisticated, I gained a lot from this as a merely beginner. Well done! Many many appreciations.
Thus was so awesome
Shout out to Transpose now a days
Thank you so much, I've watched countless videos. This is the first showing the actual work.
00:00 Learn how to transform SurveyMonkey data from wide to long format for better visualization. 05:55 Concatenate and collapse rows to simplify data in Excel 18:16 Naming files is important for data organization 23:44 Data columns are being dropped to improve performance 35:21 Python can clean and unpivot data easily 41:01 Manipulating data using Python and Pandas 52:24 Learn how to join data using left-hand, inner, and right-hand joins in SQL and R. 58:02 Aggregate data by counting unique respondent IDs. 1:11:34 Learned how to import, rename, unpivot, join, and aggregate data in pandas 1:17:09 Closing remarks
he is the reality in expectation vs reality
This was fantastic, thank you! A video of how to navigate with the ambiguity of stakeholders would be super helpful!!
This is hilarious 😂. You know panda so well and can't remember if it's a colon. That is why I want to be a data analyst.
hey, let me know one important thing... is it true that DA mostly work with Excel rather than Python? so if I learn Excel, Power BI, then am closer to a DA profile? and is Python optional? am talking based on statement of CodeBasics platform(founder & Co-Founder is >12Y experienced in US).
SQL,EXCEL,POWERBI FIRST AND FOREMOST. THEN PYTHON-PANDAS.
Dude…no… go back to school .. stop baiting jr devs … Da is not this, not DA that actually adds value
dataset_merged = pd.merge(left=dataset_melted, right=questions, how="left", left_on="Question + Subquestion", right_on="Question + Subquestion") print("Original Data", len(dataset_melted)) print("Merged Data", len(dataset_merged)) dataset_merged KeyError: 'Question + Subquestion' why am i getting key error here?
"Skillfully Underbid" is such valuble info for a beginner in any industry
Practical Statistics for Data Scientists, Do you prepare only chapter one or there's other chapters that you prepare? Continue the good work.
pwd actually means "print working directory" and it's a linux command. It would be redundant to have a present working directory and a current working directory. Also it's better practice to have all bare imports above your import as'.
Its Thursday !!
Доброе утро 🎉🎉 Вас смотреть и слушать очень интересно и приятно. Я ✍️ Ся, вы?
As a data analyst, all I do is CODE. But I love it! Interesting to see how my job functions align more with your description of Data Engineer.
Its crazy he was in excel but didnt use powerquery to transpose and unpivot
Taking notes 📝
one tip if you want to apply quotes on multiple phrases (on visual studio, not sure if code works too): - put each phrase into a line - hold alt and move mouse in vertical on the beginning of selection, you'll select multiple lines - press quote to quote the before the first letter, press end to move to end of all lines and quote again to quote past the last letter.