Juzef the Koala is trying to determine in which area he is an expert
Yesterday, I completed one more minor achievement on my own – I was able to receive enough upvotes on my works to receive a status of Kaggle Notebooks Expert. I’ve set it as a personal goal for myself for the end of this year, but thought that it is too optimistic (I will explain later why), and just continued ‘dancing as nobody sees it’.
A second talk I had in Zurich was related to my brave attempt to parse data about last MeasureCamps – fortunately for me, it was not so hard due to usage Replit as a digital session board solution by organizers of last events.
Attaching here links to notebooks with analysis:
– Descriptive overview/EDA of topics, levels, focus areas, locations, and presenters count of the latest MeasureCamps: https://lnkd.in/dECwApyA – Cluster analysis of MeasureCamp sessions descriptions & titles with the usage of BERtopic framework: https://lnkd.in/dXjny7yW – A dataset – I deployed it on Kaggle too: https://lnkd.in/dh5zeda5
One month ago I had a chance to visit Zurich to present there two topic. One of that is related to application of PCA + k-means for user personas definition – attaching here a deck.
Juzef the Koala with friends is trying to study how to produce eucalyptus
I decided to continue my journey with Kaggle and different datasets there, and to combine it with learning something new. While I used to check there some neutral datasets, and disappointed myself again with the inability to compose correct ML techniques on competitions, I still like to check what I can do with video games datasets, or try to extract some gaming-related stuff in common data sources.
Juzef the Koala, as he was captured in his childhood. He plays a game titled “Super Eucalyptus Bros.”
For last few weeks I’m obsessed with Kaggle: https://www.kaggle.com/ – a platform for data scientists, ML/AI engineers and data analysts. For significant share of people it is one of entry points in a machine learning world – just because of infinite variety of competitions, datasets, examples of notebooks and relatively strong community.
Juzef The Koala on vacation (as it is imagined by Microsoft Copilot)
Need to mention one more fact about myself – I really like computer games. I cannot say that I have enough time to spend with this hobby, but I still continue monitoring this industry and also intact with major platforms responsible for video games distribution.
And one more fact about me – sometimes I like to investigate if any such platform has an option somehow to extract data, either by API or by other more or less legal methods.
Today I decided to pay attention to GOG.com, one of the most noticeable PC gaming-related video games platform, focused on video games preservation and indie gaming support. In this article I will try to show how to extract data from GOG (basic stats about video games on platform and reviews data), and to discuss on how it can be used.