Analyzing Geospatial Data with Python (Part 2 — Hypothesis Test) | by Gustavo Santos | Aug, 2023

Gustavo Santos

Learning about geospatial hypothesis tests for Asheville’s Airbnb listings. (Photo: Blue Ridge Mountains in Asheville, NC, from the author’s personal collection.) In the first post, linked below, we introduced geospatial data analysis: we downloaded the Airbnb listings for the city of Asheville, North Carolina (USA), and went through … Read more

The Research Agent: Addressing the Challenge of Answering Questions Based on a Large Text Corpus | by Rahul Nayak | Aug, 2023

The first question one may ask is why not just use the ChatGPT interface and ask questions. It has been trained on a humongous volume of Internet data generated up to 2021, so a text corpus like the Mahabharata is known to it. That was my first approach. I asked ChatGPT several questions about … Read more

Free From Limitations: The Validation of Machine Hallucinations at MoMA | by Christian Burke | Aug, 2023

Christian Burke

(Photo by Jamison McAndie on Unsplash.) Since 1929, the Museum of Modern Art (MoMA) in New York City has served as an art lover’s mecca. It’s a lighthouse that shines a light on avant-garde paintings and sculptures, and since the definition of “modern art” is continually in flux, its collections are, too. Now, this distinguished … Read more

Regression and Bayesian Methods in Modern Preference Elicitation | by Ouaguenouni Mohamed | Aug, 2023

The Bayes framework identifies two principal components: the data D and the model w. By specifying the likelihood P(D∣w) and a prior over the model P(w), we aim to find the model that maximizes the posterior P(w∣D), derived via Bayes’ theorem as: P(w∣D) = P(D∣w) P(w) / P(D). In preference learning, having a distribution over w offers the advantage of capturing … Read more

Class Imbalance: From SMOTE to SMOTE-NC and SMOTE-N | by Essam Wisam | Aug, 2023

Essam Wisam

Exploring three algorithms to tackle the class imbalance problem. In the previous story, we explained how the naive random oversampling and random oversampling examples (ROSE) algorithms work. More importantly, we also defined the class imbalance problem and intuitively derived solutions for it. I highly recommend checking that story to ensure a clear understanding of class … Read more

Helpful Tools and Services for Your Business

In today’s competitive business landscape, the tools and services you deploy can make the difference between success and stagnation. Whether you’re a startup or an established enterprise, it’s crucial to keep pace with the latest advancements and resources available. This article dives deep into some of the must-have tools and services, ranging from data analytics … Read more

Geospatial Data Engineering: Spatial Indexing | by Dea Bardhoshi | Aug, 2023

Dea Bardhoshi

Optimizing queries, improving runtimes, and geospatial data science applications. (Photo by Tamas Tuzes-Katai on Unsplash.) In geospatial data science work, it is very important to think about optimizing the code you write. How can you make datasets with hundreds of millions of rows aggregate or join faster? This is where concepts such as … Read more

How Does the Uncertainty Principle Limit Time Series Analysis? | by Rodrigo da Motta | Aug, 2023

Rodrigo da Motta

Why we can’t extract precise time and frequency information from a time series simultaneously, and how wavelet analysis can tackle this limitation. (Photo by Jamie Street on Unsplash.) The connection between the Fourier Transform, Uncertainty Principle, and time series analysis unveils a fascinating interplay that shapes the extraction of simultaneous temporal and frequency information. To … Read more

By 2030, AI Could Create These 7 Kinds of Jobs

ChatGPT has taken the world by storm. It already has over 100 million users and is the single fastest-growing product ever made. (Image by author.) Without a doubt, it will disrupt how we work and operate. It has already completely changed my work as a data scientist. Other companies aren’t far behind OpenAI. There … Read more