ChatGPT and many other LLMs have led the way for creating LLM-based applications in different domains. These models are extremely powerful at processing text inputs and creating text outputs based on…
The Poisson distribution is a ubiquitous discrete probability distribution. It was published by Siméon Denis Poisson in the early 19th century and since found applications in many industries…
The article explains what duck type compatibility and consistent-with mean in Python type hinting, and how to use them to write expressive type hints.
Today, we’re going to explore a crucial decision that researchers and practitioners face when training machine and deep learning models: Should we stick to a fixed custom dataset or embrace the power…
Meet Falcon LLM: The New Foundation Model that Quickly Top the Open LLM Leaderboard. The model has become one of the most interesting open-source foundation models in the space..
So far, we have seen regression methods to find the relationship between the different variables. Here we are going to use a tree-based approach. Most of you might know what a tree data structure is…
Prior to founding Data-Centric Solutions, I spearheaded the development, execution, and implementation of the AI strategy at an established bank. Although relatively green in…
This post investigates where we can use PyArrow to improve our pandas and Dask workflows right now. General support for PyArrow dtypes was added with pandas 2.0 to pandas and Dask. This solves a…
Dramatically speed up reusable computationally intensive tasks such as using an LLM or expensive database queries with lru_cache.
Recently, I’ve pondered how I can provide equal value to both technical and business-oriented professionals in my writings. Fortunately, my role as a data science consultant naturally offers a wealth…