Sampling Techniques in Data Analysis

Considerable emphasis is given to the analytical methods and algorithms used in data science projects, extracting meaningful insights from data and discovering valuable information. But equally as…

The Ultimate Guide to Training BERT from Scratch: The Tokenizer

The Ultimate Guide to Training BERT's tokenizer from Scratch

Digital Transformation in Finance: How Machine Learning is Redefining Financial Services and Overcoming Technology…

When talking about digital transformation in financial services, two things come to mind as critical issues: the importance of traditional finance function and the cost of “technology debt”. In this…

medium.com_towards-ai 2 weeks ago

Cracking the Code of Large Language Models: What Databricks Taught Me

Learn to build your own end-to-end production-ready LLM workflows by developing custom models using techniques like fine tuning, Flash Attention, LoRa, AliBi, PEFT

medium.com_towards-ai 2 weeks ago

AI Image Fusion and DGX GH200

In the realm of Computer Vision (CV), the ability to stitch together partial images and measure dimensions isn’t just an advanced trick — it’s a vital skill. Whether you’re creating a panoramic view…

medium.com_towards-ai 2 weeks ago

Ways to Iterate JavaScript Arrays

for loop is an iterative statement, it checks for some conditions and then executes a block of code repeatedly as long as those conditions are met. for…in loop is an easier way to loop through arrays…

medium.com_towards-ai 2 weeks ago

Prompt Engineering to Leverage In-Context Learning in Large Language Models

Large Language Models are more and more used and their skills are surprising. Part of their success is their ability to learn from a few examples, a phenomenon known as in-context learning; in the…

medium.com_towards-ai 2 weeks ago

What’s New in Pandas 2.1

pandas 2.1 was released on August 30th 2023. Let’s take a look at the things this release introduces and how it will help us improving our pandas workloads. It includes a bunch of improvements and…

How to Boost Pandas Speed And Process 10M-row Datasets in Milliseconds

I think I have said that countless times for the past three years I have been using Pandas. The most recent one I saw said, “make Pandas 71,803 times faster”. But I won’t give you that kind of…

medium.com_towards-ai 2 weeks ago

Model Explainability, Revisited: SHAP and Beyond

The rapid rise of large language models has dominated much of the conversation around AI in recent months—which is understandable, given LLMs’ novelty and the speed of their integration into the…