Large Language Models in Molecular Biology

Will we ever decipher the language of molecular biology? Here, I argue that we are just a few years away from having accurate in silico models of the primary biomolecular information highway — from…

Simplify Your Data Preparation with These Four Lesser-Known Scikit-Learn Classes

Simplify Your Data Preparation With These 4 Lesser-Known Scikit-Learn Classes. Forget train_test_split: Pipeline, ColumnTransformer, FeatureUnion and FunctionTransformer are indispensable even if you use XGBoost or….

Data Ticket Takers vs. Decision Makers

Fundamentally, there are two different types of data teams in this world. There are those who are reactive to the wants of the organization, and then there are those who proactively lead the…

4 Reasons Why I Won’t Sign the “Existential Risk” New Statement

Some weeks ago, I published my pro and con arguments for signing that very well-known open letter by the Future of Life Institute — in the end, I signed it, though there were some caveats. A few…

Predicting the Functionality of Water Pumps with XGBoost

∘ Introduction ∘ Objective ∘ Tools/Frameworks ∘ Exploratory Data Analysis ∘ Feature Engineering ∘ Creating Training and Testing Splits ∘ Determining the Evaluation Metric ∘ Creating Baseline…

3D Deep Learning Python Tutorial: PointNet Data Preparation

The Ultimate Python Guide to structure large LiDAR point cloud for training a 3D Deep Learning Semantic Segmentation Model with the PointNet Architecture.

7 Signs You’ve Become an Advanced Sklearn User Without Even Realizing It

Can you be a master, an advanced Sklearn user without even realizing it. Find out using these seven signs!

Understanding ChatGPT Plugins: Benefits, Risks, and Future Developments

This article covers how plugins work, the necessity that drove their integration into ChatGPT, and the transformative impact they'll have on...

Investigating Unconscious Bias in AI Systems

AI systems operate based on the data used to train them. For chat-GPT, the exact nature of the curation process behind the datasets used to train it isn’t fully understood. What we can…

Why trust and safety in enterprise AI is (relatively) easy

In Part 1 of this series, I said something that I’d thought I’d never say: when we’re dealing with typical enterprise-scale AI systems, trust and safety is easy. Yes, okay, it’s actually pretty hard…