Blog
This blog is a curated collection of my thoughts on AI, technology, and business—originally shared on LinkedIn and expanded here. You'll find research paper analyses broken down for practical application, non-technical explanations of complex AI concepts, reflections on industry trends, and actionable insights for leveraging AI in real-world scenarios. Each post aims to cut through the hype and deliver substance that matters.
-
Rewriting from scratch is increasingly viable due to AI-assisted coding. Someday, all SaaS might be rebuilt every 3 months, with AI ensuring parity & no regression via AI auto-tests based on hi...
-
Qwen releases QwQ-32B, a small reasoning model that rivals DeepSeek-R1 and o1-mini.
-
You should encourage employees to wear Meta glasses to capture tribal knowledge for AI coding agents. And here is why ⬇️
-
People saying “with AI you take 3 min to generate code and 2 hours to debug it” or things along those lines are just bad software engineers.
-
🚀 18 lessons to develop better products using LLMs:
-
Developers who don’t use AI-assisted coding are already falling behind.
-
🐔 Don’t trust “AI experts”.
-
ColBERT: Contextualized Late Interaction over BERT
🍄 For RAG, and generally any semantic matching task, try ColBERT!
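For intuition, ColBERT's "late interaction" scores a query against a document with MaxSim: for each query token embedding, take its best similarity over all document token embeddings, then sum those maxima. A toy sketch, where plain lists stand in for the BERT token embeddings ColBERT actually uses:

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def maxsim_score(query_embs, doc_embs):
    """ColBERT-style late interaction: sum over query tokens of the
    max dot-product similarity with any document token."""
    return sum(max(dot(q, d) for d in doc_embs) for q in query_embs)

# Toy 2-d "embeddings" for two query tokens and three document tokens.
query = [[1.0, 0.0], [0.0, 1.0]]
doc = [[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]]
print(round(maxsim_score(query, doc), 6))  # 0.9 + 0.8 = 1.7
```

Because each query token only needs a max over precomputed document embeddings, document encoding can happen offline, which is what makes the "late" interaction cheap at query time.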
-
🙋‍♂️ The AI product market is overcrowded—not with effective tools, but with promises.
-
⚡ TLDRs on Google’s Gemma, spoiler: don’t use Gemma 7B (yet?)
-
🚨 New model from Mistral AI: Mistral Large!
-
🥋 How do you continue training on an already pre-trained LLM: TLDRs of the “Simple and Scalable Strategies to Continually Pre-train LLMs” paper
-
🎧 Some takeaways I got from the conversation between Lex Fridman and Sam Altman:
-
Anyscale again! They built a FREE model comparator, where you can evaluate 3 open-source LLMs simultaneously using the same prompt. They support Llama 2 7B, 13B, and 70B, among 8 others.
-
🌟 Amazing post by Yi Tay about the challenges of training LLMs as a startup! “Training great LLMs entirely from ground up in the wilderness as a startup”
-
If you are looking for a product manager, here is one. Abdel has contributed to shape my view in roadmap planning, feature definition, market analysis and more. This guy likes his job, sometimes to...
-
😱 Did you know? Cosmic rays are one of the greatest threats to interplanetary travel with people onboard the spacecraft, and… they can make your LLM training fail.
-
Hugging Face was a sassy chatbot.
-
🐬 Takeaways from the Mixtral paper with no chitchat.
-
🤠 OCR just got better.
-
What you need to know about Groq and LPUs: How can Groq run LLMs so much faster than the competition?
-
🤔 How do we get LLMs to know what a software bug is without making them write buggy code? A non-technical dive into alignment.
-
Amazing read by M Waleed Kadous from Anyscale: https://lnkd.in/e77fUTiv
-
🤌 We often talk about the sample efficiency of ML models and compare it to that of humans.
-
🦎 Token facts cheat sheet for practical estimations
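As an example of the kind of fact such a cheat sheet relies on: for GPT-style BPE tokenizers on English text, one token is roughly 4 characters or about 0.75 words. A back-of-the-envelope estimator using those rules of thumb (averaging the two heuristics is my own choice, not from the post):

```python
def estimate_tokens(text: str) -> int:
    """Rough token-count estimate for English text with a GPT-style
    BPE tokenizer, using the common ~4 chars/token and ~0.75 words/token
    rules of thumb."""
    by_chars = len(text) / 4            # heuristic 1: 4 characters per token
    by_words = len(text.split()) / 0.75  # heuristic 2: 0.75 words per token
    return round((by_chars + by_words) / 2)  # average the two estimates

sample = "Tokens are the basic units language models read and write."
print(estimate_tokens(sample))  # 14
```

For anything where the count actually matters (billing, context-window limits), run the real tokenizer instead of a heuristic.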
-
A lot of people, when they hear that we are using the logarithm as a trick to go from a multiplication to an addition, think that we are diverting the log from its original purpose, that we are “hacking” it.
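It is no hack: the identity log(ab) = log a + log b is exactly why log-probabilities get summed instead of multiplying raw probabilities, and it also sidesteps floating-point underflow:

```python
import math

# Multiplying many small probabilities underflows to 0.0 in floating point,
# but summing their logs stays comfortably within range.
probs = [1e-5] * 100

product = 1.0
for p in probs:
    product *= p  # 1e-500 is far below the smallest positive float: underflows

log_sum = sum(math.log(p) for p in probs)  # 100 * ln(1e-5), perfectly representable

print(product)   # 0.0 (underflow)
print(log_sum)   # ≈ -1151.29
```

The sum of logs carries the same information as the product, so comparisons and argmax over likelihoods are unchanged.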
-
Mathematical intuition is an amazing tool, but it has its limits.
-
One way a project that was “supposed” to bring in big money ends up being a money waster.
-
One cool idea behind deep learning is the manifold hypothesis.
-
Meta AI published a blogpost titled “Using AI to bring children’s drawings to life”.
-
Leetcode-style coding interviews are great because: they allow companies to efficiently filter applications; they are standard and transparent, which makes them relatively fair; they show yo...
-
Learning rate schedules allow for changing the learning rate while training, instead of having the same learning rate for every batch of every epoch.
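For illustration (not from the post), one popular schedule, linear warmup followed by cosine decay, fits in a few lines:

```python
import math

def lr_at_step(step, total_steps, base_lr=3e-4, warmup_steps=100):
    """Linear warmup to base_lr, then cosine decay to zero."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps  # ramp up linearly from 0
    progress = (step - warmup_steps) / (total_steps - warmup_steps)
    return base_lr * 0.5 * (1 + math.cos(math.pi * progress))  # cosine decay

# The rate rises during warmup, peaks at base_lr, then decays smoothly.
print(lr_at_step(50, 1000))    # mid-warmup: half of base_lr
print(lr_at_step(100, 1000))   # peak: base_lr
print(lr_at_step(1000, 1000))  # end of training: ~0
```

The warmup avoids huge early updates while statistics like Adam's moment estimates are still noisy; the decay lets the model settle into a minimum.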
-
ReLU is so dominant in the field because it embraces the fact that all a neural network does is slice and dice the input space, linear transformation after linear transformation, hyperplane after h...
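To make the hyperplane picture concrete (an illustrative toy, not from the post): a single ReLU unit splits its input space along w·x + b = 0, and the network is a different affine function on each side, i.e. piecewise linear:

```python
def relu(z):
    return max(z, 0.0)

# One hidden ReLU unit: the point x = 1 splits the 1-d input space in two.
# Left of the split the unit is off, so the output is constant; right of it
# the output is linear in x. Stacking units and layers just adds more cuts.
def tiny_net(x, w=1.0, b=-1.0, v=2.0, c=0.5):
    return v * relu(w * x + b) + c

print(tiny_net(0.0))  # 0.5  (unit off: constant region)
print(tiny_net(2.0))  # 2.5  (unit on: 2*(2-1) + 0.5)
```

With many units, each one contributes its own hyperplane cut, and the composition of layers is what turns those flat pieces into arbitrarily fine approximations.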
-
To train a classifier, you need a dataset. When you don’t have a dataset, you build one by asking humans to classify your examples.
-
A human brain requires less data to be trained on a given task than a neural net, or does it?
-
1) No product that has been called “AI” as of today is intelligent. 2) “AI” that is not machine learning will never achieve intelligence. 3) Deep learning may not be the tool to build intelligent s...
-
The most impactful artists of tomorrow will be the ones who know where to sail to on the latent space of art.
-
GauGAN2, NVIDIA’s model, keeps amazing me. Aside from producing crazy photorealistic landscapes, it can also make some stunning semi-abstract images like this one, where I just made a few strokes and ...
-
Machine learning uses optimization, but with a slightly different approach than traditional optimization.
-
Some Friday AI fun: Can a gorilla ride a camel? This is the...