Konnichi wa! 🇯🇵
A good portion of today's academic research in AI is centered around Machine Learning. But most researchers concur that even the most advanced algorithms, including variants of Deep Learning, won't be enough to construct genuine artificial intelligence 🤖. Melanie Mitchell, a professor at the Santa Fe Institute, discusses promising ideas focusing on human ability to draw analogies in this short video for Quanta Magazine. The full interview is also available here.
Moving on, here's an impressive blog post that lays out the implementation of a "diff" tool in SQL. "Diff" is a technical term for explanation algorithms that aid in finding highly plausible explanations for differences detected between two datasets of the same format. An ad hoc way to tackle this problem is by training an interpretable machine learning model (like a decision tree) to distinguish the two datasets. The approach outlined here is particularly intriguing as it can be implemented directly in SQL.
Finally, it's always worth reflecting on how to simplify your implementations and reconsider your technical choices. This blog post (from 2014! 👴🏻) brilliantly illustrates this point by showcasing a computing speed improvement of over 200 times when switching from a Hadoop implementation to a clever use of Linux command-line tools 🐧. For a deeper dive into the topic, a good starting point is Jeroen Janssens's book, Data Science at the Command Line. The second edition was released a few months ago, and you can read it for free online.
Here's to a week of productive studying! 🤓