The Scandal at Kaggle
As I write these words, I marvel at my silver medal from the 2024 Automated Essay Scoring competition on Kaggle. This competition will go down… Read More »The Scandal at Kaggle
TL;DR: We built a bot that suggests a meaningful response to an ongoing conversation thread on GitHub. This bot can serve as a coding and… Read More »Compound AI Systems: Building a GitHub bot with Llama 3 and dltHub
Apple announced their new high-end Mac Pro desktop with 24 CPU cores, up to 76 GPU cores, 192 GB of memory and 800 GB/s of system memory… Read More »A DuckDB moment for application servers?
Last week I had the chance to visit a major global fashion retailer and give an industry talk on Real-time AI. This company was hosting… Read More »Time Value of Data: The Summit of Now and the Peak of Soon After
I am playing with Graphext – it’s like Trifacta, but with more powerful data science functionality. If you are a product manager or finance person,… Read More »Graphext, data insights for non-data scientists
About a month ago I wrote a 3-part blog series (parts 1, 2, and 3) on predicting user engagement with news in Reddit communities (subreddits).… Read More »Predicting user engagement with news on Reddit using Kaggle or Colab
What happens if you take a huge cross-section of the world’s news (The GDELT Project), mix it with the biggest online discussion website, and try… Read More »Predicting social engagement for the world’s news with TensorFlow and Cloud Dataflow: Part 1
I was inspired to write this post by one I saw today on the French presidential election. Plutchik is really the strongest framework I… Read More »Opinion Analysis of Text using Plutchik
This is part two of the Sirocco “modernization” series. In the old, SharpNLP version of Sirocco, we used WordNet version 2.7 to… Read More »Selecting a Java WordNet API for lemma lookups