Receive these posts by email like 150k+ others!

    Top Themes in Data Transcript by @ttunguz

    Venture Capitalist at Theory

    About / Categories / Subscribe / Twitter

    11 minute read / Jan 22, 2025 /

    Top Themes in Data Transcript

    Slide 1

    Clearing: While data world consolidates, capabilities have exploded with AI.

    Content:

    Slide 2

    Clearing: My name is Tomasz Tunguz, founder and general partner at Theory.

    Content:

    Transition:

    Slide 3

    Clearing: Every transformation follows a pattern. Today, three powerful movements are reshaping how enterprises work with data.

    Content:

    Transition:

    Slide 4

    Clearing: Let’s talk about the great consolidation.

    Content:

    Transition:

    Slide 5

    Clearing: Buyers are overwhelmed. I’m hearing more and more of them say, “Don’t sell me another tool!”

    Content:

    Transition:

    Slide 6

    Clearing: That MacBook Pro should be called a mainframe pro. It’s just that powerful.

    Content:

    Transition:

    Slide 7

    Clearing: Decoupling storage and computers all about Unlocking flexibility.

    Content:

    Transition:

    Slide 8

    Clearing: AI is changing the way software and data engineering teams work together.

    Content:

    Transition:

    Slide 9

    Clearing: Historically, there’s been a divide between software engineering and AI/ML teams.

    Content:

    Transition:

    Slide 10

    Clearing: AI is a core part of many products, and in the future, every software company will be an AI company.

    Content:

    Transition:

    Slide 11

    Clearing: In the 24 months after chatGPT3 was released, a parameter race was unleashed where the sizes of models became ever larger, culminating most recently with Lama 3.3 at 450 billion parameters.

    Content:

    Transition:

    Slide 12

    Clearing: Databricks’ most recent state of data report published earlier this year. Small models are the most popular.

    Content:

    Transition:

    Slide 13

    Clearing: Plotting MMLU or high school equivalency over time, you can see that small, medium, and large models are converging around 70 to 80% accuracy.

    Content:

    Transition:

    Slide 14

    Clearing: In addition, smaller models offer significantly better latency.

    Content:

    Transition:

    Slide 15

    Clearing: Docspot tracks these prices and plots them on a logarithmic chart.

    Content:

    Transition:

    Slide 16

    Clearing: Data modeling isn’t just back - it’s become the foundation of reliable AI.

    Content:

    Transition:

    Slide 17

    Clearing: Here I created a little TypeScript application that processes the famous FAA data. I did this in 15 minutes.

    Content:

    Transition:

    Slide 18

    Clearing: Many other organizations, the leading organizations are starting to use AI in a pretty meaningful way.

    Content:

    Transition:

    Slide 19

    Clearing: Data governance isn’t about control anymore - it’s about enablement.

    Content:

    Transition:

    Slide 20

    Clearing: The business intelligence ecosystem has been a pendulum oscillating between centralized and decentralized control.

    Content:

    Transition:

    Slide 21

    Clearing: I believe data pipelines are the backbone of any modern AI system.

    Content:

    Transition:

    Slide 22

    Clearing: This slide really captures the essence of why intelligent data pipelines are so vital.

    Content:

    Slide 23

    Clearing: Every transformation follows a pattern. Today, three powerful movements are reshaping how enterprises work with data.

    Content:

    Transition:


    Read More:

    What DeepSeek's Newest Model Means for AI