Search Captions & Ask AI

How Does AI Actually Work?

October 24, 2023 / 01:06:41

This episode features David and Danu Banga, Google's Director of Generative AI, discussing artificial intelligence, machine learning, and generative AI. Key topics include the definitions of AI, machine learning, and deep learning, as well as the implications of generative AI in various industries.

Danu explains that AI is a system of tools and techniques designed to mimic human cognitive abilities. He clarifies the differences between AI, machine learning, and deep learning, emphasizing that deep learning is a subset of machine learning focused on neural networks.

The conversation also covers the significance of the Transformer architecture in generative AI, which allows for the generation of content across different media, such as text and images. Danu highlights how generative AI can enhance productivity and creativity in various fields.

They address the challenges of AI hallucinations, where models generate incorrect information, and discuss the importance of grounding AI outputs in reality. Danu shares insights on the future of AI, emphasizing the potential for democratizing creativity and product development.

The episode concludes with a light-hearted typing challenge, showcasing Danu's skills on the keyboard.

TL;DR

David and Danu Banga discuss AI, machine learning, generative AI, and its impact on industries and creativity.

Episode

1:06:41
00:00:00
what's up people of the internet people of the internet yes it's David uh today we got a little bonus episode for you
00:00:07
don't worry we've got a regular episode on Friday still so stay tuned for that but I wanted to dig a little bit deep
00:00:12
into what exactly AI is right I think we all have been hearing about that for
00:00:18
months now um years possibly but nobody actually has really explained like what
00:00:24
it is or how it works right like people just say that things are AI but what does that even mean um
00:00:30
so I want to get an answer for that so I called up my friend from Google who definitely knows what that means and we
00:00:36
have a little nice conversation about how this all works so hope you enjoy uh
00:00:41
Daniel was gracious enough to come on the podcast and of course we met at my Cafe classic uh so yeah we're going to
00:00:48
debrief after this but um
00:00:55
enjoy we've been talking a lot about uh Ai and generative Ai and all of the
00:01:01
stuff that's happening in the world right now and it's very confusing so we thought it would be actually pretty useful if we got someone who knew what
00:01:07
they were talking about to come on the podcast we weren't just speculating constantly so today uh we have denu
00:01:14
banga with us uh he is Google's head of generative AI or director of generative
00:01:20
AI so we're going to have a long conversation about what that means all right so danu if you were to explain to
00:01:27
someone including me what you do at Google goal or as your job what would
00:01:32
that be we we try to incubate uh generative AI Solutions into production
00:01:40
grade applications for uh companies startups and or Enterprises so that
00:01:45
would that include like a company comes to you they say we want to use generative Ai and then you work with
00:01:50
them to actually integrate it into their product that's correct uh but we do we do have many other teams that really
00:01:55
focus on that longtail work of integration so to say our interest is to figure out what are the patterns that
00:02:02
are not necessarily common at this point or the new ones and then really turn that into 10x scare packages so as you
00:02:09
know many of these technology items especially within the AI space are fairly new right so they demand new
00:02:16
technologies they demand new approaches to Technologies and then what we do is to try to figure out what are the
00:02:21
patterns within this a bit open ecosystem at this point and package these patterns into applications that we
00:02:28
can now either to the teams that are more consistently working on with with with customers on putting that into
00:02:35
their product or open source these capabilities so that some folks can use that so instead of just throwing in a
00:02:42
chatbot that is based on a large language model you're actually integrating a specific solution that
00:02:47
makes more sense to that company exactly think about the early days of programming when when people were writing code right so you have a bunch
00:02:54
of folks writing programs and then sometime I think about the ' 80s and the '90s there's this coming pattern around
00:03:00
design patterns that emerged where some folks would say hey put these things
00:03:05
together and then it's going to be called the specific pattern so to say and then based on the design pattern you
00:03:11
somwhat create a new language and a new mechanism for people to use technology in a bit more consistent manner so it's
00:03:19
it's what we do so we try to understand what are the design patterns of AI and geni and then put that into either
00:03:26
technology and or educational artifacts for people to use so I want to get into a little bit about what AI actually is
00:03:34
cuz we talk a lot about Ai and generative Ai and all the stuff on the podcast cuz it's like the only
00:03:39
conversation happening right now and for the last year um but I think that something that confuses a lot of people
00:03:45
is the fact that you see all these companies that are saying we have ai now we have ai now and nobody really knows
00:03:51
what that means sometimes it means they added a large language model chatbot sometimes it means they added some stuff
00:03:57
under the hood that is doing a lot of work sometimes it means they're just rebranding something that wasn't really AI into AI so in your words what is AI
00:04:06
in relation to what we're seeing in the industry right now so
00:04:12
um to me right AI is uh it's a system so to say it's a collection of tools
00:04:19
techniques science and engineering capabilities um and we get when we get to talk about generative AI I'm also
00:04:25
going to talk about it in terms of a system because I don't think it's necessarily one single thing m
00:04:30
um and it has evolved over time but if you look at that system that AI system overall um it's it's again like I said a
00:04:37
collection of tools and technologies that are really geared towards uh providing human cognitive capabilities
00:04:44
to uh computers and make it so that these computers can accelerate the
00:04:49
processes through which we uh we produce um different things in technology so you
00:04:55
can think of AI as being a collection of planning and scheduling and sensing the
00:05:00
the world around us and understanding that world into a set of cognitive uh uh
00:05:08
container so to say and being able to do other things out of that level of
00:05:13
understanding so AI is somewhat bringing intelligence human intelligence analog to the computers overall and so I know
00:05:21
that's a bit of a complex definition but that's that's where we are now in understanding that as a system and when
00:05:27
you try to break that down into what that means in terms of Technology then it comes in three major forms um one of
00:05:34
a major form which is a umbrella term which is AI it encompasses things like uh like I say planning sensing
00:05:40
scheduling and then processing that data that you sensed um with a a certain list
00:05:46
of tools and then these tools are usually borrowed from the mathematical worlds of uh statistics and probability
00:05:54
and then combining the collection of these tools is what we traditionally call machine learning MH um so AI is
00:06:00
bigger than machine learning and then within machine learning you have a set of tools that are mathematical statistical and whatnot and then the
00:06:07
subset of these tools which is also the essence of uh generative AI is a family
00:06:12
of techniques called Deep learning and so deep learning is uh involved with
00:06:18
using neural networks so to say which is almost an artificial representation or
00:06:24
analog or modeling of what the brain could possibly look like to the full extent of our understanding of it and
00:06:32
try to trying to represent essentially a data structure that would be used to process um and set of techniques that
00:06:39
would be used to process the data that is sensed just to reel it back for a sec so that people understand the difference
00:06:44
between those three can you in a couple of sentences Define the difference
00:06:50
between machine learning or like individually what is machine learning what is deep learning and then what is um what was the third one you said AI I
00:06:58
guess AI yeah between those three can you can you define them in like two sentences each got it so with AI you
00:07:05
want the machine to do things that seem human sort of say right imagine being
00:07:11
here and someone asks you hey David what is the color of the car in the garage you would have to do a few things you
00:07:17
would have to plan the way you would get out and get to the garage you would have to look at this artifact in the garage
00:07:23
and understand it as a car and then you would have to understand colors and then look at that and say okay the color is
00:07:29
right for example so there is this set of steps so to say that you have to carry out as a human intelligent person
00:07:36
that would say okay I'm going to plan my way out I'm going to plan my way into the garage I'm going to look at this object detect that object as a car and
00:07:43
then eventually detect the colors right so there are a few things that you do now if you were to break that so that's
00:07:50
AI so to say uh imagining that the system could do that imagine asking a robot to do the same set of tasks then
00:07:56
overall I would consider that to be AI now if you break that into some levels of deeper details the and taking out of
00:08:03
planning and scheding what are the techniques that you use possibly for navigating this ecosystem all the way up
00:08:09
until you got to the garage right what are the different techniques that you use in order to analyze that object and
00:08:17
understand that as a car and so that that set of techniques is what Ma learning is machine learning okay so
00:08:23
it's like Machine Vision and like object recognition that kind of stuff would be the machine learning Tech te that get
00:08:30
applied on top of AI that create the machine learning mechanism exactly so machine learning could be considered
00:08:36
just the the mathematical underpinning set of artifacts that you would use as a
00:08:41
subset of AI okay right so and then deep learning is just one of these techniques
00:08:47
that so within the concept the context of machine learning there are different techniques like one of them is called uh
00:08:53
nearest neighbors it's usually preceded with a u with a K so K is for number we
00:08:59
can say what are the four nearest neighbors to David and danu and then as
00:09:05
a matter of fact we would look at all the people that are within these buildings and then understand what are
00:09:10
the people that have a distance that is the four uh closest distance to us so
00:09:15
those are the four nearest neighbors that's just one technique out of many techniques there's another technique
00:09:20
called support Vector machines and there are many other techniques like regression classification and so on and
00:09:25
so forth now deep learning is a set of all of these techniques that uses neural
00:09:32
networks as a representation of the data that you would use to process in order to identify objects classify objects and
00:09:39
so on and so forth so you get AI as a bigger bucket that has other things including planning and scheduling and
00:09:45
sensing uh you get machine learning that is more focused on the mathematical and statistical and probability techniques
00:09:51
yeah okay and then you get deep learning that is just one of the application of machine learning techniques that focuses
00:09:57
more on artificial neural and then deep learning did that become something that
00:10:03
became very popular in the vernacular because it was uh it was discovered that it was a very good way to do machine
00:10:09
learning did people try to do a bunch of different machine learning techniques but deep learning just became the most
00:10:15
useful one that is correct and when we so bringing bringing it back to your initial question which was how does how
00:10:22
do these techniques or how do these definitions relate to the current state of affairs of machine learning it's very
00:10:29
related because um machine learning has been applied for a while deep learning
00:10:35
also for the last 10 20 um years so say so the techniques have been around but
00:10:40
then the techniques were boosted based on the Advent of a couple of capabilities and until we started
00:10:45
observing that deep learning was really doing two things one it was that it was able to process a large amount of data
00:10:53
and so the traditional machine learning techniques supervised learning and so on they would they would tend to PL to when
00:11:00
when you give it too much data so it would give you some performance and at some point it wouldn't really give you
00:11:06
more scale it doesn't scale so you start having diminishing returns so you spend a lot of computer capabilities but
00:11:12
you're not really getting good results um but with deep learning it was seen that you can one paralyze that
00:11:18
aggressively if you have a lot of computer capabilities gpus and or tpus mhm and two it wouldn't necessarily
00:11:25
Plateau that means that you can give it a lot of things
00:11:30
going up and so what what we saw was that um the techniques have been applied
00:11:36
for the last many years but their techniques are increasingly getting better and better given the Advent of
00:11:43
additional capabilities that are supporting that increasing performance MH what is that additional capability
00:11:49
that has really tipped the scale especially in the last year uh let's go back to the last six
00:11:57
years so to say with the invention of the Transformer architecture do you want to explain what that is so the
00:12:02
Transformer architecture is is uh was created in 2017 and before that there were many other architectures within the
00:12:09
Deep learning ecosystem that were used to process data scale um one of the
00:12:14
abilities for these systems to process text for example or sequences of data
00:12:20
things like music things like video things that have to deal with frames um
00:12:25
was had been studied for many years right so we had sequences sequence models we had things that we called lstm
00:12:32
long short-term memory models that essentially made it possible for someone
00:12:37
to uh process sequence data and even possibly generate sequence data but the
00:12:43
problem with those architectures was that if you have a text if you have an entire page and then you want to either
00:12:50
summarize that or analyze that then you have to put the entire thing into the model and so we started having
00:12:56
limitations with the capabilities that the machine maches themselves would have to host that amount of text in order for
00:13:03
you to ask a specific question of that text for example what is this text talking about or generate a summary of
00:13:10
this text whatnot so there were some skating issues because if you were to synthesize the entire page of text it's
00:13:16
hard it's more computationally expensive to the n+1 degree to generate or to
00:13:22
synthesize more and more text as you add words ex exactly and it it scale it scale quadratically
00:13:29
right the other thing is that to improve the quality specifically when you have to analyze things like text you want to
00:13:35
maintain a certain say grammatical structure if uh if you're being asked a question about a sentence sometimes the
00:13:42
answer is really towards the end of a sentence but you have to maintain the context with the beginning of a sentence
00:13:48
so there was this idea of essentially um keeping or maintaining a structure of the content that you're analyzing by do
00:13:56
um applying different mechanisms and one of the mechanisms that was invented with the Transformer architecture in 2017 is
00:14:02
what we call the attention mechanism so the attention mechanism is a mechanism through which within the neural network
00:14:09
it's possible for you to maintain a structure or keep information about how
00:14:15
let's say specific words are related within the text that you're analyzing so essentially you're coming up with a
00:14:20
mechanism through which you can an you can analyze a large amount of text while still maintaining the information about
00:14:27
how these specific tokens and or words words is just one representation or
00:14:32
tokens are one representation of words are related within that context now it
00:14:38
gets very expensive computationally and on memory and storage to get that done
00:14:43
and that was the challenge pre 2017 what the Transformer architecture brought um
00:14:48
about was the ability to process these large amounts of data maintain the structure that they have and it not
00:14:57
being extremely uh expensive on the hardware the storage and the computer so then it was possible by basically
00:15:04
parallelizing some of these architectures to make it possible for you to process a very large amount of
00:15:09
data build extremely scalable very very internet skill models if you had the
00:15:15
hardware for it and then eventually being able to get some intelligence out of it and then a couple of things
00:15:20
started happening one is that you remember when I said that if you were able to uh basically break through the
00:15:28
plat Towing of diminishing returns when you start having more and more and more performance so we started seeing things
00:15:34
going this way where you you get more and more performance and eventually you get new abilities you get emerging
00:15:39
abilities out of the same models when you say emerging abilities do you mean things that we didn't expect exactly um
00:15:47
traditionally we would train what we call supervised models based on tasks and so essentially what I would be would
00:15:53
be that you would go to the model and say what's the color of this object and it would say red so that's an that's a
00:15:59
model that is trained towards understanding given an object what is the color and so the way you do that is
00:16:07
that you you give it a lot of examples that are labeled and you say this is a mug the mug is red and black this is
00:16:13
another mug the this mug is white and so on and so forth you tag it manually exactly and the next time you show it
00:16:19
some data it will give you you know mug MH um but it's it's expensive to
00:16:24
basically train one model that can recognize mugs recogniz people also
00:16:30
answer a question and so on and so forth so being able to give multiple tasks so
00:16:35
to say to a single model that you train once was a challenge but with to make it
00:16:41
multimodal yeah multimodal has a couple yeah you can make it multitask and multimodel multimodel essentially to a
00:16:48
level of Simplicity really means that you're able to get the model to analyze images and text and audio and video at
00:16:55
the same time right and then it could it could be m to model input single output IE you you train the model to see images
00:17:02
text audio video but you're only asking it questions about text or the textto text format um that Paradigm is called a
00:17:10
picture is worth more than a thousand words so you can essentially get multiple pictures within the model get
00:17:16
it to learn from it but the way you interact with it is still in a text man okay so we started seeing benefits where
00:17:22
very very large models that had seen a lot of data coming out of the entire not the entire but a huge part of Craw
00:17:29
website for example uh a huge part of you know data that is available out
00:17:35
there started behaving in such a way that they had this almost general purpose intelligence um they could do reasoning
00:17:43
up to a certain extent and that is tested by giving it some mathematical problems and then it would do
00:17:49
derivations so to say assuming that it had seen some of these derivations in um
00:17:54
some mathematical books for example or writings so it would learn that structure leveraging that attention
00:18:00
mechanism and being able to derive the answer step by step and give you a specific answer and is that still
00:18:05
considered an emergent property if it was being fed different levels of ver of
00:18:11
derivations through different text input that's a very good question so the thing that makes the thing that makes that an
00:18:17
emerging property is the fact that it's doing that in a multitask fashion so
00:18:23
remember initially we would train one model to do one thing so if it was one model that was trained only on doing a
00:18:29
derivation over a specific mathematical problem set that would be very simple it wouldn't be considered emerging sure but
00:18:35
if you train one model that can do that on a mathematical Corpus at the same time take an an SAT exam at the same
00:18:43
time uh give you a summary of a specific piece of uh text that you give it and at
00:18:49
the same time write code at the same time optimize code and review code at
00:18:54
the same time so those are the different kinds of emerging property is that a multitask so to say large model um is
00:19:02
able to do in your opinion is are those emergent properties kind of subsets of
00:19:07
the attention mechanism like is that the thing that really allows it to do these kind of things one analog that um I
00:19:16
would I would give you is you know in physics for example when you have particles that are moving at a very very
00:19:24
fast pace so to say in a contained environment then you start get temperature yeah right heat and and
00:19:30
whatnot and if they move faster then you get higher and higher temperature temperature itself or heat itself is not
00:19:37
necessarily something that is uh is a is a physical artifact it's it's an
00:19:43
emergence of that fast movement but that movement itself is very simple right
00:19:48
right so similarly the attention mechanism makes it so that uh the specific elements that you feed the
00:19:55
model get to learn about each other and so they get this interaction mode through which they basically function
00:20:01
they have this simplistic function mechanism at a very very low level yeah
00:20:06
and there's almost this transformation this phase transition that happens where the higher level thing which is the
00:20:13
model starts giving you some of these specific behaviors in a multitask fashion has skill sets you didn't
00:20:19
anticipate it to be able to have that are based on things you did give it but you didn't realize we're connected
00:20:25
exactly a couple of other emerging properties one of my favorite is is called in context learning where
00:20:31
basically your large model now would learn from what we call demonstrations so again traditionally you would want to
00:20:37
give an input to the model and then the model will give you an answer that is a straight um uh input output relationship
00:20:43
but some of these models today you could say hey give me an answer that looks like this or here are four
00:20:49
demonstrations of the kinds of questions that I will be asking you therefore going forward from now I need you to be
00:20:56
answering these questions in this manner mhm and for some reason it's able to remember that context learn from these
00:21:02
demonstration that you gave it and then start giving you answers going forward that sounds like that and that's why uh
00:21:08
is that something we didn't expect it to be able to do exactly that's why systems like uh like chat GPT or Bard are are
00:21:15
very interesting in that sense because you can even basically tell the the system hey you are a knowledgeable
00:21:23
scientist about this field given that background start answering my question questions and then it will be giving you
00:21:30
some very interesting and then there are many ways you can get creative about that space right you can say you are a very funny and creative artist start
00:21:38
giving me answers Within These specific uh steps and the last emerging property I'm going to talk about is what we
00:21:44
called uh um Chain of Thought or or reasoning I think I I spoke about it a bit earlier where the the model or the
00:21:53
AI system is able to give you a stepbystep breakdown on on how it came
00:21:59
up with the answer right right so that's very interesting too and that's definitely not something we expected it
00:22:04
to be able to do exactly okay so that was a lot uh I think there's a lot of answers to this question but effectively
00:22:10
it seems like AI is sort of the outer layer where you try to teach machine
00:22:15
like human analoges to a machine and then you've got machine learning which is a subset of AI and deep learning
00:22:22
which is a subset of machine learning and then when you feed these models just these enormous amounts of data you end
00:22:28
up with these emergent properties that you're not really expecting uh we're going to get a little bit deeper into
00:22:35
those emergent properties and very philosophical next so stick around I think I'm going to go get a coffee
00:22:41
[Music]
00:22:51
first so I think that um because large language models and chat Bots and things
00:22:58
like Dolly are sort of the only things that a lot of normal people in their everyday life have seen AI affecting is
00:23:06
there what what else is the Transformer transforming like what what uh what
00:23:11
industries are being kind of like pulled up by Ai and what's actually driving
00:23:16
that because I think that most people just see like oh we've got chat GPT oh now this random app that I never talked
00:23:23
to has a chat bot for some reason right but like we hear all acoss every
00:23:28
industry that every industry is being uplifted by AI so is that also Transformer based and how is that
00:23:35
working since it's not using a language model right so the Transformers started the revolution so to say right so the
00:23:42
ability to have these emerging properties and since then so that was in 2017 it's been what 6 years now mhm um
00:23:49
since then there's been a lot of evolution of that specific architecture there's been a lot of creativity around
00:23:56
you know building some of these AI systems systems generative AI systems that can generate uh images or text or
00:24:03
given some text give you some image or giving some image give you some text that's captioning and or applying this
00:24:10
paradigm shift so to say into many uh uh Industries and many applications there
00:24:16
are two ways I would say we can look at this one is old school AI is not gone right so
00:24:25
we're still using that we're still applying some of these techniques or recommended systems uh you know when you
00:24:31
go on the website you're still being recommended some out factum things to to buy and or uh suggestions of books to
00:24:38
read and whatnot so many of these initial applications of AI are they're
00:24:45
really really uh useful for very large companies that have their abilities and
00:24:50
this is one thing that I really like talking about very big companies that have the ability to hire hundreds of
00:24:55
Engineers so to say or dozens of Engineers highly trained highly paid that can build some of these highly
00:25:03
tuned systems that would scale to say hundreds of millions of users right for
00:25:08
the businesses that are not the multi-billion dollar businesses we're seeing New Opportunities open up because
00:25:14
these industries can now use some of these generative AI systems in the past
00:25:19
you needed about 7 months to 18 months to build an application with programmers
00:25:25
designers uh product managers and so on and so forth but now if you have a vision well you can go on Bard and say
00:25:32
hey this is my vision help me iterate on that give me five ideas that are related to this and then after that you can say
00:25:39
hey now write a a product requirement specifications for a system that may
00:25:45
look like that and then you can say hey based on all of these interaction write a project plan M and you can iterate on
00:25:51
that context with the with the bot the chat bot so to say and then after that you can say hey considering these
00:25:57
artifacts or considering everything that we've talked about help me write a design document that I could use to
00:26:04
implement this app this solution and it would do that and then you can say now I need you to help me implement this in
00:26:10
Python you know designed the apis for me write the implementations of the apis for me write the system designed for me
00:26:18
it could even help you draw some of these things and so what you're seeing is that you're moving from a a life
00:26:24
cycle where you had to use about 18 months with a team of 10 to even get an
00:26:30
idea into a good shape to probably a matter of hours to weeks working with
00:26:37
prompts and being very creative in the way you interact with that bot or as a smaller group you interact with that bot
00:26:43
yeah to come up with a solution that is pretty pretty good yeah and so what I see is that many Industries um many
00:26:50
startup and Enterprises have really really taken advantage of that I've seen good examples in media I've seen good
00:26:56
examples in in healthcare and Life Sciences I've seen good examples in financial services but in all things
00:27:02
essentially I'm seeing a lot of uh movement do you use these kind of systems in your own work to build your
00:27:08
own apis and stuff you use Bard for your own work yeah I use Bard I use Bard every day every time I have an idea
00:27:15
every time I want to process something I use B to iterate on the idea wow um I
00:27:20
use B for uh outlines if I if I need to give a talk for example a conference um
00:27:27
usually for me the process of creating content would be based on the work that it depends on the topic but based on the
00:27:33
work that I do and based on some research I try to come up with a specific outline that really touches on
00:27:39
the points that I would like to talk about and so I use bar to create to help me create that outline and then I may
00:27:44
feel the outline myself and give it back to Bard and say hey help me summarize this and or help me extract specific
00:27:51
talking points out of this yeah uh and then I can say hey make this a bit more creative and make this a bit more you
00:27:57
know in in different types of tones so there's this that's one mode of interaction the other mode of
00:28:02
interaction is the one that I spoke about earlier which is when I have an idea rough idea say I want to create a
00:28:08
system that helps you determine what coffee you're going to drink in the morning based on prior whatever like
00:28:15
just a toy example like that and so I can formulate specific questions and interact with Bard in that way and I
00:28:22
could have a prototype before the end of the day that works that is implementing python full stack that's and back
00:28:28
everything yeah that's like a productivity explosion exactly I want to reel it back a little bit because we uh
00:28:34
we talked about AI we talked about machine learning we talked about deep learning but the big thing that's being
00:28:39
that's on everyone's mind in the last year is generative AI which you've talked about multiple times so far but
00:28:45
we didn't really Define what generative AI is and what makes it different from those uh other forms of AI so can you
00:28:52
give a quick explanation of what generative AI actually is remember we talked talked about AI overall being a
00:28:59
system not just one thing uh so and machine learning being a set of
00:29:04
techniques that are more mathematical in nature deep learning being one of these techniques that focuses a bit more on
00:29:10
neuron networks so by virtue of of getting something that is a lot more
00:29:15
fundamental generative AI is a deep learning technique so it's still using
00:29:21
the Deep learning Technologies but generative AI is really focus on generating or creating a specific
00:29:28
artifact and so that artifact could be an image it could be a piece of text or
00:29:33
it could be a piece of audio or it could be something else so yeah that's that's a very simplistic definition of what
00:29:39
generative AI is and what what is the foundation of generative AI like what allows that to work because we see
00:29:45
things like generative fill in Photoshop we see generated music now like the every single creative industry and
00:29:52
non-creative Industry is being sort of upended by this generated content what
00:29:57
is allowing systems to actually generate content instead of just classifying content yeah so that's a beautiful
00:30:04
question in a sense that it there's a very very strong common denominator among all of these things and that's the
00:30:09
Transformer architecture we spoke about earlier right so what we've s what we've seen is that applying the same technique
00:30:17
and then changing the question a little bit gives you exactly uh content that is
00:30:22
generated that that you're interested in for example we can say using these lower
00:30:28
Transformer architecture help me generate an image you can give um the
00:30:33
the generative AI problem as given images of different artifacts
00:30:39
like animals like cats and dogs or whatnot create something that looks like some of these things uh using I don't
00:30:46
know interpolation or extrapolation different techniques and make it look like the family of things that I've
00:30:52
shown you in the past and it will give you something that doesn't exist in real life maybe the image very high fidelity
00:30:58
image of a dog or cat that doesn't exist in real life but really really looks like the samples of the things that
00:31:04
you've shown it in the past so the ability for these models to essentially create uh content in different
00:31:11
modalities is to generative ability yeah so we think about like large language models being fed into a Transformer
00:31:17
right and that's just like give me all of the text that has ever been written on the internet and we can develop relationships between words but when
00:31:24
you're when you're generating an image or you're generating audio what is being fed into the transformer in that way
00:31:31
right because we we see uh you know there's um a lot of genetics work that's being worked on with Transformers too
00:31:38
what kind of data do you feed into Transformers to actually make that work in a variety of different fields right
00:31:45
so in general you would give it today and text was very easy easier to acquire
00:31:50
that's why you hear of large language models today a lot more right and I and
00:31:56
the result also so from generating text were a lot more impressive and exciting to look at that's why in my opinion that
00:32:02
field somewhat took over but you're right so there are you could consider the input to be pretty much anything
00:32:08
that could be put into a sequence you a video for example is a sequence of frames right so you could give multiple
00:32:15
videos uh broken down into frames uh to uh a Transformer based architecture
00:32:22
and it gets a bit more complex in the way those sequences of process or structure is maintained the many techniques around attention
00:32:29
mechanism and so on and so forth but let's consider that to be a black box and then it knows how to do that then
00:32:35
what you give it is a set of frames which are videos so to say and then you say give me something that looks like
00:32:41
that so in that sense you've given it videos of or of set of frames um you
00:32:46
could also have a mechanism through which you give it videos and text which
00:32:51
we do today there is this um encoding uh model that is called clip essentially
00:32:57
putting together images and videos I mean and text and which is the foundations of do and a lot of AI image
00:33:04
generation at least the foundational technique of U of these kind of abilities where you you you teach the
00:33:11
model to recognize images and text together as a as a joint uh entity so to
00:33:17
say and the process through which you do that is by getting the image is processed with what we call tokenizer
00:33:23
and or encoder specific to an image and that turn turns that into a vector we call that an embedding and then you do
00:33:30
you go through the same kind of process with the text where you turn the text into a vector and then once you have
00:33:36
these two vectors you can then combine them with basically algebra and then at a higher level you have the task and
00:33:42
Order question that you want the model to answer in one scenario you could you would want the model to say for example
00:33:48
given an image explain the content of this image for me or you may have the reverse problem which is given a text
00:33:55
generate an image that contains the the information so to say that I've provided in this text which is the business of um
00:34:03
Mid Journey so as a kind of to break that down you're depending on the field
00:34:09
that you're trying to use Transformers on you are turning data into numbers and
00:34:15
you're comparing those numbers to each other and then getting an output correct so because you're able to take video or
00:34:22
images or text and vectorize them and turn them into tokens you can
00:34:27
compare them to each other even though they're different types of media correct correct that that is that is excellent
00:34:33
and the the one of the things that makes it really work beautifully is because once you take the images or video or
00:34:41
audio you incode that into an initial Vector that that process is called tokenization M then once you get the
00:34:48
tokens and by the way the tokens can be a bit more complex for example the tokenizers could learn to not just use a
00:34:54
word per token mapping but it could also split words into two or three if that
00:34:59
word has a bit more multiple meanings and complexities or if it finds it effective so Sub tokenization sub
00:35:05
tokenization so you may have a situation where uh a five-word uh sentence gives you 12 or 15
00:35:13
tokens or maybe less so it's a matter it's a the concept is more about
00:35:18
information preservation within the subst structure that is a vector rather than a onetoone mapping between the
00:35:24
words and and and the vector right same thing with an image an image is a two-dimensional structure which has a
00:35:30
third dimension of red green blue right right so if you flatten that entire thing into a pixel intensity over that
00:35:38
entire two times times three so to say Dimension then then you get a larger Vector but that's just a simplistic
00:35:45
tokenization where you say hey I'm going to flatten an image flatten that more by reging blue and then after that I'm
00:35:52
going to have a vector representing the pixel from there you can have a deeper
00:35:57
tokenization that may consider the structure for example the adjacency of object or the distance between objects
00:36:04
or even some deeper level of understanding of the objects within that image at the end of the day you go from
00:36:10
a piece of artifacts like audio and in anodia you use spectrograms and and you
00:36:16
turn that into specific order artifacts so you you go from an asset to a vector
00:36:23
mhm now there's this other step called embedding which is basically doing a projector of that a projection of that
00:36:30
Vector onto a vector space that is shared by every other piece of artifacts I mean other piece of data in that space
00:36:36
it's like a normalization like a normalization but then by that projection what you essentially do
00:36:41
especially if you have a multimodal model like if you work with an image and if you work with Tex for example then
00:36:47
you tokenize them each which is a onetoone relationship between an image and the text and the tokenizer that works for them and once you have these
00:36:54
two vectors you do that projection onto that shared Vector space so to say and
00:36:59
the beautiful thing about that is that and you do that through training the beautiful thing about that is that once
00:37:05
you land these things within the same space they become of the same nature
00:37:10
right so you can start comparing them yeah so you can start assigning relations and making uh having
00:37:16
statements like a car c a r written in text form compared to the image of a car
00:37:21
compared to the image of a car so these vectors will be closed in location so a
00:37:26
Rosetta Stone you're taking you're taking one language and another language and you're sharing them in a certain way
00:37:32
and then once you have this shared say you translate them all to Latin then you can do whatever you want from there and
00:37:38
the common the common substrate of all of these different things assumption at least is that there's information that
00:37:44
is preserved in these different types of artifacts so you're almost doing an information extraction exercise right
00:37:50
describe that what do you mean by information so it may be a longer conversation but at the end of the day
00:37:56
at the end of the day so information and I know you had a whole uh uh video about
00:38:02
the nature of information it could be contextualized to the piece of artifacts that you're working uh you're working
00:38:08
with but in a very very simple manner information is this uh entity or this
00:38:14
thing that could give you it's hard to Define information without using information yeah yeah you have to it's
00:38:21
this thing that can give you a bit of a pattern right so and we usually base that pattern on uh the notion of order
00:38:28
disorder Symmetry and so on and so forth yeah but if you have something that can
00:38:34
um give you a pattern about in indifference and or disorder about a specific subsystem then you start having
00:38:40
information for example if I do
00:38:46
this nothing has changed very much so if you were on the receptive end of that pattern you won't really get much
00:38:52
information but if I do
00:38:57
there's a difference in what I did before and what I'm doing now sure now you may not understand why I'm doing
00:39:03
that but you would understand that there's a difference between what I the way I tapped the frequency at which I
00:39:09
tapped my hands before and the frequency at which I tapped it after then you've gained information yeah so it's the same
00:39:15
way that you may understand some differences within an image for example
00:39:20
looking at a contour and then something changed between this and this then you may realize that these may be two
00:39:26
different object and so on and so forth and within text as well you may have
00:39:31
difference maybe between words or between paragraphs and between different structures so you have some form of
00:39:36
information and the beautiful thing about information is that it could be combined so it's the evolution of
00:39:42
information is what you're maintaining it's the extraction of information and or differences in patterns within
00:39:48
different modalities of data artifact but the beautiful thing about that is that it could be combined at a certain
00:39:54
level or compared that's what makes it possible for you to essentially extract information out of an image by
00:40:00
understanding how different it is or how many different patterns exist within that image yeah and extracting
00:40:05
information out of a piece of text by understanding how many different patterns exist within that text and then putting that together in a normalized
00:40:13
space through which you can start comparing them and then reversing that you can now combine text to images and
00:40:18
basically have that relation maintained with all of that combined is that would
00:40:23
you say would that be the fundamentals of like a general AI that could do everything we're we're getting into the
00:40:30
realm of AGI I would love your opinion on that if you feel comfortable talking about of
00:40:36
course of course so um what is intelligence according to you
00:40:42
according to me this is such a big question um I've thought about this a
00:40:48
lot my personal opinion on this at this point is uh well for the listeners we're
00:40:54
going to Define AGI really quickly AI is artificial general intelligence effectively meaning you can ask an AI to
00:41:02
do anything that a human could be able to do or possibly even more right and it
00:41:08
could be able to help you with that would you agree that that's the definition or do you have an expanded definition that's that's somewhat why
00:41:15
I'm I'm um asking the question of what is intelligence because agreeing on AGI
00:41:22
being artificial general intelligence assumes that we agree on what intelligence sure okay my my definition
00:41:29
of intelligence would be wow thanks uh the ability to synthesize
00:41:38
information and create uh create new actions based on
00:41:45
information that you weren't explicitly told to do that'd be probably my definition of intelligence that's a
00:41:51
that's a decent definition um would you disagree that the context in which you have have to do that specific workflow
00:41:58
that you define has to be defined at e I.E you have to do it within the context of I don't know literature or uh
00:42:06
robotics Automation in a sub field for example having a robot that can control a specific arm either for surgery and it
00:42:13
would be a different thing if that robot controlled an arm say in a restaurant and so on and so forth I think that I
00:42:20
think that when we talk about the generalization of intelligence or even information
00:42:26
we're making a bold claim that goes beyond what we understand so far about the nature of these things uhuh right so
00:42:34
I I see so if I want to break down the problem of AGI uh again I might have
00:42:40
already expressed that I'm not a very big fan of that definition because I don't really think we know exactly what we mean when we say that sure um but if
00:42:46
we want to get into a practical uh realm I think that may be possible to
00:42:53
essentially and which is the the state in which we are now Now by getting these models to progress in their ability to
00:43:01
impact the world as well so we we discussed the software version of the AI
00:43:07
so far which is you give it data it could recognize it or at this point it can also generate data sure but what is
00:43:13
a software real world interaction mode at this point right so we have many
00:43:18
systems for example in healthcare and life sciences that have to deal with the real world in the way that say a
00:43:23
hospital equipment functions or in the way that um a robotic arm that controls
00:43:29
cameras functions so you get many other things about a real world that may have to do with intelligence so I think a lot
00:43:37
of the work that we're doing on improving the quality of these AI systems has to bring things all the way
00:43:44
up to these definitions of AI that I mentioned earlier which involve in and include planning scheduling and acting
00:43:50
and sensing as well so when you start augmenting these systems with these additional capabilities is and you start
00:43:57
training agents that are able to plan and schedule and act on in the real world then you get that sense of a gii
00:44:04
that closer to the definition you gave it right now the we the ability to do
00:44:11
that at that level at that scale gets challenged by where are you sensing what kind of information and also where are
00:44:19
you acting in which kind of world environments right and if you want to look at the real world in which we
00:44:25
operate and you want to look at all the types of interactions and actions that can happen it the the number of
00:44:31
possibilities is larger than the number of atoms in the universe right and so how would you have a generally
00:44:38
intelligent system that knows how to act at this entire world I find that quite a challenging thing to believe but if you
00:44:44
constrain the problem if you make the problem simpler as simple as I want to
00:44:49
have a generally intelligent system that would learn how to use all the hospital equipments within the hospital hospital
00:44:56
system then maybe you have the opportunity to have an AGI system that
00:45:02
can essentially take in the task and execute that effectively so that is my
00:45:08
um uh techno Optimist view of the possibilities of AGI by training agents
00:45:14
that have World represent representations but these are simpler worlds representations that are
00:45:19
constrained by the problem space in which you want these systems to operate right and then being able to plan
00:45:25
schedule sense and act including the other type of capabilities that they can do okay interesting so danu doesn't
00:45:33
think that we're going to have this one omniscient AGI artificial general intelligence that's going to be handling
00:45:40
everything but he rather thinks that we're going to have these smaller more specialized AIS that kind of handle
00:45:46
different tasks and help us do stuff a lot faster this is actually not that different from that whole conversation around the Tesla bot right like where
00:45:53
you can have a robot that's like a human that does human tests or you can have a bunch of really small robots that handle
00:45:59
the test that we already do on a daily basis kind of the same thing pretty interesting um in the next segment we're
00:46:05
going to get into the problem of AI hallucinating which is where it just makes up a ton of random stuff and uh
00:46:13
that's clearly a problem I was very curious about that so that'll be a fun conversation plus we need to see how
00:46:19
fast danu can type so uh get ready for
00:46:24
that [Music]
00:46:35
uh Lis wanted to hop in and ask a question real quick yeah sorry I really liked what you said about um defining
00:46:43
intelligence as it pertains to AGI and I thought David brought up a really
00:46:49
important kind of intelligence like intuition and deduction and the ability to uh EXT ex ract not just pieces of
00:46:57
information but threads and systems of information from multiple kinds of contexts but there's lots of other kinds
00:47:05
of intelligence that people like cognitive scientists like to Define and classify things like spatial reasoning
00:47:12
um things like uh engaging in dialectical thinking um and these are
00:47:17
all intelligences that we've observed in ourselves and so when we think about
00:47:23
sort of a general purpose Swiss Army knife AI do you think that we should be
00:47:29
limiting that to the kinds of tasks that our brains do on a daily basis or do you
00:47:35
see that there's going to be almost like new methods of thinking and new cognitive strengths that emerge as uh
00:47:43
these neural networks get stronger that's that's a super interesting question I for in a practical sense um
00:47:53
I'm actually with you David on that definition right because I think that that's the form of intelligence that
00:48:00
could be mechanistically or mechanically implemented in a piece of software as in
00:48:07
program right so by our own intuition we can think about doing things like that by breaking that down into steps that
00:48:14
kind of um intelligence that you're talking about to me it's a bit more like that emergence right like that emerging
00:48:22
ability that and I don't think we've gotten to the point where we can perceive what those are yeah or
00:48:29
intuitively like a new color yeah intuitively know what exactly we need to do in order for the models to have that
00:48:38
spatial awareness or that other kind of ability now we can program that by
00:48:43
having segmentation models and having distance calculations and then coming up with some mathematical theistic through
00:48:49
which we can claim that we've achieved that capability but I would argue that
00:48:54
the way we learn us is not exactly the way we teach the machines how to do that right so there
00:49:02
there's definitely a lot of a lot more research and we may stumble upon you know we may basically strike luck and
00:49:08
then find out that other kinds of scaling mechanisms or the way it works in physics today is that you have
00:49:14
smaller systems you have a simple interaction mode like magnetization or just Collision analysis or the different
00:49:21
forces that we're working with about four of them and based on these simple interaction mode you get the entire
00:49:27
universe the way we know it at least it's the current theory yeah fundamentals of physics yeah at least it's it's the way we we know but there
00:49:34
may be another way right we may have just sensed it in our own apparatus of sensing in that kind of form and we're
00:49:40
able to explain it the way we explain it but it's still a projection on on the on the screen that we're looking at and
00:49:45
doing our analysis so I'm I'm super excited about the possibility of us
00:49:50
finding out more cognitive routes so to say in the way these systems learn and
00:49:57
right now the best tools that we have essentially in our laboratory of AI are
00:50:02
these deep learning tools the Transformers and there are many other architectures that are being built
00:50:07
around that things like uh memory orware neuron Network architectures or things
00:50:13
like the abilities to pull from a vector store and augment the knowledge uh with
00:50:18
the retrieval augmented generation capability so I feel like the more we add interaction mode and information
00:50:26
uh uh retrieval and use um utilization capabilities Within These models the
00:50:32
more possibilities we have to have this additional emerging capability that is a
00:50:37
lot more cognitive than the mechanic way we've been doing things so I think that's an open question I think it's a
00:50:43
beautiful question and I hope we get lucky In Our Lifetime to find a way to get that done yeah me too we've stumbled
00:50:49
upon a lot of random stuff in science so there's definitely a possibility that we have that happen which would be big
00:50:56
I think it's Richard fan that say that science is the belief in the ignorance of the expert so I I think that if we if
00:51:03
we really take it as a as a basic principle that we could stumble upon some things and then we we believe that
00:51:09
whatever we know so far is may may or may not be way then we have an opportunity to really in incorporate new
00:51:16
information and all knowledge that can get us faster and further yeah I want to
00:51:21
pull this back uh a little bit back to some practical stuff again um a big
00:51:27
philosophical yeah no I love the philosophical conversation I love the philosophical um I think that one thing
00:51:32
that people think about when they think about generative AI uh is the problem of
00:51:38
hallucinations and uh for people that don't know hallucinating is basically when you generate something that just
00:51:45
isn't right or isn't true um in large language models you can ask a question and it will confidently lie to you
00:51:52
sometimes and how do you look at how we're we're going to solve that problem because it seems like part of generative
00:52:00
Ai and part of large language models in general is that it's just it's parting
00:52:05
information based on probabilities and those probabilities are not always going to be correct so
00:52:12
you're I'm assuming working on ways to make these AIS more accurate um accuracy
00:52:17
is obviously going to be a major a major problem and something that we need to solve over the next couple of years how
00:52:24
do you look at solving the hallucination problem the the problem of hallucination
00:52:29
so the way these models work now is that you give it a lot of data and the question you're really asking of it is
00:52:35
um give me let's simplify a token to a word and working within the text domain
00:52:40
give me the next word based on these word that I gave you right so if you if
00:52:46
the if you qualify the problem as write a novel for me or write a paragraph or
00:52:52
write a summary of something then traditionally what would happen is that you would give it the beginning of uh uh
00:53:01
a sentence and you would say complete this sentence for me and so it's that sentence completion so to say that is
00:53:07
based on probability and even basing that on probability in within the context of this conversation is the
00:53:13
simplification so there's there's a lot more going on but the basic principles of how it works is that it would let's
00:53:20
assume that it works off of the most probable word to follow that word that existed
00:53:25
and then taking that longer sentence as an input figuring out what is the most probable word that could follow and so
00:53:32
on and so forth what that means if you simplify the problem just at that level and if I say give me a complete the
00:53:39
sentence doctor something works at John Hopkins or something like that then it
00:53:45
would just put a name there right right so the the the question you haven't
00:53:51
asked is make sure that that name is an existing human being that is really a
00:53:58
doctor at John Hopkins and whatnot right so fundamentally it's a different question
00:54:05
to ask off of that system and then we're back on the reason I call these things a system in the beginning is because yes
00:54:11
you may have a model that gives you the next word prediction or the next token prediction but then you still need to do
00:54:17
a lot more work on top of that input and that output and even at processing sometimes to make sure that the output
00:54:23
and the response that you get out of it is a truthful one or real one right or a
00:54:29
less toxic one if the answer is toxic and you don't want to serve toxicity to your users so there are many
00:54:35
pre-processing and post-processing activities that need to happen one to uh
00:54:41
uh make sure that the context I mean the the answer of the model is grounded we call that concept grounding grounded in
00:54:47
reality MH and the second is to make sure that the cont the output of that model um goes by your certain set of
00:54:55
responsible AI principles right so those are two things but fundamentally the way the science works is that it would give
00:55:01
you something whether that thing is true or not sure it's it's about it's your job to make sure that that thing becomes
00:55:08
true and so the way that happens then now is that you need to associate that
00:55:14
response to basically a source of truth right what is truth yeah yeah what what
00:55:20
is truth what is reality and that's that's another thing where you another another reason why you probably want to
00:55:27
contain and contextualize that use case sort to say down to a source of truth
00:55:32
right uh give me Dr blah that works at John Hopkins then you need to probably have a database of all the doctors that
00:55:38
work in that hospital and make sure that after you get the name of a doctor because the model will give you that you
00:55:44
check that against that database and if that person doesn't exist or you can say fill this specific spot off of the names
00:55:51
the list of names and the database and constraint and constraint so so that's why Bard now has that Google button
00:55:57
where you can ask a question and then you can double check it that's another context that's another mechanism for for
00:56:05
that but that's not exactly why it has that it has that button um the just to
00:56:11
just to land on the concept on the concept of hallucination so it was named hallucination because it could give you
00:56:17
some some answers that seem real but they're not necessarily real but this is her normal functioning mode of these
00:56:24
Technologies um the reason it took us a while to release Bart for example was not because
00:56:29
well we invented the Transformers so we've known how to do this thing for a long time yeah but it's it's all of the
00:56:36
additional Tech set of technologies that we had to build and principles that we had to really build around the behaviors
00:56:41
of a model that really get us to you know one the requirements of build
00:56:47
addition building additional Technologies and then two the challenge around making these Technologies
00:56:52
deterministic in the sense that you always want a specific answer so you have to do a lot more evaluations you
00:56:59
have to do a lot more checks and balances you have to add the number of metrics like is this model answering a
00:57:05
question when it doesn't know the answer you probably want to codify that into something that gets checked yeah and so
00:57:10
on and so forth so there's been a lot of work that we've done on one uh really having clear and concise responsible AIS
00:57:18
um principles and then two turning those into Technologies and or checking mechanisms that could work in
00:57:24
conjunction with the creation the operation uh and the operation of a model and then three U making sure that
00:57:32
these cores and and outputs of checks are available so that that technology could be used on the cloud ecosystem for
00:57:37
example as part of platform so we work with research to understand what are these responsible AI principles that
00:57:43
could be turned into metrics and guard rails and so on those get turned into
00:57:48
product capabilities that work alongside our models and then these models are exposed or are commercialized so say on
00:57:55
our Cloud platform this product called vertex Ai and you can go find it out on on on you know cloud. Google right so
00:58:02
that's that's how we we're essentially fighting the problem of uh of hallucination there's a lot more work
00:58:08
going on in that space mhm okay well I think I'm going to close it out here soon but I um I want to end with asking
00:58:14
if you think that there's anything that we missed anything that people would
00:58:20
gain a lot from hearing about that they just are not hearing in popular media that's very important to the whole AI
00:58:27
story uh two things maybe one is the consumer applications of AI so bar chat
00:58:35
GPT are very popular now so which is something that I'm I'm happy about because I think that it's really
00:58:41
bringing the conversation closer and closer to everyone and you and I have been working tech for a while so we may
00:58:47
have been aware of that coming up and coming together but I think it's a massive opportunity that today um people
00:58:54
that are um news editors or writers or artists or folks that work in different
00:59:01
domains uh can use some of these things to help them write better to help them
00:59:07
um generate images that they can use as part of a Content that they produce and create to write better letters to write
00:59:13
better to do homework and and so on and so forth so I really love the the consumer application but one of the
00:59:19
things that I don't think get talked about a lot is the developer experience um um and also the way the barrier of
00:59:27
entry from creativity and product generation product creation standpoint
00:59:34
it's getting really really lower with these set of Technologies and so I I I
00:59:41
really think that we are at the cusp of a new form of economy where
00:59:48
creation uh of valuable items of
00:59:53
different kinds of forms would not just be a matter of a few being able to do
01:00:00
that because they have a highly a high training and they've they spent years doing I don't know an undergrad in
01:00:06
computer science and so on and so forth but if you bring that level of assistive
01:00:11
creativity abilities to the masses so to say yeah I found that people have ideas
01:00:19
right like people are creative if you sit down and you tell someone let me take away the problem of knowing how to
01:00:25
implement these ideas as talk about your ideas you get many ideas to start emerging so I think that we're really at
01:00:33
the border of a transformation where the economy may take a different form if
01:00:38
different people without the need to really understand in details how to implement some of these ideas are able
01:00:45
to one iterate on the ideas with the assistance of generative AI to validate
01:00:51
some of these ideas with our ability to prototype those in a matter of hours rather than years and then three
01:00:58
test these ideas in the ecosystem and maybe find value for different people
01:01:03
that they could commercialize these ideas for so I'm very optimistic about the possibilities of this in the future
01:01:09
all right well the last thing we're going to do we have a little game here that we play when we bring guests on
01:01:14
where we figure out how quickly they can type the alphabet it's a running scoreboard um
01:01:22
you can use either the MacBook keyboard you can useo thing yeah it's a keyboard
01:01:28
test get yes okay so you'll take that so you get I'm going to ask the AI to TP
01:01:34
type this thing for me so you get three chances um wait what
01:01:41
is the most optimized way of typing the whole alphabet if you as soon as you start typing it starts uh so as soon as
01:01:49
you type the letter A it'll start and does he have to hit enter at the end no you don't have to hit enter as soon as you hit Z it'll finish got it and and uh
01:01:57
no typos is allow so if you miss a letter like let's say you Miss B and go
01:02:04
on to C it will not count you have to hit every single letter and you'll see at the top where it say type A okay um
01:02:13
that will tell you the letter you're supposed to do we give people tests at all or is it just three chances there's three total chances okay
01:02:23
ready you got to hit
01:02:29
G oh it's harder than it looks definitely
01:02:34
a lot harder than it looks that's okay you got to hit J this is why you get three chances don't worry about it I was extremely
01:02:42
[Music]
01:02:47
slow okay so first run 26 seconds now
01:02:52
just hit reset can I change a keyboard yeah you can change keyboard all right so now I
01:02:59
understand why there are options yeah so we have we have mechanical keyboard we also have the butterfly keyboard that um
01:03:06
Apple sells let me know I'm going to get a ke all right so I'll go mechanical you a mechanical let's do it set us up
01:03:12
however you want round two go for it
01:03:23
fight nice okay 26 to9 yeah 26 to 9.8 much
01:03:30
better that's a big come up much better last try last try but you guys aren't impressed that means that I'm not I'm
01:03:37
not that is not bad I was not far I was not far in front nine is is actually really good especially for a second ATT
01:03:43
we've seen some things in here that you would not believe I'll show you the scoreboard
01:03:48
after this and you'll be okay ready ready go
01:03:59
[Music] nice 8.73 not bad honestly not bad okay
01:04:06
where where is that on the leaderboard David so here's the leaderboard fastest Tom Scott 3.5
01:04:13
seconds insane that was crazy to watch it was just um wow so let's see
01:04:21
8.73 is right above Brandon wow wow all right actually no faster than David
01:04:26
Blaine too you beat David bla you beat David blae he might be a magician but you're a
01:04:33
magician on the keyboard wow uh oh 8.7
01:04:39
73 right you also beat Hassan Minaj hey
01:04:46
Hassan so you beat Hanan Minaj David bla and Brandon nice cool all right well
01:04:52
thank you again thanks for having thank you for coming um where can people find you on the
01:04:58
internet uh well I'm Dean banga on X now
01:05:03
okay nowadays and I'm LinkedIn on LinkedIn as well so as Dan banga essentially awesome we'll link that in
01:05:09
description and uh do you want to shout out your any projects that you're finishing up right now or working on right now that Google can people can see
01:05:16
it Google so the the vertex AI platform is really the platform that I'm working
01:05:22
on right so that that's what we we put out Solutions on and uh I say I would
01:05:28
say that look forward to many other more
01:05:33
industri SL domain adapted uh uh capabilities around llms because I think
01:05:39
that large models are a big thing and I think it requires a lot of additional Technologies to actually make it work in
01:05:45
in applications and I think that this is the about the time where we need to come up with things like design patterns
01:05:51
right so if you think about a gang of four for example it's a book that was needed when programming needed some kind
01:05:57
of structure so I think we are at a place in time now where we need some kind of structure and how we build and
01:06:03
deploy large application large models in in in Enterprise environments and that's something that I'm working on awesome
01:06:09
sweet well everyone uh watching listening at home if you were surprised that we had an episode today don't worry
01:06:14
we have normal episode coming on Friday this was just a little extra story for you so uh hope you enjoyed it and we'll
01:06:20
see you on Friday Cheers peace [Music]
01:06:39
n

Episode Highlights

  • Understanding AI
    David and Danu dive deep into what AI really is and how it works.
    “AI is a collection of tools and techniques.”
    @ 04m 19s
    October 24, 2023
  • The Rise of Deep Learning
    Danu explains why deep learning became the go-to method for machine learning.
    “Deep learning became popular because it was a very good way to do machine learning.”
    @ 10m 15s
    October 24, 2023
  • Emerging Abilities in AI
    Danu discusses the unexpected capabilities of AI models, showcasing their advanced reasoning skills.
    “Emerging abilities are things we didn't expect.”
    @ 15m 39s
    October 24, 2023
  • The Evolution of AI
    AI has evolved significantly since 2017, with emerging properties that surprise us.
    “The Transformers started the revolution, so to say.”
    @ 23m 35s
    October 24, 2023
  • AI's Impact on Industries
    AI is uplifting various industries, enabling new opportunities for smaller businesses.
    “Many industries have really taken advantage of generative AI systems.”
    @ 26m 50s
    October 24, 2023
  • Generative AI Explained
    Generative AI focuses on creating artifacts like images and text, using deep learning techniques.
    “Generative AI is really focused on generating or creating a specific artifact.”
    @ 29m 21s
    October 24, 2023
  • The Challenge of AGI
    Creating a generally intelligent system is more complex than we can imagine.
    “The number of possibilities is larger than the number of atoms in the universe.”
    @ 44m 31s
    October 24, 2023
  • AI Hallucinations Explained
    Understanding the phenomenon where AI generates incorrect information confidently.
    “Hallucination is when AI generates something that just isn't right or true.”
    @ 51m 32s
    October 24, 2023
  • The Future of Creativity
    Generative AI is lowering barriers for creativity and product creation.
    “We're really at the border of a transformation where the economy may take a different form.”
    @ 01h 00m 33s
    October 24, 2023

Episode Quotes

  • AI is bringing human intelligence analog to computers.
    How Does AI Actually Work?
  • Emerging abilities are things we didn't expect.
    How Does AI Actually Work?
  • Wow, I use Bard every day!
    How Does AI Actually Work?
  • Generative AI is really focused on generating or creating a specific artifact.
    How Does AI Actually Work?
  • The number of possibilities is larger than the number of atoms in the universe.
    How Does AI Actually Work?
  • Science is the belief in the ignorance of the expert.
    How Does AI Actually Work?

Key Moments

  • AI Explained00:12
  • Generative AI00:55
  • Danu Banga01:14
  • AI as a System04:19
  • Deep Learning06:12
  • Emerging Properties20:31
  • Productivity Boost28:28
  • AGI Complexity44:31

Words per Minute Over Time

Vibes Breakdown

Related Episodes

WWDC Predictions and the New Motorola Razr Plus
June 02, 2023
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:32:31
WWDC Predictions and the New Motorola Razr Plus
Hey ChatGPT, Summarize Google I/O
May 17, 2024
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:53:19
Hey ChatGPT, Summarize Google I/O
Google I/O: Pixel Fold and Google AI
May 12, 2023
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:24:38
Google I/O: Pixel Fold and Google AI
Pixels, Tri-Folds, and Brown iPhones!
August 23, 2024
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:28:18
Pixels, Tri-Folds, and Brown iPhones!
Are AI Browsers the Future with Josh Miller
June 02, 2025
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
58:56
Are AI Browsers the Future with Josh Miller
Don’t Ask Google's AI for Advice!
May 31, 2024
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:39:24
Don’t Ask Google's AI for Advice!
Tesla's We, Robot Event: Let’s Not Get Nuanced
October 18, 2024
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:39:27
Tesla's We, Robot Event: Let’s Not Get Nuanced
The Google I/O Episode!
May 23, 2025
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:29:29
The Google I/O Episode!
ChatGPT: Should Creators be Worried?
December 09, 2022
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:16:00
ChatGPT: Should Creators be Worried?
Google Leaked Its Entire Pixel Lineup
March 17, 2023
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:04:58
Google Leaked Its Entire Pixel Lineup
Waveform Podcast Live at SXSW 2025!
March 11, 2025
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:03:06
Waveform Podcast Live at SXSW 2025!
Asking Doctor Mike About Health in Technology
June 23, 2023
Captions not detected. You can watch the video, but not search it. If you think this is an error, contact support.
01:23:05
Asking Doctor Mike About Health in Technology