In this episode, Cory Minton, Brett Roberts & Kyle Prins chat machine learning and deep learning with Keith Manthey. Keith is the Global CTO, Unstructured Data Solutions at Dell EMC. We talk about the differences between machine learning and deep learning, the architectural tenets of deep learning, and different ML and DL libraries. We also chat about some great customer use cases for these new technologies, such as building movie trailers using artificial intelligence.

 

Transcript
Cory:
[0:06] Hi folks, this is Cory Minton with the Big Data Beard, along for another exciting ride and journey through exciting happenings in the big data ecosystem. I'm joined today by a couple of bearded gentlemen, Kyle and Brett, and we are excited, because there are a lot of hot topics in big data, and plenty that have come and gone, but the one that is in vogue right now, probably more than ever, is this concept of deep learning. We're seeing deep learning, machine learning, and AI confusion everywhere, and we wanted to spend a little bit of time with somebody who can help us unpack and understand, really from that architect and data engineer perspective, what all this stuff means and what the architectural tenets of deep learning are, and help us understand how we can help our enterprises achieve their objectives with deep learning. So I'm very excited to have joining us today Mr. Keith Manthey, the CTO for the Unstructured Data Solutions division at Dell EMC. Keith, welcome.
Keith:
[1:08] Thank you.
Cory:
[1:11] Keith, I wanted to talk with you a little bit about, again, deep learning and machine learning, super interesting, and we read your blog post that was really the basis for the arc of the beginning of our conversations. One thing I like about it is you start out kind of unpacking this thing for me. But before we unpack it, what I don't understand is: why is machine learning and deep learning, why do you think it's so hot right now?
Keith:
You know, I think the best way to describe it is we've been fascinated as a culture with AI for a long time. If you go back and look at it, we still hold the Turing test, something that was invented in 1943 if I remember correctly, as one of the pinnacles of whether you've succeeded. So our benchmark here is seventy-some years old, and we're starting to achieve a level of capability that can ultimately achieve it. And so what's interesting is machine learning went through being in vogue and certainly had a lot of great press. But it also had a very, very dark side, and I say that not in a negative way, but if you look at some of the early iterations of machine learning, there was a whole fascination around detecting faces, detecting pictures, detecting different things. And one of the real benefits was, if you had something that it could fully construct, like a complete picture, or the picture of a child, it was really good at sort of assessing it, detecting age, detecting emotion. But if you had something it couldn't account for, so maybe a wounded warrior with a traumatic brain injury, it wasn't able to handle the nuances. It sort of went through a phase where machine learning took a dark turn, because it couldn't account for the bad data, the bad data being something not perfect in a picture.
[3:12] And now the reason deep learning is suddenly all in vogue is people are starting to realize that you can create a model where you can actually allow it to discount parts. You can also create a model where it doesn't need everything. A great example, one of the pictures I use in talks a lot, is literally a picture of two legs of a chair. Deep learning has figured out that that's a chair. It doesn't need the part where you sit, it doesn't need the back, it doesn't need all four legs, it literally just has two legs of the chair. And as we talk to businesses, to different consumer groups out there, to customers, the number one thing businesses tell me is that they have a lot of bad data. And so if there's not a way for them to have the capability to discount the bad data, the results aren't going to be there. So that's sort of why I think machine learning has caught up, and the things that we've been asking about for years are now here, if you want to look at it that way.
Cory:
[4:16] Yeah, it feels like we came out of this, I think I've heard the term a number of times, like a machine learning winter or an AI winter, where there was a peak in interest as digitization of information sort of started, and then, like you said, there were some struggles and stumbles along the way, combined with, I think, science fiction, where we saw a whole bunch of really bad examples of what AI could turn into. Thank you, Skynet and Terminator, you jerks. And then I saw this thing, I actually saw a talk by Jeff Dean from Google the other day, and it's like the ubiquity of data now, the amount of digitized data, has reached that critical mass that was important for machine learning and deep learning to take off, along with the ubiquity of computing capabilities and the frameworks that could handle large amounts of data and develop models that, as you said, were good. It's all kind of come to a head, so it feels like we're in kind of this era, this deep learning spring, that's ahead of us.
[5:23] I guess one thing is that we tend to kind of lump them all together, and I'm curious your opinion on the differences between them. If you're a data engineer, kind of an architect, how would you understand the difference between what's machine learning versus deep learning?
Keith:
Sure, so if I'm going to break it down, I'll do the full categorization, because I often get asked, like you said, to sort of define the different categories for what they are. I love your reference, because it really calls back to forward thinking that was never something we could actually achieve. Machine learning was, you know, if I went in and did any kind of Hadoop model, whether it's Mahout, Spark MLlib, or any other technology, and I wanted to do some form of bucketing, I wanted to potentially take a pile of data and classify it into a few different strata and then try to assign maybe some values to that, using either a truly unsupervised type of technique or maybe a supervised one, but the focus there was that the engineer did all the feature selection manually. The libraries really were workhorses; it wasn't a framework where magic would just happen, and I don't say that in a negative way, but there wasn't a whole lot of dials or hyperparameters or ways that you could go in and tune it.
[7:12] You could choose ensemble models, you could choose different ways to do different types of models, but there wasn't a lot of granularity that you could dial into what you were doing. You couldn't discount bad data unless you threw it out ahead of time, you couldn't discount some of the values, and so you took whatever came out of it in some of those early iterations. Likely, if you did any kind of machine learning up until probably two years ago, it was machine learning no matter what it was.
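To make that "manual feature selection" era concrete, here is a minimal sketch of the kind of classic Spark MLlib workflow Keith is describing, where the engineer hand-picks the feature columns and the library fits a standard classifier over them. The file path and column names are hypothetical placeholders, not anything from the conversation.

```python
# Hypothetical illustration of the "classic" Spark MLlib flow described above:
# the engineer chooses the features by hand, then fits a standard classifier.
from pyspark.sql import SparkSession
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.classification import LogisticRegression
from pyspark.ml import Pipeline

spark = SparkSession.builder.appName("classic-ml-sketch").getOrCreate()

# Assume a table of labeled records; the path and column names are made up.
df = spark.read.parquet("hdfs:///data/labeled_records.parquet")

# Manual feature selection: the engineer decides which columns matter.
assembler = VectorAssembler(
    inputCols=["age", "income", "visits_per_month"],  # hand-picked features
    outputCol="features",
)

# A classic bucketing/classification model with only a few knobs to turn.
lr = LogisticRegression(featuresCol="features", labelCol="label", maxIter=20)

model = Pipeline(stages=[assembler, lr]).fit(df)
predictions = model.transform(df)
```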
[7:41] What we're starting to see in the deep learning realm is usually some form of construct; there are lots of them, and lots of names that go with them. Additionally, they will have some form of recursive capability. It could be called a convolutional neural network, it could be called a recursive neural network, it could be called a recurrent neural network. I've seen a lot of those, and you could have different types of capabilities, but really what it's trying to do is emulate sort of how the brain works. So you'll see a series of passes and different layers, and the idea is that instead of trying to go through and do it all in one large massive pass to try and categorize, if you want to go back to the infamous Iris data set, I think everybody who's ever done a machine learning course has played with that flower petal data set, if you were to do that with deep learning, it now starts to take over some of the hyperparameters. You can start to actually allow it to do feature selection, and what I mean by that is it will have the ability to discount data that maybe doesn't contribute to the end goal. And so there are a lot of libraries out there. Some of them are in the Hadoop and Spark spectrum, like Intel's BigDL, you have Deeplearning4j and some of the others, and TensorFlow on Spark, that you'll see in the Hadoop and Spark landscape.
[9:15] You also are starting to see a real big influx where HPC, high performance computing, is back in vogue, and you'll see libraries like TensorFlow, Theano, Torch, PyTorch, and Caffe2, and those are really focused on the high performance compute side, where you have tiers of passes within a high performance computer. And so that's really been the evolution of the last two years. Almost every machine learning book that I've actually read or looked at lately really starts to include all of the deep learning; even if they say machine learning, they're starting to call out the deep learning embedded into it. I know that's a long way to answer, but really, anything where you see the term neural net, you're probably doing deep learning.
Cory:
[10:04] Okay, that makes sense. So machine learning, right, is applying old math in new ways; deep learning is now, how do you automate and use maybe old math, maybe new math, maybe new programming capabilities to allow the model to improve itself? The ability to go back over the data more than once, not so much more than one batch, but sort of making separate passes based on some learning criteria and models. That's really cool, I get that. Now, one thing that I generally think, and maybe you can tell me if I'm wrong, is that whether it's deep learning or machine learning, or frankly you could even argue some of the big data stuff, there seem to be some architectural tenets, some macro patterns, that emerge across all of those, and maybe even in the HPC space. And in your blog you talk about sort of these four pillars of architectural tenets, and I want to unpack a little bit of those. So tell us, what are those critical kind of architectural components that you see, and are they kind of generally true of both ML/DL and big data?
Keith:
[11:13] Sort of, and what gets interesting is a lot of it has to do with how the libraries are built, and some of it has to do with the architecture and ecosystem that lives below it.
[11:30] Compute being probably the most fundamental one that everybody thinks about, and I say compute specifically because you have a standard CPU with dies and cores. You also now have the rise of video cards, GPUs as they're otherwise known, where you now have the ability to use video cards with, you know, a very limited capability. And I'm not knocking them, they're not really great at doing a lot, but they can do a lot with a little. And just to give you an idea of the magnitude, which is why we're starting to see GPUs take off: if I have a top-of-the-line two-socket, two-die, 36-core machine out there, that will give me 72 threads.
[12:31] If I take two GPUs, standard video cards...
Cory:
[12:39] You can't get the cheap ones anymore, buddy, they're all going to Bitcoin miners. You've got to buy the high-end ones.
Keith:
[12:46] Very true. And so, from that standpoint, two GPUs will deliver you, and the math, everybody's got different math, but it's very easy to go in and back into it, about 344,000 concurrent processing threads out of two GPU cards. And so what you're seeing is, really, when you break down some of the libraries out there, they're still very, very tailored to CPU compute, and there's nothing wrong with that, it's just a different model and a different approach. A lot of what we're seeing is on the HPC side, and I expect at some point you will start to see it sort of bleed back into the Hadoop and Spark layer a lot more, just because the sheer power ratio of a footprint of two GPUs that can deliver 344,000 concurrent threads is just unheard of. What you're able to then do is take a standard model and break it into really fine components. And so that's really sort of the difference in what we're seeing now. If you look at it from an architecture perspective: I'm going to throw a Hadoop workload out there, I'm going to throw a machine learning or deep learning job on it, I know the resources in the cluster, I know the storage in the cluster, I know the bandwidth in it. And so that's sort of a fixed capacity, and it's a little more in-the-box, nothing wrong with it.
[14:22] What we're seeing on the HPC side is that it actually can be dialed up and dialed down, and while that sounds weird, as you go to build some of these models you can intentionally amplify the effect. What I mean by that is, if I want to put in a series of text, there are ways that I can build a model to actually make the output size 10x, so it truly will allow me finer granularity. The same thing can be done with a picture, so I can actually blow it up or I can shrink it; there are ways, and subsampling is the official term, and the way you do it is through padding. And so what we're seeing is you have that ability with the compute and the multiple tiers now, with this massive capability at very small cost. I mean, two GPUs in the grand scheme of things is not a lot of money, especially to the Bitcoin miners, they're not a lot of money. And so what we're starting to see is you could have a rack of very, very new servers with each of them having two or four NVIDIA cards in them, and you can do a whole lot of damage.
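One plausible way to back into the numbers Keith quotes, 72 threads from a two-socket 36-core server versus roughly 344,000 concurrent threads from two GPU cards, is sketched below. The per-card SM count and threads-per-SM figures are assumptions about a particular NVIDIA part; as Keith says, everybody's math differs slightly.

```python
# Back-of-the-envelope math for CPU threads vs. GPU resident threads.
# The GPU numbers are assumptions (SM count and threads per SM vary by card).
cpu_sockets = 2
cores_per_socket = 18          # 36 cores total
threads_per_core = 2           # hyperthreading
cpu_threads = cpu_sockets * cores_per_socket * threads_per_core
print(f"CPU threads: {cpu_threads}")                  # 72

gpu_cards = 2
sms_per_gpu = 84               # assumed streaming multiprocessors per card
resident_threads_per_sm = 2048 # assumed max concurrently resident threads per SM
gpu_threads = gpu_cards * sms_per_gpu * resident_threads_per_sm
print(f"GPU resident threads: {gpu_threads:,}")       # 344,064
print(f"Ratio: {gpu_threads // cpu_threads}x")
```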
Cory:
[15:40] Network is key. The other thing is, you can do a whole lot of damage to your power supplies.
[15:49] Those GPUs are power suckers. Actually, one thing that's interesting: I've seen a lot of the work that the team from Google Brain outlined in their year in review for 2017, and one thing I noticed is they talked a lot about the TPU evolution and kind of the TensorFlow cloud. What are your thoughts on those purpose-built chips, things like, you know, purpose-built ASICs and FPGAs, in the context of machine learning and deep learning?
Keith:
[16:18] So let me be very, very specific. Today everybody refers to the GPU, which is a single product from a single manufacturer.
[16:32] Make no mistake, that is not lost on every competitor out there, so to your point, now Google has a TPU. I cannot say what I know, but I'm aware that the competition is coming out with their own. So yes, we call it Kleenex even when we mean a tissue. No matter how much of a lead there is, over time there will be a series of specific ASICs and graphics accelerators and different types of cards coming out, all of them solving the same purpose. At some point I think we'll probably need to come up with some consistency and commoditization across them. As far as languages across the neural toolkits, whether it's TensorFlow and the others, there will have to be a reckoning. But I think definitely in this space there will be a lot of competition among the specialists, because besides Bitcoin mining, we are starting to see a lot of interest in workloads that are massively parallel, and actually the proper term is embarrassingly parallel, which is actually the second architectural tenet. Which is, you know, whether I dial up with padding or dial down, whatever I do with a picture. I love examples, so let's pick automated driving; that's one of the use cases I get a lot of questions about, and for the record, I do not believe in autonomous driving, because I think that will stay in the courts for probably the next decade, really from the whole idea of,
[18:11] it'll probably take the courts to figure out what happens between whether I hit that brick wall or hit a person, because, you know...
Cory:
[18:20] Yeah, we talked about that. The ethical questions around autonomous driving are interesting, and it's one of those where you almost wonder: do the underlying ethical decisions that have been made in the models have to be something that is clearly exposed to the buyer? Because that example you just gave is one I've used when I had a conversation with the CTO of Hortonworks, and I asked the question, like, shouldn't I know if whichever brand I choose is either going to kill the dumb schmuck behind the wheel, or is going to allow me to live and take out the pedestrian? Like, shouldn't I as a consumer know which decisions were made, or is that something that the government should mandate for me?
Keith:
[19:03] That's a great question. That is a fabulous question.
Brett:
[19:06] Yeah, I'm shocked already. We haven't even had a self-driving car out there in production, so to speak, and they're already talking about taking the steering wheel out in, like,
[19:16] what, 2020 or something like that? It's pretty close to being, the idea that it will be good enough to have no steering wheel, no accelerator, no brake,
[19:24] three years or two years down the road at this point. So Keith, I agree with you, I think this is a little bit further down the road than what everyone says with the self-driving cars. But I do have one question for you.
[19:37] We talked about how we're now in this deep learning spring, and winter has come and gone, and there were some constraints, compute and data, that caused,
[19:47] I guess, the peaks and winters in the past. What do you think the next constraint will be that could
[19:53] slow progress or keep us from evolving even more with deep learning? Is it a
[19:59] power problem, is it a cooling problem with GPUs, what do you think that is?
Keith:
[20:05] I honestly think all of those are issues. The bigger issue to me, and this is actually the reason why I wrote the blog, and it's where I spend most of my time, is that when you look at "I want to build a model," there are probably a hundred really great courses about how to do TensorFlow, or if I want to look at how do I code CUDA. But there's no real great dialogue going on in the industry today about how you look at your network, what types of architecture you need to build to handle this, what the storage requirements are to be able to feed a model, how you know when you have a bottleneck, and what your GPU is doing. Where is the slow wall clock? We have all those tools in a lot of the existing infrastructures, and a lot of work has been done in prior generations to put those pieces together, but today there's not really that conversation. And so, really, I think the biggest challenge is we just need to have an open dialogue about what the impacts of different models are, how you account for those in your network, how you handle the scale of what your developer might do. If I'm a network tech or a storage person or an infrastructure person
[21:45] and I don't really understand the math behind what he's trying to do, it's not hard to crush a network or crush the storage with the capabilities that are available today. So I think that, to me, is the big challenge: we need to have a really good dialogue about how we scale this.
Cory:
[22:06] So on that, one question. You said you can crush a network and crush storage, and I want to go back to the embarrassingly parallel thing and ask about that. How are you seeing people assess that? Because that, to me, feels like one of those big gaps. If I'm a systems engineer, or an architect or a data engineer, and I'm having a conversation with, you know, IT operations teams, or even a cloud provider where I'm going to buy this stuff as a service, what do you start to know, like, what
[22:36] you need, or how do you know whether the system you maybe have already built, whether it's Hadoop-based or something else, will work? Do you know of ways that are good assessments of "this is our problem, this is where we should invest," in terms of, oh, we have a network problem, we have a storage problem, we have a compute problem?
Keith:
[22:56] Not today, and that's part of why I started the conversation, and really where I spend a lot of time is actually helping folks understand what they want to even do.
[23:13] Because I think for a lot of folks we're still in the growing phase. The bleeding edge has started to adopt it and get it into production, and the leading edge, so probably like 2 to 5%, are now starting to kick the tires and put it into live use. And so, to your original point, we don't have a lot of the capabilities and understanding out there, and that's sort of where I'm pushing: we need to start that conversation. Otherwise it's going to be a quick migration and then a fall, because our infrastructures won't be able to keep up, because we just blew up the data center. We need to have a way that this becomes evergreen and continues to grow, and that's sort of the conversation we need to have: just, what is the impact of running, you know, a convolutional neural network on a given architecture, what is the impact? And that's sort of the conversation that's not being had. Really, I think there needs to be tooling, but there also needs to be a conversation about how this really works, because I think even if you Google it you don't get a really good answer.
Kyle:
[24:31] Hey Keith, so adoption, you know, it's huge on the fringes and it's growing, but what do you see being the tools that
[24:40] machine learning and deep learning will standardize on? And what
[24:45] applications do you see companies starting to use machine learning and deep learning with to achieve the most success?
Keith:
[24:54] So what's interesting is, in almost every conversation I've had to date, no one really wants to standardize on a single thing. And the reason why is that certain libraries, with certain models, have stronger strengths.
[25:11] TensorFlow is pretty much guaranteed to be in everybody's arsenal. It is the one that probably is the most referenced, used, tested, and driven. We see there are some reasons why people would want to use Caffe; it tends to have some interesting features around some of the computer vision problems or image recognition that people tend to like. There are some other libraries, and Theano tends to go really, really well when people are doing a lot of natural language processing. So what I've seen in almost every one of my conversations is that the goal is to stand up an architecture where they could deploy any of those on the same architecture, and have any of those just be tools within an ecosystem that lives on that set of architecture.
[26:08] So far it's proving to be a doable feat. So, you know, I would say TensorFlow is almost guaranteed to be there in everybody's arsenal, and then you usually have a few more in there. And to name some of them: there are probably always going to be, like, if you're in media and entertainment, a couple of specialized ones within the media and entertainment industry. I expect we will start to see some bespoke ones come out, because we're starting to see a lot of people open source some of their toolkits, so I think we'll start to see more toolkits. A year from now there's going to be a frothing, a proliferation; whether that collapses at some point, we'll see.
Cory:
[26:45] So let me ask a dumb question: what's the difference between a library and a deep learning framework?
Keith:
[26:51] So a framework would be sort of how it all works. The ability for me to spin up TensorFlow and run a model comes with its own libraries. There are specific libraries within TensorFlow, and if you use Keras, which is another higher-level API, you can actually go in and dial specific features. And so within there you'll have a lot of the same models, and there might be some specific permutations of those. In some cases it actually makes reference to external libraries. A great example: most of the deep learning world is very, very Python-centric. They take advantage of some of the tools like NumPy and pandas and others out there, so you've got this rich ecosystem. So while TensorFlow would be the framework on which you run, you might actually be referencing particular elements within there, whether it's NumPy or sklearn inside Python, or something straight out of TensorFlow itself. So it's all mingled together, not in a bad way, but yeah, they're sort of highly dependent on each other.
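A short sketch of the "mingled together" ecosystem Keith describes: pandas and NumPy load the data, scikit-learn handles a preprocessing step, and Keras provides the higher-level API on top of the TensorFlow framework. The file name and column names are placeholders invented for the example.

```python
# Illustrative only: one pipeline touching pandas, NumPy, scikit-learn, and
# Keras/TensorFlow -- the framework and the surrounding libraries intertwined.
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler
from tensorflow import keras

df = pd.read_csv("training_data.csv")            # placeholder file
X = df.drop(columns=["label"]).to_numpy(dtype=np.float32)
y = df["label"].to_numpy()

X = StandardScaler().fit_transform(X)            # scikit-learn preprocessing

model = keras.Sequential([                       # Keras API on the TensorFlow framework
    keras.layers.Dense(32, activation="relu", input_shape=(X.shape[1],)),
    keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X, y, epochs=10, batch_size=64)
```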
Cory:
[28:12] So you went through kind of the three baseline components or architectural tenets: really good durable storage that's embarrassingly parallel and high-bandwidth, which means you've got to design that right; compute, which might not be GPU versus CPU as much as it is a mix of the two depending upon the models and the libraries; then you've got the frameworks, which seem to be varied, and at the very least, you said, growing, probably with a little bit of frothing, which I love that concept, as you talked about those frameworks and those bespoke tools coexisting and cohabitating on top of that underlying architecture. And what you need there is some sort of a management and orchestration set of tools, and you listed out a handful that I think are interesting, and I want to help understand, like, how important is that M&O stack in the deep learning construct?
Keith:
[29:12] Critical, and from that standpoint it also drives even deeper sort of architectural decisions.
[29:22] And what I mean by that is, I'd say there are a couple that we see as far as the management layer, which really, so TensorFlow would just be a framework within there, Theano would be a framework, and then it's how you schedule a job, like you would if you were working within Hadoop. Kubernetes is certainly one that I would say has probably got the hottest flame right now as far as a management capability. Mesos is right behind that, so Mesosphere and sort of that approach. There's BlueData, which is a startup that we partner with corporately, where you can do very similar types of things. All three of those rely on a container.
[30:12] So here you are with Docker or DC/OS or some form of a container, which of course now, if you're in a stateless container model, the ability to bridge to stateful storage, or overwhelming your containers with, you know, 10 million images of storage, which is really going to blow out your data center, really makes it sort of an interesting challenge. But the number one thing we're starting to see now is containerized TensorFlow at scale using one of those management layers.
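As one hedged example of the "containerized TensorFlow at scale" pattern Keith mentions, the sketch below uses the Kubernetes Python client to submit a TensorFlow training container as a batch Job that requests GPUs. The image, script path, namespace, and GPU count are placeholders; Mesos/DC/OS or BlueData would fill the same role with their own APIs.

```python
# Sketch: submit a containerized TensorFlow training run as a Kubernetes Job.
# Image name, command, and resource numbers are placeholders for illustration.
from kubernetes import client, config

config.load_kube_config()  # assumes a kubeconfig pointing at your cluster

container = client.V1Container(
    name="tf-train",
    image="tensorflow/tensorflow:latest-gpu",          # stock TF GPU image
    command=["python", "/workspace/train.py"],         # your training script
    resources=client.V1ResourceRequirements(
        limits={"nvidia.com/gpu": "2"}                  # request two GPUs
    ),
)

job = client.V1Job(
    api_version="batch/v1",
    kind="Job",
    metadata=client.V1ObjectMeta(name="tf-train-job"),
    spec=client.V1JobSpec(
        backoff_limit=1,
        template=client.V1PodTemplateSpec(
            spec=client.V1PodSpec(restart_policy="Never", containers=[container])
        ),
    ),
)

client.BatchV1Api().create_namespaced_job(namespace="default", body=job)
```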
Cory:
[30:45] So I guess the other thing we're seeing too, as you started to say, is that the lines are blurring a bit between what were more traditional Hadoop-centric
[30:57] orchestration tools, something like the Spark cluster manager or YARN, as you said, in the big data space.
[31:03] Then you're getting into the sort of container cloud orchestration, with Kubernetes, Mesosphere, Docker, and those worlds are colliding, right? Because you need to look at even things like Hortonworks HDP 3.0 integration with containers, all that stuff, all of that's a confluence. Now you look at, then, oh my gosh, we also have to include the HPC environment, so things like Bright Cluster Manager, which honestly, in my time, professionally, I see in as many of these conversations as any of the big data platforms. Are you seeing the same thing?
Keith:
[31:39] Absolutely, and so what we're seeing is HPC's use of NFS storage for deep learning. Those aren't new protocols; in some ways it's back to the future, because really, neural nets had their previous high point in the '90s.
[32:02] And then we sort of went off in a different direction, and then Yann LeCun and some of the others actually cracked the math problem that had really limited neural networks back in the day, and that brings us completely back to sort of the HPC construct. But a lot of the players that have been in use in the HPC space continue to migrate along, so yeah, they move right into the deep learning space as well, because again, they were there in HPC before TensorFlow was even a gleam in its mama's eye. And so a lot of those sort of traditional concerns, how am I going to manage 10,000 cores, however much storage, however many LUNs, managing the scratch and spill space under the compute, and the ability to reboot and handle that amount of cluster, require some form of command and control. And Bright and some of the others fill that command and control space, even with Kubernetes in there, because you've got sort of this "how am I going to command and control a series of nodes, to be able to reboot one or know it's down, and what's my alerting if I lose a node?" And so you've got all of the scale problems that were sort of solved in Hadoop one way, or sort of solved in HPC another way. What we're seeing is those residuals are still carried forward, because they're still needed, and now you just put more tools on top.
Cory:
[33:41] So if you're sitting in front of a customer, and let's say they're not one of these bleeding-edge folks already in production, maybe they're leading edge and they're in the labs, and somebody came and said, Keith, look, we want your professional opinion, what do you think is the right way to build something to meet what, like you said, is going to be a growing, shifting, juxtaposing set of requirements? What are some of the best practices that you're advising folks in industry on? Architecturally, what should they be looking at, what should they be using as very practical things to do today to build deep learning environments that will be useful both now and in the near term?
Keith:
[34:27] Great question, and it's the same advice I give when I do POCs. I'm in a number of conversations actively right now with companies that are looking to do POCs, and the first thing I help them do is say, great, let's not try to boil the ocean. In one of the active conversations I have going on, where I'll be in an in-depth conversation next week, we're literally going to go down and pick a few specific use cases. Not only are we going to pick a few specific use cases, we're actually going to look at what the architectural impact of those would be. And so in one of those cases, an image recognition problem, computer vision, and no, that is not teaching a computer to
[35:21] see the way you and I do, but it is text recognition, picture recognition, the ability to do that type of scenario. And so, back to sort of the math: take a 16K picture that has 132 million pixels, about a gigabyte of storage. Run it through a CNN, a convolutional neural network, and that will fire 132 million threads.
[35:55] So, one, can your storage handle 132 million concurrent accesses? Two, do you have 132 million concurrent processes that you're capable of feeding? It will handle it in sort of a waterfall as it fills up, but think through the ramifications. So if we're going to do a POC, let's figure out what picture size we have to use in the POC to be able to not crush the infrastructure. And so it is sort of a crawl, walk, run approach: let's figure out what it is you want to do, let's make sure the math behind the model will be supported. That way they can start to get comfortable with it; let's run it one more time and understand what that does to your network, understand what that does to your storage, understand what that does to your compute. Where are the latencies and the bottlenecks, how can you tell if your GPUs are busy or not, how can you tell if your compute is busy or not busy, where is the wall clock wait? And so what we're doing is a very, very scientific, very much from-the-basics approach, and we're doing it with active, real use cases that matter to the customer. So it's sort of helping build it up. I'd say the biggest thing I've seen that really causes a lot of drama for customers is they run a POC, think it's really good, then they deploy and try to run all of it at the same time.
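For reference, here is the back-of-the-envelope arithmetic behind the 16K-image example Keith walks through. The exact resolution and bytes-per-pixel are assumptions, chosen because they land close to his "132 million pixels, about a gigabyte" figures.

```python
# Rough sizing math for a single 16K image fed into a CNN.
# Resolution and bytes-per-pixel are assumptions chosen to match the quoted figures.
width, height = 15_360, 8_640            # one common definition of "16K"
pixels = width * height
print(f"Pixels: {pixels:,}")             # 132,710,400  (~132 million)

bytes_per_pixel = 8                      # e.g. 16-bit RGBA; varies by format
image_bytes = pixels * bytes_per_pixel
print(f"Image size: {image_bytes / 1e9:.2f} GB")   # ~1.06 GB

# If one thread (or one CNN input activation) is launched per pixel, the
# storage and compute fabric must feed ~132 million concurrent accesses.
concurrent_threads = pixels
print(f"Concurrent threads to feed: {concurrent_threads:,}")
```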
Cory:
[37:18] So one thing I want to dig into is that you work in an organization within Dell EMC that has some interesting products that really fill an interesting space in one of those four key components or tenets, architecturally, that we talked about, and that's the concept of durable storage that can handle high-bandwidth, embarrassingly parallel workloads. So help me understand this product that you and your team are out talking to the market about and selling: what is it, and how does it fit in the context of this architecture?
Keith:
[37:55] So what's interesting is, when we look at the problem, to your point, that we just talked through, and it's a problem not just in storage but in the network as well, it's about the ability to deliver a certain concurrency level across a given form factor. So how many gigabytes per second can I deliver in a 4U infrastructure? And then, not only that, how many concurrent connections to any given subsystem or file can I handle on any 4U chassis? Those are the two things that I spend a lot of time playing with, what I call the sharp edges: can I unleash a million concurrent connections against one 4U?
[38:44] Some of that sparked math, some of that sparked trial and error to figure out. And so yes, we have been looking a lot at it, mostly because we've had some bleeding-edge customers who have actually started using it and who have shared sort of their journey with us. Not every journey is totally seamless, but at the same time it's definitely been an interesting eye-opener. But what's interesting is our product's heritage comes from sort of the, what I'll call, media and entertainment approach. So if you want to do streaming sound or streaming media, you can't really have a whole lot of locks on the file, or you're suddenly going to have a real problem when everybody on the Internet decides they love that new song or they want to watch the same media. And so there's sort of this idea that you can have a read without a lock, and that's how we've sort of approached it historically. What's interesting now is that, as TensorFlow and deep learning take off, that's actually a very valuable capability, because most of the file systems out there still, to use a database term, don't do it optimistically; what I mean by that is they assume every open may potentially do a write. So when you start to manage latches in the background, when you start to manage different lock mechanisms, at some point you reach a point where
[40:16] so many users in a given file causes a problem, and it manifests itself as latency. And if you get enough connections in there, you get what's referred to as an asymptote, which basically means it goes to zero and nothing moves. And so from that standpoint, what we're really seeing is that it's a fun, interesting time, because you now have things that were interesting in other segments that are now extremely valuable in this space. If you go back to The Innovator's Dilemma, it's really how, you know, he says everything gets crushed by something that's been applied to a new use. And so, frankly, we got interested in this because it plays to our heritage, where we come from; we have Isilon and more. But at the same time, a lot of our customers were the first people to start applying this on it and then came back to us and said, hey, we did this, we love it, but can you look at this, can you look at that? And so that was sort of the genesis: they actually started doing it even without our knowledge, and then started saying, hey, it works great, but can you help here? And so that's actually what started my journey into this, it was in order to help them, and then we started putting rigor around it. And to your point, now that deep learning is sort of one of the hottest topics in the ecosystem, it gets very, very interesting.
Brett:
[41:44] Keith, I want to shift a little bit and talk about the business. You're talking about the bleeding edge, and as the use cases for deep learning grow more and more, and as organizations begin or continue their journey with deep learning, can you share some cool stories
[42:01] about what you're seeing out there?
Keith:
[42:05] Sure. So I've got some customers that I'm probably not able to reference by name, but a couple of them have weaponized, to use my term, TensorFlow, and have some very, very big installs. And what's interesting is they were actually doing it on some older hardware and it was really working pretty well, and so now we're looking at what some of the Gen 6 hardware would even bring them. And so that sort of started our understanding, and now we really have, in every industry, a couple of the true innovators. What I mean by that is I've got a couple in financial services, we've got a couple in media and entertainment, we've got a couple in the life sciences and healthcare space, where they're on the journey. And what I mean by that is,
[43:07] the conversation in a couple of them really started with, you know, "what do you know about this?", and that quickly morphed into, I personally can have a conversation all the way up to the management and orchestration layer and talk through use cases and architectural considerations. How does that marry up with your use cases? And it's really turned into a dialogue where we corporately are having dialogues with them about what it is they're trying to do, and then we can break that down into the use cases and give them guidance about here's how to run a POC. In some cases we're helping them run the POC to understand how they can consume it and how they're comfortable with it, and so that's sort of driving it. But anyway, right now we're sitting on about 15 different organizations, all of them very, very large, and when I say large, billion-dollar-plus, who are all really trying to figure out how they bring this in and not really break their architectural tenets. They want security, they want backup, they want all the good practices, but they also want to be able to do this, and some of the scale aspirations I've heard are off the charts. They're not there today, but for some of the things they want to do, the size would be massive.
Kyle:
[44:35] Hey Keith, what industries are you seeing embrace deep learning and machine learning quicker?
Keith:
[44:43] Probably the top ones I've seen are financial services, media and entertainment, oil and gas, and healthcare and life sciences. In medicine, the ability to exploit some of the PACS images we're seeing, or even medical swabs, is off the charts. In financial services we're starting to actually see governance and regulatory use cases,
[45:20] and it is really taking off. Media and entertainment is really adopting it. Oil and gas is using deep learning, and in that case it's actually for targeting oil reserves rather than just working the existing ones. So we're really starting to see a true cross-pollination across all the different industries. And auto is probably the oldest; I mean auto, if you look at it, what they refer to as advanced driver assistance is probably the forebearer of almost all of it.
Cory:
[45:54] You bring up one that doesn't seem immediately self-evident to me, which is media and entertainment. So unpack that a little bit for me: what conversations are you having in that space where
[46:07] they're thinking, deep learning, that's what I want to do?
Keith:
[46:13] This one caught me off guard at first, until I started digging in and started hearing what they've done with it. So there are two and a half sort of use cases that I've seen. One of them I find completely fascinating, and it's sort of a labor arbitrage approach. There is a major M&E house that does movies, and they literally had deep learning create a trailer for their movie. What they did was they fed it a series of clips from the movie and then let the deep learning algorithm figure out how to stitch together a trailer that met a certain time criteria and a certain set of element criteria. What I've heard is that if an individual human had done it, it would have taken 30 days; the actual deep learning bots did it in less than 24 hours. And actually, what I hear is it's as good as if a human did it. The other use case is a little more business-focused, and what I mean by that is, what we're finding is they're taking movie successes, which is a great way to say which movies made money and which movies lost money, and what elements were included in those movies, so plot and different things, and they're actually using deep learning to sort of adjust the flow and adjust the elements within movies
[47:48] to potentially include some elements that may have been in a similar movie that made a whole lot of money, that was a box office success. So they're actually using it behind the scenes to sort of craft the way a movie might flow, or some scenes where you might want to include a different element, and really they're using it to help figure out how to have another box office hit.
Cory:
[48:13] That's crazy, because it just brought to mind something we talked about in our very first episode of the Big Data Beard podcast: we talked about how some of the movie houses were talking about putting cameras into the theaters to monitor how people reacted, you know, facial expression, body language, everything, their reactions to the movies, so that they could understand the emotional response that their product creates and sort of do some interesting analysis of how to make better movies. That's just a really interesting use case. Wild.
Keith:
[48:54] It plays into a lot of the use cases that started in machine learning and now really have carried forward into deep learning. They're really less of what I'll call the classic "I want to grow my pie" or "I want to save cost" ones, and more on the angle of "I want to do what I do, better." And so that's really what we're seeing in a lot of these cases: like this one, the goal is to make more money, but they are a lot more targeted, to your point, to understand how do we make better movies, how do I run my business better. And that's really what we're seeing in a lot of the financial services use cases too; exactly that, how do I run my business better, is really the strongest use case in financial services right now for deep learning.
Cory:
[49:46] Interesting. Now, one thing I do know about you, and this is publicly stated, is that you have a background and history working with the federal government. I'm guessing there are a few agencies who are doing some things in deep learning.
Keith:
[50:05] There are but I can’t tell you who.
Cory:
[50:06] I wasn't trying to mess with your past or anything.
Keith:
[50:11] Well, I mean, let's be really clear. You've got capabilities that are really in the open source at a very reasonable cost, so licensing, TensorFlow is free. The ability to scale unlimitedly on lots of data, and signal and noise... yes, I would say you would probably expect that the signals intelligence agencies would be very interested in this.
Cory:
[50:44] That's funny, you just used the SiliconANGLE theCUBE slogan, "we extract the signal from the noise."
[50:54] Well, Keith, this has been fun. I enjoyed the conversation because we learned a bit more about the architectural tenets that make up deep learning environments. We talked about architectures that are important, things to plan for, not boiling the ocean, focusing on making the business better, using tools that are broadly available today, and keeping an eye on some of those trends in technology like autonomous cars and others. Before we jump too far, I want to say one thank you, but first I wanted to jump into a final session with you called rapid fire, where I'm just going to ask you a handful of questions. What I want you to do is sit back, relax,
[51:32] and give me the first thing that comes to mind whenever I ask you a question. You ready? What year will Skynet go online?
Keith:
[51:43] 2100.
Cory:
[51:44] If you bought me a book, what would it be?
Keith:
[51:49] Probably How to Groom Your Beard.
Cory:
[51:51] Hey easy hot rod.
Keith:
[51:53] I've got a beard too now.
Cory:
[51:54] Proud of you, you've joined the club, so that's okay. Hang on, time out, okay, a beard, that's awesome too. Living in the Southeast, I've said this before:
[52:04] the Deep South, Dixie, is the place where data science is getting real. Anyway, thanks for that. Number three: what genre of music are you rocking right now?
[52:15] Classic. What is your favorite piece of utterly useless tech?
Keith:
[52:22] How do I define useless?
Cory:
[52:23] Yeah, just, like, a gadget, something ridiculous. It's like, I have this, but it's kind of dumb.
Keith:
[52:29] Because I have a lot of fun stuff. I'm trying to figure out... probably my smartwatch, because I haven't actually worn it in about eight months.
Cory:
[52:37] Yeah, that's sad. I think you're like the fifth vote for the smartwatch being just generally useless. So what is your biggest money pit right now?
Keith:
[52:47] Probably my toys, which would be either my mountain bikes or my collection of useless Raspberry Pis and Movidius chips and other things.
Kyle:
[53:03] What's been your favorite use of the Raspberry Pi so far?
Keith:
[53:06] Actually, I have a couple of, a few Raspberry Pis with Movidius Intel chips on them. If you know what that is, you can actually turn your Raspberry Pi into a deep learning machine for about a hundred and fifty bucks.
[53:21] And literally, I actually have it trained on a certain set of models, so I feed data into it and it runs some regressions for me on something that I do for kicks and giggles.
I’m a big mountain biker and so it actually chews on some of the data that comes off my Garmin.
Cory:
[53:43] That’s pretty cool.
Kyle:
[53:45] I thought I was doing well with, you know, virtualizing video games on that thing, but never mind.
Cory:
[53:49] That's awesome. So one thing I know is you travel a lot; are you going anywhere exceptionally interesting soon?
Keith:
[54:03] Let's see what's on my plate: love Seattle, I'll be there soon; love New York, I'll be there soon; love London, I'll be there soon.
[54:11] Going to Miami in there, and possibly going to Vegas. I mean, all fun places, actually, all the places I'm going.
Cory:
[54:20] Nice, I like it. Okay, last question. What show, and it doesn't have to be TV, could be Netflix, Amazon Prime, whatever, what show are you binging on right now?
Keith:
[54:31] Let's see, I'm trying to remember the exact name, it's the one where they make knives, the forging show.
Cory:
[54:42] Oh yeah yeah.
Kyle:
[54:47] Blademasters?
Keith:
[54:48] No, it's not that, it's the one on the History Channel.
Cory:
[54:50] Yeah, it's on the History Channel.
[54:52] Forged in Fire, there you go. Awesome, well, that's an awesome show, I do enjoy it. And just don't carry any large blades while on your mountain bike, you can hurt yourself.
Keith:
[55:01] Or in the airport they don’t like that.
Cory:
[55:03] Generally speaking, also true. Keith, thank you so much for being on today, this has been a super fun conversation. Kyle, Brett, thank you for being here. Actually, all of you, thank you for growing your beards and thank you for being awesome. Be sure to check us out on iTunes, and make sure you're back at the podcast next time.