[0:03] Hey folks this is Cory Minton from the big date of your team we are at strata data conference in San Jose California.
I’m joined by my almost bearded brother and shape of the few days he’s looking Lil Scrappy Rob hot welcome to show we also got two special guest today from Dell EMC.
These guys are actually what we referred to in the industry as unicorns cuz they’re data scientist so we’re out super excited we’re of a capture you and spend some time talking,
the temperature session this morning we’re going to talk about that today so thanks for being on with gotten Mauro Damo from Dell EMC is data science and Consulting team as well as whale in Chief data scientist.
You’re doing see welcome to the show gentleman.
[0:42] Grant will be here.
[0:43] I’m glad to hear that but thank you.
[0:45] Subway tell me a little bit about what you do for Dell EMC.
[0:49] Is how my name is waiting I am chipped a scientist will you see not LMC Consulting.
A walk is a client facing so we devised a solution and crew that continents strategy.
They are science and engineering for the customer and take then the truth we caught up with a temperature taker run the business inside out way too attached information.
[1:14] Excellent so that’s similar to something I built marzo.
[1:18] He is my girlfriend.
[1:19] Okay excellent so he’s an alumni of the shell we always like I was catching up with folks who he use those models tomorrow what are you do for Dell EMC.
[1:26] So why working today in Wayland at the science team so I’m working.
Provide service to our customers so we we are provided kind of solutions for a lot of different types of Industry banking Finance.
Retail and it’s kind of a customer’s we provide them like a.
The most advanced analytics and knowledge about to deploy machine learning and and this kind of think so.
[1:57] That’s what’s interesting cuz you know most folks when they think of Dell EMC they think you know hardware company right you got some action Technologies a lot,
as the companies are come together there’s a there’s a lot that the offer but Services specifically run data science I thinks it may surprise her folks so what are things that you guys were doing here that we saw earlier was you got selected to present,
are you scared that I thought was super interesting it was around how do you identify and maybe catch bladder cancer.
Don’t talk about what you were what what this what this talk was all about and how it how it came to be.
[2:31] So what’s happening that you have a sum of datasets open to public.
Nets game from Summer and diversity then I’ll decide to have the image from patients that has has or had cancer.
And also so do you do use that you have to use case.
We got began in 2016 and then we realized that we can do something better for try to write a prediction model that can classify specific type of.
Cancer sign and bladder and then in for the weed out any kind of human intervention so this is the way maybe we can apply to you soon.
On the domain but we can try to apply do sound devices or invented this kind of yogurt inside of a.
MRI machines that can provide you automatically watches the classic tee off the type of the cancer that specific patient has.
[3:27] So what so you say public images like what sort of pictures are we looking at here cuz you’re using what you’re saying is you using some sort of machine learning algorithm used built to basically look at a picture.
Without human intervention intervention I think you said.
And that it’s classifying help me understand like what like where do you get these pictures I should have opened it it said they let some pack little bit like what does that mean where they come from.
[3:48] So this is from Terry’s of some hospitals in one Brazil think so it’s 102 in United States.
They provide is e mc they open open source this.
[4:00] Image and video princess to diagnose up with you about this position so we have to buy at bi so we don’t know who is the patients O2 something that is confirmation.
Let Media providers to do open source data to try to get the community and try to do some analytics over there.
[4:18] Next so it’s an open data source so it but you said it doesn’t have any of the patient identifiable information so no no GDP GDP are concerned that’s good but so this is like a art of the possible like it wasn’t.
Was this a customer-driven thing where you had a customer that was trying to do this and you help them or was this more of like we wanted to show an idea of what we could do.
[4:40] Yeah it started with an idea that we like to show that we have this capitalism and skills to do that fake butt.
It’s it’s it’s gold Footers I like that spread to other customers to order Christmas like to have interesting about it so we’ll just start in January when we do sir have after we had this a use case.
Begin to start some positions with customers about this listing.
[5:03] These are practical Rock ideas also coming in.
I passed the paper of last year so we went sweet picture perfect. Do you say idea and the way your star offered Lisa 12 customer.
So the concert but he is not only for bladder cancer is All 4 or am I related so they can be identified as wrong so they have labeled information about those account of yourself then we can use outside PK has a training.
Total scale use up a security at Transcendence from the statistical but you’re learning and it was way too. Featuring hurt you and the whole 40 yard combine all this week.
Provides a week old actionable information which of facilitate in the sustained even distribution of a doctor’s.
So enjoy major city Potter concentration is really high you have Specialties that they can titanosaur these are number for.
I entered all the pincer but the all the other urban area Lucy’s or something that can be help her supporting system.
[6:09] Yes I did the formula can a developed so I don’t know what he’s word model that does that imply something else,
what did the South America developed then it will be kind of easily trainable to do other other kinds of medical research or more medical identification identification off different kinds of cancer or other things even.
[6:25] I got.
[6:26] Yes sure we can we can we can we can if we thinking about.
[6:32] Vector so we can run record at different types of organs so we can try to to to have different types of models that.
Predictive some type of cancer has different type of organs that we can figure out it’s just because.
We use at one specific playing off of the MRI image so we can I use all the type of planes.
So so what is the planes look like in the depredation we talked about Colonel planes and transverse plane so we can use different types of friends around we have 3 so we have we can.
For example we can have one Pacific breast breast cancer we can have three planes or three models that together run together we train together this models and then we build a simple motor that put all the street together and.
[7:18] We can try other types of retraining architectures on all the types of Architecture is to improve the accuracy of the model so this kind of think it’s possible to do.
[7:30] Yeah so I consider this a disease I need you to become a nation and.
[7:36] So that that application is Reedy abroad so you can use us as medical data that will be for all your cares industry.
[7:44] Basically the giant intake Mi machine to take those picture so the second why you so you can transform that we call that I’m serious later vibration data into 2D spectralink you speak I know you made you from there you can lay boy.
[8:00] That’s kind of cool that the other thing I thought about when you were talking about the.
No I’ve been under the three for the sagittal the coronal and the transverse so you could possibly be creating 3D versions of the model that are 3D images of the organ that you could then run a model can you run a model in 3D is that even possible.
[8:18] Yeah we can return of the treaty for example have a sum of image recommendations for when you talk about color images.
So is RGB so we each of these.
Red green and blue so which of these callers R11 Matrix inside of the counselor so it possible to do a 3D 2D game which one thing that I would like to to add this.
We are using just MRI in which but we have a pet scan we have different water types of devices that we can.
Don’t feel different models and then and then puts all the skin together and is this going to work.
[8:53] So tell me the name of the model cuz I think I wrote it is a multi is it multi no no no no meal convolutional neural network is it.
[9:00] It’s a it’s a congressional network is it that connection and the correctional not took his emotional relationship crashing so they don’t have to.
[9:13] Xbox now when you when you started thinking about it cuz I love you this very kiss very cool art of the possible like.
It’s almost like you build a prototype car cuz if they agree to let you know in like hey mr. Custer or you could do this and this is a way to leverage your team knows that you put up there was you started talking about architectural building that you showed some Hardware.
It looks like did you guys run this on like a desktop computer with a GPU on it or did you run into Datacenter like how big was this project and how how large-scale did you go with.
[9:42] So we we we working with the server server it says.
[9:50] A Chorus and a 380 City 4 gigabytes of storage.
[9:57] And we run into the two gpus agreed with two gpus.
And what is its cable we can’t because we was not so flow so just flew can I build like a country Malton distributed way.
So we can like I have a blister off House on the Left these machines and down train model of course which I have data and I’ll just try to do approach that you can.
Ron different models of different configurations simulation your hyperparameters and then you can I need the computer power cycles and this kind of thing that you going to need.
[10:32] Ryan so I one sings all that. You know my to you so you have need to have a minimum viable product.
These things are you can deployed in the city in the hospital you can also deployed in more like Urban setting that. Which is a.
Maybe use a family physician hot take care of the whole baited you you never said.
This type of machine does not take a lot of effort to install incomplete of course you need to have to be so cold out. Connectivity that that you have a lot to infrastructure that I can connect to the hospital Center technical support.
Butter for that particular one that you can start from that simple computer Asian and growth from it.
Even some of the customer to help when they would like to perform the POC they were not willing to a python larger pipe of because of cost or how much you might be right down the road so it so you better Asian is a bare minimum this is a meme I buy Apple products.
Who is this you can standing up in the world.
[11:34] That’s very cool this one things.
What things I think that maybe is a bit misunderstood in the the concept of like when a data scientist helps develop a model when you talk about deployment at the edge my mind goes to.
It from a hardware perspective in architecture if if the model is being trained in an in a Datacenter where there’s.
You know resources were that’s not Arts are you able to say you have a large-scale tensorflow environment or or whatever your chosen platform is you do your training there we have large amounts of storage and memory in computer resources that can do you do your work to go to the model.
What is the model for deployment is it light or wait at the edge meaning you know if in your case we talked about going to a rural environment where it lets say they have a specific.
You don’t think that type of machine or whatever imageclass that they’re trying to use to identify some specific thing.
Even need all that hard work doesn’t the motto is the model capable of running on a running in a much lighter weight weigh than it needs to run in a training set.
[12:27] Yeah I see now this is a good question basically a week actually / Motor Corporation that’s a whole Bravo deployment concert coming in user being where is the layer as well as wwh which is what I heard.
Been wheezing that have a we caught on the latex fabric.
And the date is February 10th or two we could have made out of the fabric and then finish that is called Arizona.
Who sings data and processing power.
It’s always in that you can share the accountant for them how do you spell particular Hospital have a bladder cancer am I.
You can HRT Transit model and move tomorrow because of the weight and that’s where it’s at a little complication for net worth.
Is total DPI phai independent you cannot. Translate that to which customer you can move it tomorrow though you’re not moving the theater.
So that’s number one the second one Lisa so not eat you can be tooth who sings Why you so we called her to let him transfer so you train the Tacoma model.
This, model may be recognized as I said all the things are by the DC area for a dimple in the Long Island before the place breast cancer is a one of the highly concentrated area then they can attach is a second they walk into it.
Training breast cancer specifically for that so now that you have a large Network and the large computing power can do we call that.
And then they tried that and the small one they can attach you that a general concept will use a smaller specific they have said that they have insurance through it so it is.
[14:06] It is a very versatile model in and out this way.
[14:09] That’s interested she brought up something called worldwide heard which I think we may be heard of before I want to tell me a little bit more about worldwide heard where that came from I just at a high level.
[14:19] Well I heard h o t is a panda by Patricia she is at the one that actually comes across as a concept of the current situation.
Most simple example of why I want to calculator brighter cleanser worldwide distribution how can I do it right now the French and the German British in North America South America what is Zone you can now move the data out.
[14:48] So in the way that you need to break a top reason to send her and the teacher be apart.
So what I will be are you say you so that we could have met earlier that I actually spend Eliezer categories so you don’t send me your whole thing formation.
You send me it’s okay. Worries and accounting belt the categories know that it will get it into the distribution now that I know the better cancer over the world.
So that’s the accountant over.
[15:14] So it allows you to basically use.
In some cases in some way have multiple Hadoop environment some sort of the large-scale perseverance but having communication between them pulling out a small number of key value pairs like likes accounts.
[15:30] Yeah yeah yeah I’m or less because we are not able to to retrieve the data from the data centers in and put on centralized at 4.
So what we are doing is redesigning now they’re all great analytical agrees that can do the following you can like send the information that is it is it cute or seen each of these data centers Dan do we have alpacas Central or mustard.
Are we going to push like a bush like a deer in the in the age we need to.
A processing and do all this to physics and send back to the to the central district is All rights summarized it so.
For example if you think about the mean or average or just kind of stupid statistics you need to do this kind of processor back and forth in between the edge and centralized it to not to.
[16:25] Sorry to not to have problems.
[16:32] Yeah it’s almost like you know if I think about the mapreduce process right yeah this map and you can spread it out amongst the workers it’s almost like you’re going to layer up and said.
We cannot produce across many men produce capabilities up is that an accurate assessment.
[16:44] Yeah we don’t move the data.
[16:46] Cuz you’re just you’re you’re you’re bringing back some amount of a summation or some calculation that’s very interesting so you did mention VMware in there what’s a VMware contribution there like.
Cuz of what most people think of him or they just think VCR or you know kind of those operations what is he and where is contribution in this context.
[17:02] Virtual Computing note so I basically across all the different. It has on you. But this cold and it is fabric.
Easter break they can have a week or the almost like Yellow Pages you know where is a resource so basically got in this particular sense how we feel the same thing called.
[17:22] Medical candidate is Factor.
[17:27] Midland Adidas Factory can you say this particular Leia when will provide to talk to not only that I may have made a pothole so that that test product these are three people in the center so now that you can use the resource in the dispersed for men.
Play Tempo you have cohesion and his Factory by the tester that the testing this they have said is belong to that is all.
So there’s no GDP are concerned because this person is physically there other users being well so we can have a cohesion processing.
[17:59] Okay I’ll help with Orchestra.
[18:01] It’s so you actually something there that I thought was in your presentation earlier so that the analytics Factory could you talk about that little bit because the the picture that I saw and the words are going to run with it was really cool and it could you describe a little bit more.
[18:14] Show in the 90s Factory was so this is a 2.0 specific for medical and it is Factory.
But he’s lying to you so you suck on site by superscalar basically if your phone into Super scared of pipeline you feel up the pipeline you can cut down those are so put latency so much.
So while we sit at the Wii building stage of you so we saw we clean the latest beat and the Premier League and the Usos versus minor.
Stop in the street because I opened the connectors and the wheezing the call he is missing his energy sign in ID and those are we going to have scientists and then the tester.
Hey. Has a cold portion so they cannot take it that way then put the medical part of this quite a lot of used case they need to be done yesterday so.
Subway Temple prediction and the house why do the physician nurses scheduling out why to Medical restocking how do I admit that a patient what is our only patient see.
So how do you say very cold outside he Pig and the sensor Cerner type of medical software.
By the way you feel. In this reporter is it because of congestion of an ad a taser case was a padlock so long they lost week all that action about window.
It’s always this then we know the how much are resources you need and how much time do you quit you spelled it all kind of it what was the testing quality across of the appropriate for you require one.
[19:48] So the concert by yourself is a picture of yourself on Victory point of view.
[19:55] Excellent that’s very cool so what’s next with that so you guys said you talked about you know you’ve got this this interesting model you develop your perfecting it right you’re getting better.
You got some technology underpinning at that that that’s pretty interesting right that can allow this model to be deployed and interesting ways across borders that’s all very cool.
What’s next like what are you guys going to do next when you get when you leave here and you and you take what you if you what you presented or maybe you’re already working at but like what what are you guys going to do next in terms of art of the possible or in like actual practical implementation.
[20:26] Yeah I think you said that’s great question I see a Visa. That’s why I saw you called up.
Still have 5:30 and she have the same mind about tomorrow you can describe a working progress by the basically skill out.
And not only that we can do we say one we already proved you have a minimum viable product you can do it now that you’ve always screw out what kind of super that you can have.
[20:54] So I bought the right bundle with my working with the hash and the Samaritan on the way in.
[21:01] And I order folks home today also a lot of people sorry if I don’t pop machine everybody here.
But we are we are we are at the first results of the project is a wee wee.
[21:16] We pick up our training model from resnet is that the one that would.
[21:23] Most famous architecture in their Network architecture.
That I forgot we’re having you much and I will what what it does it’s not recognize hard object inside of the image and we should do this in this afternoon at Boca lstm neural network.
Dad’s just want image give me the contest what the action side of Animation so for example if we imagine that we have.
Boy and the dad playing in soccer.
If I have a list of red nuts running and Pathfinder this image you going to see just a boy and a boy a ball and then the.
So in this case we going to Great captions we can create at the counter so we going to say okay Dad playing soccer with the boy so just using the image and using I miss Coco it someone of another set from Microsoft.
And so we are able to run these on 300 machines inside out for our lab it’s took 15 minutes to run.
And 81000 English so we classify do Saturday 81000d wages in 15 minutes running on 300 machines you use a tensorflow.
And I know you must contain a true we have similarities one container so so we spray we start the job writing this container and Elsa flow and all of 300.
Service and then we got this 50 minutes so he’s a very good approach because we we haven’t used gpus in this cage the CPUs because it was something that we like to assure that we can be.
[23:04] Fast enough to sport like a oldest kind of deep learning.
[23:12] Techniques in the CPU environment.
[23:14] Yeah so CPU and GPU there’s obviously erase going on there cuz machine learning yell in videos clearly doing some great things in the GPS face clearly and tells you know the LED in the space AMD certainly making some progress,
it’s just that you talk about hardware and you just put her in with any of the like the Google TP use like the tensor specific processing units yet.
[23:34] And not yet.
[23:36] Yeah that’ll be interesting to see what happens in one thing I took away two from a again to be the hardware geek in the room the the thing that you guys called out in the in the.
Presentation about scaling out performance design was.
You talked about the importance of memory which surprised me I guess I don’t I don’t guess I thought that memory would be the thing I thought GPU processing would be the capability how did memory impact your processing throughput.
[23:58] Yeah so in our case that we have a network have a thousands of different parameters to to update so.
So far as for me another case is better to have more RAM than Cycles because even have to take weeks or out not weeks but oral hours a day at least of them are going to run or you can bring them all.
If you don’t have RAM memory in your case this happened.
The tranny face is saw you give you a lot of memory ever in so you can confirm them all that has a lot of problems have to reduce the framework had to use this the layers so it’s a painful.
So sorry something that way like I like to do for him we run this all this is all good without.
To shrink the image just not used to the real date that really raw data so he’s at the rink because when you shrink your loss information so it’s something that is going to be good to try to run it is on a beaker.
[24:50] Yeah so it’s not good so it’s not just the number of machines in the cluster but actually the memory density per machine has an important feature which is good that’s a lot of folks.
When they thought about General kind of big data or close they’ve got a thought why you need fast CPU and recently passed this can Emery’s okay but it’s not like the big long pole in the tent,
I totally see that should be especially as you start to look at things like video and audio or video on and picture sort of as your source.
That becomes very interesting so you’re working on ready bundles for machine learning your dude can I come up with those next are you like are you working directly with customers on on projects that are interesting that you can extract and tell us about.
[25:32] That’s a okay yes we do for you then pull out is also his health care place to eat in addition to the ocean we also do I sear it later each EKG,
they only do you create a seizure when do you please take that to heart attack for them both those are specific that age should have come back. We call.
I just seen that ETA. They actually have very very similar driver.
In the way that we are biological entity which our sensitize in the log where the other might be at the screen the digital based on the sample rate.
So those things are increasing my work shirt and yeah they are song.
Some will cut that we cannot discuss further to Walgreens everyday that’s who I hope we lose application maintenance.
Airplane the poison pool and how to use potato what is a different altitude even the velocity.
Those are Samsung because the Rail Event detection though so you think when you have a vibration data.
And are those things I can be recognized easy D but you’ll have other people in them.
[26:48] And we we have we have a working working as partners so is a Healthcare company service.
[26:55] And we use a look at them Fallout 3 spark will you play a vital role inside of the partners to provide them like a some.
What kind of House of Lies they’re cold or how to how to help them into some kind of electric spark effect.
And after that we we are we are already working in the newspaper in the college.
[27:21] So it’s about tg-2 electrical supply Graham.
And I will try to creating we could all this all this information side off all these EEG files I think it’s through that 13000 patients and it’s more less something 200 gigabytes of storage memory does a really really really big.
And I we run these over spark and we use it at BDL has a pepperoni framework so we are trying to redo that prediction model for using try to predict seizures for patients use in deg.
[27:52] Christ Asia the other one you so so very interesting is airport we also are used to report a tip on the sensor popper Taqueria video potion.
And identified the event and then translate that you saw, and translator that you been.
Into a recorder to action about which is a user iOS ATM which is translated accounting.
But I understand what’s trending on the airport emergency menu so that they can translate that the event and Aunt were executed, I see you been analysis.
[28:25] Very cool so automating the response to what could be critical events.
She talked about you working on your next paper and it sounds like you some of the stuff is come from competitions where you want are you are you guys competitive data scientist are you guys doing things like a goal and and competed in those sort of things or is this a different competition framework.
[28:44] Yeah that’s a good question it is at LMC I imagine once a year that’s our major one because of that these are persistent ranking of our of paper we can refer to that that’s why paper.
How to pass the two-year are we have A2 paper I told you is a piece of paper or potato.
Why you so mad at you a photo customer trim model how do you protect the damn you so we could I find model recency frequency amount to remodel across a different aspect Rock the other uses a prototype off operator cancer.
[29:21] So how do how the folks that are you know our other customers that are people in the industry how do they engage you like how does how does somebody get your team and your folks involved to help them solve.
You know these problems that you guys are kind of you you’ve done something school art of the possible but how do they actually get you to come help them like what’s an engagement look like.
[29:42] That’s a good question.
[29:44] Engage my ears are going through we called the shelves Channel and then search Anna will reach out to us right now they told ya we stepping into the Seltzer Channel and now so if I say that say that I appreciate.
So that’s wrong as a customer I have neither the reach out to any of us or any of our sales Channel we will be our wedding at work I’ll talk to them and the Sheraton.
Identify the root cause and then execute a plan.
[30:13] Nice once again I think a lot of folks when they think of Dell EMC they don’t immediately think.
Wow these guys are could help me with data science problems I’m trying to solve so that’s good to know especially as you know these digital transformation objectives are at the top of the line for some of the folks.
So thank you guys for being on I want to shift gears here for a second when I get a little personal you have told us about what you do.
What value you’re bringing the customers and the interesting work that you’re working on which can’t wait to see what the next papers all about so this is called our rapid fire session so.
Basically it’s really easy all you got to do is sit back relax and tell me the first thing that comes to mind when ever ask you these questions and I’ll start with you and then we’ll go to you tomorrow back and forth.
What year do you think Skynet will go online.
[30:59] 2015 alright want you tomorrow.
[31:03] Alright so you guys are thinking it’s her that makes me feel better.
[31:06] A lot better.
[31:08] And a lot of people are like it’s already online so I’m thankful that you guys are thinking that that’s good for you what’s the last good book you read.
[31:20] Are your favorite book you read last year.
[31:26] Tipping Point by Malcolm Gladwell.
[31:28] Yeah big fan of Malcolm okay what about you why.
[31:30] Victory is a paper it’s okay so for polka polka frog it was reading about.
[31:43] How people can reminder themselves when they can how to remind people to do the best and it is a very small article but have impact.
[31:54] Really powerful okay cool what genre of music are you currently listening to metal under that’s all.
[32:05] Gosh me too.
[32:11] Data scientist go hard alright so what is your favorite piece of technology that’s kind of useless.
So something that just get its technology but it’s silly you don’t you know you don’t love it but it’s do you like it.
[32:28] It’s silly but I.
[32:29] Yeah it’s like a little piece of technology so the common answers like you’re like the Apple watch like it it’s it’s that useless if it’s actually had a negative impact on my life.
[32:40] I think it’s a good David pineapple Apple watch or something that I don’t know why maybe I like it. It’s really beautiful but I think it was.
[32:50] Yeah that’s right way what about you any any technology that’s making your life worse.
[32:55] Rachel is watching ladies a digital clock that I told you I can broke ass and tell but it’s a Google home just taking over the whole nine yards.
[33:05] Google I got it okay what is your biggest personal Money Pit right now what are you spending all your money on.
[33:14] My youngest son he have autism.
[33:17] Okay I thought that’s definitely can be expensive understand that with you.
[33:22] And my kids and my wife and my family they drain all my money.
[33:24] That’s good that’s good things to spend money on you get it you could be doing really bad thing so you listen to Hard Rock and you do nice things that’s good are you going anywhere interesting soon.
[33:35] I’m going to go to Europe Portugal.
[33:40] Portugal nice very cool how about you anywhere cool.
[33:44] I intended to are taking my son to put your pain he is a big fan over Japan so we have a my teacher due to the travel so he would like to see that.
The architecture he would like to Orlando Japanese and that he would like to in case you always such a penny song animation.
[34:04] Oh yeah very cool house in Japan a few months ago on it took my kids with me and it was an incredible to me what a beautiful country okay what show are you currently binging on where have you been done recently.
[34:18] Define the Benji.
[34:19] So what’s your what’s your favorite TV show you’re watching right now.
[34:26] I did not watch TV often.
[34:27] That’s a good answer that’s it that’s a good answer what about you tomorrow everything.
[34:31] I like to see the talk shows and I like that you can Jimmy come out and this kind of thinking.
[34:36] Okay I’m late night shows very cool.
Will gas thank you so much for being on it’s it’s interesting as a said to hear from from folks that are Dell EMC guys that aren’t Hardware sellers that are doing interesting things to solve interesting data science problems for a customer so thank you so much. I hope you enjoy the rest of strata conference.
[34:53] Thank You song Thank you so much.