University of Washington and Microsoft Research collaborate on (yet another) mind-blowing 3D photo viewer

If you think you’ve seen what’s possible with Photosynth, then you’ve seen nothing yet. The collaborative research team from the University of Washington and Microsoft Research which, only two years ago in 2006, published the paper “Photo Tourism” and the technology demonstration “Photosynth” has again pushed the boundaries of what can be achieved by intuitively processing the abundance of digital images shared on the web.

This week at SIGGRAPH 2008 they’re sharing with the world some even better technology they’ve been working on, which they call “Finding Paths through the World’s Photos”. Don’t let the name fool you, it’s damn cool. If, like me, you’re not much of a reader, take a look at this video demonstration. (Watch it to the end.)

This technology is much better than Photosynth simply because instead of just presenting individual photographs in a cool 3D environment, it actually manipulates the photos to give you a seamless and more lifelike experience. It’s one thing to click around different photos taken at a particular museum; it’s a whole other story to “walk through” the museum.

Now if you want to know exactly how they did it, and you’re a rocket scientist, take a look at their conference paper. For the rest of us, just take it for granted.
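For the mildly curious who won’t read the paper: the system builds on structure from motion, which matches features across photos, estimates where each camera was, and triangulates the matched points into 3D. Here’s a minimal NumPy sketch of just the triangulation step, using toy cameras and a made-up point (an illustration of the general technique, not the authors’ code):

```python
import numpy as np

def triangulate(P1, P2, x1, x2):
    """Linear (DLT) triangulation: recover a 3D point from its
    projections x1, x2 in two cameras with 3x4 matrices P1, P2."""
    A = np.array([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    # The homogeneous 3D point is the null vector of A.
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]
    return X[:3] / X[3]  # dehomogenize

# Two toy cameras: one at the origin, one shifted 1 unit along x.
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])

# Project a known point into both cameras, then recover it.
X_true = np.array([0.5, 0.2, 4.0])
x1 = P1 @ np.append(X_true, 1.0); x1 = x1[:2] / x1[2]
x2 = P2 @ np.append(X_true, 1.0); x2 = x2[:2] / x2[2]

X_est = triangulate(P1, P2, x1, x2)
print(np.round(X_est, 3))  # recovers the original point (0.5, 0.2, 4.0)
```

The real system does this for thousands of matched feature points across hundreds of uncontrolled photos, with the camera matrices themselves estimated jointly in a big optimization.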

72 insightful thoughts

  1. MS tech in photography is now the best around, judging just from what they presented at the Photo Pro event. It is truly insane how much photo tech they’ve got now.

  2. Put enough of these frames together and you could have a 360-degree 3D movie, sans glasses. Speaking of Microsoft Research projects: go to the MS Research project page and pick up their super-cool astronomy project.

  3. Wow… great implications for slideshows and movie VFX. I would love to use it for a scene showing time progression while the object is being rotated.

  4. Sounds, and looks, cool :D
    I’d just prefer that they show multiple photos so the entire view is filled with them. It’s just a bit too slow to feel like really walking through the place.

  5. Pingback: www.vistablog.at
  6. Now if only they could use enough of these pictures to eliminate the people in them; assuming high enough quality, we could get the best 3D models from just pictures… I think I sound random, but the possibilities of this technology are almost endless! I would like to see this implemented in Live Maps very soon!

  7. Wonderful, wonderful stuff.

    I’m guessing that, as someone else mentioned, this amazing and wondrous technology will never actually be used anywhere, but will fill blog posts with wonderment.

    That makes me sad :( Why won’t Microsoft actually let people use this stuff?

  8. This is great… but

    is it scalable? What I mean is, I haven’t seen anything new coming out of Photosynth. There are so many points of interest around the world, but they are still not ‘photosynthed’.

  9. I was wondering when they’d do this… they mentioned a technology like this when they launched Photosynth.

    The question I want answered is why display photos at all? Why not just make a fully textured 3D object?

  10. @fred: It’s pretty evident that they are deriving 3D data from the photos, and clear they are working on presenting that 3D data. Perhaps one problem is isolating the object in question from the background.

    @himanshu: If you try the Photosynth demo, you’ll see that it is a viewer only. The viewer works pretty well, but only on images that have already been analyzed. Microsoft hasn’t released the software that analyzes the images, finds common keypoints, and assembles them into some kind of 3D data set, and it’s not clear if they ever will.

    @chustar: Flickr is owned by Yahoo. Microsoft was trying to buy Yahoo recently, but the deal fell through.

  11. @Fred: that’s the point of the algorithm, picking out key features and mapping them into 3D space. It’s not a complete model, but with more data (more, better pictures) it could be.

    @dexotaku: I disagree, Microsoft funds a lot of research; and after the manipulative behavior they used to corner the market, and the damage they’ve done to software development’s progress for the last 30 years, it’s really the least they can do.

  12. I can’t help but wonder what an organization like the NSA or CIA could do with technology like this. Imagine including satellite data, IR data, Landsat data, LIDAR, blueprints, etc., all on a large multi-touch screen! That would be fun.

  13. The computation required to correlate images is high, but not too high for multi-core desktop computers, never mind the computers of 2013. One strong implication of this technology is that it can be applied – with enough correlating input – towards facial recognition, or to combine multiple images of a given car model, or similar-looking dogs… all kinds of applications as a heuristic algorithm useful to streamlining image analysis of all kinds. Implications for intelligence gathering are clear, and I shudder to think what will happen when this technology is combined with porn.

  14. @steve: Actually, the software to bundle the images is available… not only that, but the source code is available too.

    http://phototour.cs.washington.edu/bundler/

    That is the tool that calculates the 3D data points. Given that it actually comes from the Phototour – University of Washington side of the aisle, I don’t know whether it produces data that can be fed directly to Photosynth or not.

  15. You know that the military and Homeland Security are going to eat this shit up. Imagine this with video, and using all the cameras that are being installed everywhere. Panopticon.

  16. Well, it seems interesting but not really mind-blowing to me. I would like to see a view with complete photo views; that white fuzzy background is horrible. Security is going to be a huge issue too, as groups of people could utilize this for the wrong reasons… hell, you can already walk the streets in Google Maps…

    I wish there were a way to enable much cleaner transitions. It looks as though they had some good ideas on removing that strobing effect from night and day pics being together, but it didn’t really seem to do much other than removing all the night pics and leaving day ones in their place, creating a less animated effect…

    I dunno, maybe I’m stuck on older concepts like PhotoTriage, Phodeo and TimeQuilt for browsing my personal photo collection, not something mashed together with someone else’s pics as well…

    i would still like to check it out though, cause like i said, it does look interesting!!

    peas
    cityboy

  17. “Fred: that’s the point of the algorithm, picking out key features and mapping them into 3D space. It’s not a complete model, but with more data (more, better pictures) it could be”

    …which is why I don’t get mrmckeb’s question. Why not map the photos to a fully textured 3D object? This -is- the best 3D object they can get from these pictures.

  18. It would be complicated, Fred/Frank, but I can’t see why they couldn’t work out the object’s 3D parameters from a large collection of photos of it… they seem to be almost there as it is.

  19. I honestly think that Google’s freeware program SketchUp is much more useful for viewing real 3D objects. With the ability to create complex 3D shapes quickly and easily using a diverse range of extruding options, people have made accurate scale models of ships, buildings, city streets, and even animals. I’ve personally made accurate 3D models of some of my products, as well as a few real buildings – buildings that look almost identical when I walk into them.

    This program uses a lot of creative technologies that make 3D imaging easy, but it’s not useful 3D imaging. Google SketchUp, Rhino 3D, AutoCAD, SolidWorks, and other Computer Aided Drafting programs allow any degree of detail to exist, and the context of the image (which is actually an environment consisting of an infinite number of images) is far more valuable for any professional application.

  20. That’s cool, but wasn’t QuickTime VR doing stuff like that 10 years ago?

    Maybe not that well, but it seems like it was going down that path.

  21. No, Matt – QuickTime VR is used to view a panorama (or object) composed of photos taken under very carefully controlled conditions.

    This stuff takes a random collection of pictures of a scene, recovers the 3D geometry of the scene from all of them, and then from that geometry can place where each picture was taken. Then you can do panoramas or flybys of objects using the original, uncontrolled shots.

    It’s really awesome, and Steve and his team have had two (or maybe three) consecutive best-of-SIGGRAPH papers based on this work, which is unprecedented.
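    To unpack “place where the pictures were taken”: once a 3×4 projection matrix P has been estimated for a photo, the camera’s position falls out as the null vector of P (the one point in space with no image). A toy NumPy illustration of that step, with a made-up camera rather than anything from the paper:

    ```python
    import numpy as np

    def camera_center(P):
        """The camera center C is the right null vector of the 3x4
        projection matrix P, i.e. P @ [C, 1] = 0."""
        _, _, Vt = np.linalg.svd(P)
        C = Vt[-1]           # singular vector for the zero singular value
        return C[:3] / C[3]  # dehomogenize

    # Toy camera: rotated 90 degrees about the y axis, then translated.
    R = np.array([[ 0.0, 0.0, 1.0],
                  [ 0.0, 1.0, 0.0],
                  [-1.0, 0.0, 0.0]])
    t = np.array([[0.0], [0.0], [5.0]])
    P = np.hstack([R, t])    # P = [R | t], so the center is -R.T @ t

    print(camera_center(P))  # camera sits at (5, 0, 0)
    ```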

  22. Wow… cool technology. They need to merge this with GIS, particularly the work on Microsoft Virtual Earth or ESRI-based mapping systems.

    Maybe this is the foundation for next-gen video games?

    Cheers
    Steve

  23. Nice, but is this actual technology… or a presentation of what’s possible? More vaporware? Please release a demo rather than just spreading the word that there’s a good idea! Because these are very old ideas!

  24. Also, Jay, you pretty much have no idea how university-level graduate computer science research works. People don’t get PhDs just for coming up with an idea; they have to implement it as well. Everything you see in that video is the result of a computer algorithm running on a PC; it’s not a PowerPoint presentation or something like that. Furthermore, the predecessor to this, Photosynth, IS actually available as a demo program from Microsoft Labs.

  25. Thank you for the reality check, Joe, and thank you for your patience. Alas, you are correct; I am not a CS researcher. Is there a URL from which I can download this product?

  26. It only lets me download if I have “Windows XP SP2 and Windows Vista” … I’ll see if I can find a Windows machine.

  27. You also need a pretty recent video card… since Microsoft co-sponsors this research, I wouldn’t hold my breath for a Linux or Mac port…

  28. Thank you, Joe. I’m disappointed, but understand. As a non-technical user, I’m hoping for cross-platform applications … Again, your patience is appreciated.

  29. BTW, I used to work on videos about research at MIT’s Media Lab, and I’m as excited as the next gal about research. I’ve seen a lot of exciting demos and code over the years. Now, however, my primary interest is how intellectual property is transferred, over years, to the consumer market, and how it can change the world.

  30. @Some CADD geek: Comparing this to SketchUp or other 3D modeling tools is pointless. It’s not about 3D modeling; it’s about displaying a large image library so that you can navigate through it and tell where each picture was taken.

  31. I just wanted to point out that the path-finding work has finally begun to be integrated into Photosynth.

    Since April 21, 2009, when you use the Highlights list (on the right-hand side of any photosynth whose author has added highlights), you are flown along a path calculated through the photos between your current position in the synth and the highlight you just selected, just as shown in the Photo Tourism work above.

    Also of note: this new feature coincides with the Photosynth viewer being rewritten in Silverlight 2.0, which means that all synths should be viewable on all Intel Macs. If all goes well with Moonlight, Linux users will be able to view synths by September 2009.
