This video is a review of the Apple Vision Pro, presented by Marques Brownlee. He discusses the device's immersiveness, display quality, pass-through capabilities, ecosystem integration, app selection, comfort issues, and the "eyes on the outside" feature. Brownlee explores both the strengths and weaknesses of the first-generation product, considering its potential future impact and whether it's a worthwhile purchase.
Okay, here is the transcript with timestamps for each segment:
0:00:00:000 - 0:00:03:679 - [Music] 0:00:00:320 - 0:00:05:080 - I actually love this thing I love this thing 0:00:03:679 - 0:00:07:720 - not because it's Flawless or anything it is far from Flawless but 0:00:05:080 - 0:00:10:559 - because it's actually interesting like 0:00:07:720 - 0:00:13:040 - don't forget the last two three years of 0:00:10:559 - 0:00:15:920 - Apple product review comments are just 0:00:13:040 - 0:00:17:680 - um that's boring oh it's just a spec 0:00:15:920 - 0:00:19:520 - bump there's nothing really new here oh 0:00:17:680 - 0:00:21:160 - they hardly change anything or try 0:00:19:520 - 0:00:24:400 - anything new these days but this this 0:00:21:160 - 0:00:27:760 - thing is interesting it's risky and most 0:00:24:400 - 0:00:30:720 - of all it's new now it's actually not 0:00:27:760 - 0:00:33:360 - fundamentally new it's a VR headset but 0:00:30:720 - 0:00:35:000 - it's new for apple and there are a bunch 0:00:33:360 - 0:00:37:760 - of things in here that are new in a way 0:00:35:000 - 0:00:40:320 - that only Apple would try and just as 0:00:37:760 - 0:00:43:320 - interesting as this individual product 0:00:40:320 - 0:00:47:239 - is the possible future that this implies 0:00:43:320 - 0:00:49:000 - like when you get a first generation 0:00:47:239 - 0:00:50:160 - product like this you sort of 0:00:49:000 - 0:00:52:160 - automatically assume that there are 0:00:50:160 - 0:00:54:640 - goals for its future that it'll have 0:00:52:160 - 0:00:56:440 - another generation and another one after 0:00:54:640 - 0:00:58:120 - that and that there is some goal for 0:00:56:440 - 0:01:00:160 - what this will turn into 10 years down 0:00:58:120 - 0:01:02:359 - the road because we saw what happened 0:01:00:160 - 0:01:05:479 - with the iPhone and the mac and the iPad 0:01:02:359 - 0:01:07:280 - and all sorts of other first generation 0:01:05:479 - 0:01:09:680 - products and on top of all of that as 0:01:07:280 - 0:01:12:520 - far as I know Apple has never released 0:01:09:680 - 0:01:14:840 - any other first generation product with 0:01:12:520 - 0:01:17:759 - the word Pro already in the name which 0:01:14:840 - 0:01:19:280 - comes with a whole another set of 0:01:17:759 - 0:01:22:840 - implications so is the world ready for 0:01:19:280 - 0:01:23:840 - all of 0:01:22:840 - 0:01:27:760 - this let's get into 0:01:24:400 - 0:01:28:840 - it 0:01:28:840 - 0:01:31:690 - [Music] 0:01:31:690 - 0:01:33:840 - [Applause] 0:01:33:840 - 0:01:39:159 - so I might be one of the 20 people 0:01:36:690 - 0:01:41:240 - outside of Apple who has been using the 0:01:39:159 - 0:01:43:200 - Vision Pro the most over the past two 0:01:41:240 - 0:01:45:719 - weeks like I've spent hours in this 0:01:43:200 - 0:01:47:640 - thing with both bands with multiple Macs 0:01:45:719 - 0:01:50:880 - in different setups different rooms 0:01:47:640 - 0:01:52:719 - indoors and Outdoors lightness and 0:01:50:880 - 0:01:55:200 - darkness there are parts of this thing 0:01:52:719 - 0:01:57:920 - that are absolutely amazing unparalleled 0:01:55:200 - 0:02:00:600 - best I've ever seen but the reason it's 0:01:57:920 - 0:02:02:880 - so interesting is because it's actually 0:02:00:600 - 0:02:06:000 - new and there are downfalls and flaws 0:02:02:880 - 0:02:09:200 - and tradeoffs that come alongside all of 0:02:06:000 - 0:02:11:339 - this stuff so at the end of the last 0:02:09:200 - 0:02:13:480 - video I gave you guys a sort of a 0:02:11:339 - 0:02:14:519 - preview of my pros and cons list if you 0:02:13:480 - 0:02:17:379 - haven't already watched that video it is 0:02:14:519 - 0:02:19:359 - definitely worth watching almost like a 0:02:17:379 - 0:02:21:439 - prequel to this one it is a 30 minute 0:02:19:359 - 0:02:23:520 - monster all about how to use this thing 0:02:21:439 - 0:02:25:480 - how it works what's inside what it's 0:02:23:520 - 0:02:28:000 - capable of and then at the end I got to 0:02:25:480 - 0:02:30:159 - my upsides which are immersiveness 0:02:28:000 - 0:02:31:519 - placement and space eye tracking and 0:02:30:159 - 0:02:34:840 - hand control pass through ecosystem and 0:02:31:519 - 0:02:38:519 - spatial audio and the downsides which 0:02:34:840 - 0:02:41:160 - are weight and comfort the eyes on the 0:02:38:519 - 0:02:43:519 - outside app selection right now battery 0:02:41:160 - 0:02:46:640 - life and price so okay for starters I 0:02:43:519 - 0:02:49:599 - want to amend immersiveness to Fidelity 0:02:46:640 - 0:02:53:320 - I think that's more accurate here I have 0:02:49:599 - 0:02:55:400 - used a bunch of different VR headsets 0:02:53:320 - 0:02:57:759 - now and this Vision Pro has has the 0:02:55:400 - 0:03:00:360 - sharpest best looking micro OLED display 0:02:57:759 - 0:03:04:599 - out of all of them the size of 0:03:00:360 - 0:03:04:980 - individual pixels on these displays is 7 0:03:04:599 - 0:03:09:480 - 1 12 microns which means you could fit 0:03:04:980 - 0:03:09:400 - 64 of them in the size of a single 0:03:09:480 - 0:03:12:159 - iPhone screens pixel you can't see 0:03:09:400 - 0:03:14:040 - individual pixels there's no screen door 0:03:12:159 - 0:03:16:440 - effect it's awesome the native refresh 0:03:14:040 - 0:03:19:000 - rate is 90 HZ and it will crank up to 96 0:03:16:440 - 0:03:20:399 - HZ when there's 24 FPS content playing 0:03:19:000 - 0:03:21:800 - to be an even multiple and apple says 0:03:20:399 - 0:03:23:959 - that they calibrate every single one of 0:03:21:800 - 0:03:26:080 - these Vision Pro displays 0:03:23:959 - 0:03:28:599 - from the factory for maximum color 0:03:26:080 - 0:03:28:520 - accuracy they're really good and this is 0:03:28:520 - 0:03:32:120 - a big reason why this headset is so 0:03:30:240 - 0:03:34:840 - expensive but then and this is going to 0:03:32:120 - 0:03:37:760 - be a recurring theme Here The Vision Pro 0:03:34:840 - 0:03:39:200 - runs up against the technology of today 0:03:37:760 - 0:03:42:239 - not being quite Advanced enough to 0:03:39:200 - 0:03:45:000 - accomplish what they were probably 0:03:42:239 - 0:03:48:599 - hoping as ideal so in the case of these 0:03:45:000 - 0:03:48:280 - screens right they're amazing there are 0:03:48:280 - 0:03:51:200 - so many pixels but because there's so 0:03:48:599 - 0:03:53:280 - many pixels the computer inside cannot 0:03:51:200 - 0:03:55:400 - actually render everything in high 0:03:53:280 - 0:03:58:000 - resolution all the time at 90 HZ so 0:03:55:400 - 0:03:58:000 - instead it does something clever it 0:03:58:000 - 0:04:00:479 - combines the insanely fast eye tracking 0:03:58:000 - 0:04:04:599 - with what's called fiated rendering 0:04:00:479 - 0:04:04:520 - meaning it's only actually rendering in 0:04:04:599 - 0:04:09:159 - high resolution exactly what you're 0:04:04:520 - 0:04:09:200 - looking at when you're looking at it the 0:04:09:159 - 0:04:13:360 - rest is soft and fuzzy that actually 0:04:09:200 - 0:04:13:360 - works really well because that's exactly 0:04:13:360 - 0:04:18:079 - how our eyes work it's really clever 0:04:13:360 - 0:04:18:000 - like you don't think about it but the 0:04:18:079 - 0:04:20:400 - thing that you're looking at at the 0:04:18:000 - 0:04:22:160 - moment is sharp but then the rest of 0:04:20:400 - 0:04:25:040 - your peripheral vision is is soft and 0:04:22:160 - 0:04:27:040 - fuzzy and that's fine so really now all 0:04:25:040 - 0:04:27:480 - the Computing work is being done to 0:04:27:040 - 0:04:30:400 - track your eyes as fast as possible so 0:04:27:480 - 0:04:32:360 - that there's no lag between when you 0:04:30:400 - 0:04:34:200 - look at something and when it becomes 0:04:32:360 - 0:04:34:799 - sharp fun fact you can actually see this 0:04:34:200 - 0:04:37:120 - in screen recordings from The Vision Pro 0:04:34:799 - 0:04:39:919 - you can see the piece of the screen that 0:04:37:120 - 0:04:42:680 - I'm looking at is sharp and then 0:04:39:919 - 0:04:45:200 - everything else around it even parts of 0:04:42:680 - 0:04:46:800 - the same window are fuzzy on purpose but 0:04:45:200 - 0:04:49:320 - to my eye that looks totally natural 0:04:46:800 - 0:04:53:000 - because I'm focusing on one thing at a 0:04:49:320 - 0:04:55:320 - time I found that you can also screen 0:04:53:000 - 0:04:55:280 - record with developer mode in xcode and 0:04:55:280 - 0:04:58:400 - that will make the clips 4K and it'll 0:04:55:280 - 0:04:58:400 - render everything in HQ all at once but 0:04:58:400 - 0:05:00:360 - every time I did that it would be choppy 0:04:58:400 - 0:05:02:320 - and scrolling would be slow and jittery 0:05:00:360 - 0:05:04:400 - and I'm thinking that's just because the 0:05:02:320 - 0:05:07:320 - computers aren't really used to 0:05:04:400 - 0:05:07:080 - rendering everything in high quality all 0:05:07:320 - 0:05:09:560 - the time so it looks like a higher 0:05:07:080 - 0:05:10:880 - quality recording but the second I did 0:05:09:560 - 0:05:12:560 - any scrolling it didn't look as good so 0:05:10:880 - 0:05:15:360 - I just didn't use those recordings as 0:05:12:560 - 0:05:15:360 - often so the screens are great the 0:05:15:360 - 0:05:18:840 - position tracking of objects in space 0:05:15:360 - 0:05:20:800 - are great the eye tracking is incredibly 0:05:18:840 - 0:05:23:160 - good the one ding against immersion on 0:05:20:800 - 0:05:25:400 - the Vision Pro though and not a lot of 0:05:23:160 - 0:05:28:000 - people are talking about this but it's 0:05:25:400 - 0:05:28:719 - the field of view see the first few 0:05:28:000 - 0:05:31:360 - times you use this headset you don't 0:05:28:719 - 0:05:33:560 - even really think about it that much 0:05:31:360 - 0:05:35:400 - you're so distracted by all the fun and 0:05:33:560 - 0:05:37:080 - the newness and how cool it is that your 0:05:35:400 - 0:05:38:960 - eyes are controlling the thing but 0:05:37:080 - 0:05:40:560 - eventually you start to poke around the 0:05:38:960 - 0:05:42:800 - edges and it turns out you know how 0:05:40:560 - 0:05:44:639 - people are saying this kind of looks 0:05:42:800 - 0:05:46:639 - like ski goggles from the outside well 0:05:44:639 - 0:05:49:160 - it also kind of looks like ski goggles 0:05:46:639 - 0:05:49:919 - from the inside a little bit too again 0:05:49:160 - 0:05:53:360 - the middle is super sharp and Incredibly 0:05:49:919 - 0:05:56:080 - impressive iive but if I can do my best 0:05:53:360 - 0:05:56:080 - here through a YouTube video the edges 0:05:56:080 - 0:05:59:360 - of the headset are a little bit further 0:05:56:080 - 0:06:00:160 - in than the edges of your vision and so 0:05:59:360 - 0:06:03:160 - there's a little bit of like a cone 0:06:00:160 - 0:06:07:000 - effect going on and there's some 0:06:03:160 - 0:06:07:000 - chromatic aberration around the outside 0:06:07:000 - 0:06:10:320 - so you kind of have this slight feeling 0:06:07:000 - 0:06:11:360 - of looking into a large tunnel at 0:06:10:320 - 0:06:13:880 - everything there are actually no field 0:06:11:360 - 0:06:16:000 - of view numbers published by Apple 0:06:13:880 - 0:06:17:480 - anywhere about Vision Pro as far as I 0:06:16:000 - 0:06:17:880 - can tell and I kind of think that's on 0:06:17:480 - 0:06:20:520 - purpose because I have noticed from 0:06:17:880 - 0:06:22:439 - using them both that the quest 3 has a 0:06:20:520 - 0:06:24:000 - better wider field of view just looking 0:06:22:439 - 0:06:26:519 - inside the headset so if I could change 0:06:24:000 - 0:06:28:880 - one thing about the Vision Pro to make 0:06:26:519 - 0:06:28:880 - it more immersive it would be a wider 0:06:28:880 - 0:06:31:360 - field of view no 0:06:28:880 - 0:06:31:360 - question Vision Pro has the best pass 0:06:31:360 - 0:06:35:120 - through of any headset I've ever used 0:06:31:360 - 0:06:38:000 - that much is super clear to me and 0:06:35:120 - 0:06:38:800 - weirdly enough this doesn't actually 0:06:38:000 - 0:06:40:479 - surprise me either maybe because this is 0:06:38:800 - 0:06:42:639 - one of the products that makes it so 0:06:40:479 - 0:06:45:000 - obvious that they're thinking a lot 0:06:42:639 - 0:06:47:680 - about the future like apple talks a lot 0:06:45:000 - 0:06:47:680 - about AR and how they want things to 0:06:47:680 - 0:06:52:200 - just be clear and just overlaying things 0:06:52:200 - 0:06:55:000 - onto your real world but with today's 0:06:52:200 - 0:06:57:160 - technology again that's not quite 0:06:55:000 - 0:06:59:240 - possible yet so instead they have a VR 0:06:57:160 - 0:07:02:680 - headset but they are using the highest 0:06:59:240 - 0:07:02:680 - quality camera feeds possible and the 0:07:02:680 - 0:07:07:960 - highest quality displays on the inside 0:07:02:680 - 0:07:07:960 - possible to let you almost feel like 0:07:07:960 - 0:07:11:960 - you're looking through it at the real 0:07:07:960 - 0:07:11:960 - world so you put this headset on and the 0:07:11:960 - 0:07:15:199 - first thing you see is pass through I 0:07:11:960 - 0:07:15:199 - mean you might as well call it 0:07:15:199 - 0:07:17:319 - transparency mode and the sharpness and 0:07:15:199 - 0:07:17:319 - the colors and the very low lat are all 0:07:17:319 - 0:07:20:120 - so good that I really don't experience 0:07:17:319 - 0:07:20:120 - any eye fatigue no matter how long I am 0:07:20:120 - 0:07:25:560 - in this pass through mode despite my 0:07:20:120 - 0:07:25:560 - eyes being inches from these screens I 0:07:25:560 - 0:07:27:759 - can interact with the real world around 0:07:25:560 - 0:07:27:759 - me pick things up and look at them I can 0:07:27:759 - 0:07:30:560 - walk around between rooms and not trip 0:07:27:759 - 0:07:30:560 - on things I tried having people throw 0:07:30:560 - 0:07:33:560 - things at me and I could just catch them 0:07:30:560 - 0:07:33:560 - I played table tennis successfully with 0:07:33:560 - 0:07:37:120 - the headset on which is crazy if you 0:07:33:560 - 0:07:37:120 - think about what's actually happening 0:07:37:120 - 0:07:40:480 - here the total latency Apple says is 12 0:07:37:120 - 0:07:40:480 - milliseconds that's from the outside 0:07:40:480 - 0:07:42:960 - light hitting the outside sensors to the 0:07:40:480 - 0:07:42:960 - inside image being updated and hitting 0:07:42:960 - 0:07:45:280 - your eyeballs that's incredibly fast 0:07:42:960 - 0:07:45:280 - that's and that includes the exposure 0:07:45:280 - 0:07:48:120 - time of the cameras that's the specially 0:07:45:280 - 0:07:48:120 - designed R1 chip at work but as Nei from 0:07:48:120 - 0:07:52:680 - The Verge has put it it's still cameras 0:07:48:120 - 0:07:52:680 - and screens like the technology of today 0:07:52:680 - 0:07:57:080 - isn't Magic so you still have to expose 0:07:52:680 - 0:07:57:080 - a camera sensor and set ISO and shutter 0:07:57:080 - 0:08:00:360 - speed Etc and you can kind of play 0:07:57:080 - 0:08:00:360 - around with this a bit just by looking 0:08:00:360 - 0:08:02:720 - around like at bright objects or high 0:08:00:360 - 0:08:02:720 - dynamic range environments and you know 0:08:02:720 - 0:08:04:880 - what for the variety of situations I've 0:08:02:720 - 0:08:04:880 - thrown at this thing it's handled it 0:08:04:880 - 0:08:08:800 - very impressively the whole time mostly 0:08:04:880 - 0:08:08:800 - prioritizing smoothness and high shutter 0:08:08:800 - 0:08:10:920 - speeds at the expense of cranking up the 0:08:08:800 - 0:08:10:920 - ISO and getting way more noise 0:08:10:920 - 0:08:12:800 - especially in Darker environments but 0:08:10:920 - 0:08:12:800 - you can still see stuff like the hand 0:08:12:800 - 0:08:15:360 - occlusion break sometimes or look really 0:08:12:800 - 0:08:15:360 - janky when you put your hand in front of 0:08:15:360 - 0:08:18:800 - something you can still see objects 0:08:15:360 - 0:08:18:800 - start to float a little bit more an X Y 0:08:18:800 - 0:08:20:880 - and Z space