Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

chromite

macrumors regular
Original poster
Jul 6, 2013
187
657
AI is the future and Apple is severely lacking in this department. There’s no point in digital glasses if there’s no information to display

I don’t want to wait 5 seconds for Siri to tell me it can’t read the barcode on the cereal I’m buying
 
AI isn't in the Quest 2 and that seems to be doing fine

What the hell would you even use AI with in the Vision Pro? It's a spatial computer designed primarily to display apps around you, not be some kind of Iron Man hud.
 
AI is not in many products... most if not all they call in AI is actually ML (Machine learning). There is no real intelligence to it, not even that of a cockroach.
 
Also Apple would NEVER say AI anyway and Apple is always a year behind before it joins the race with a superior product (just like with Vision Pro's). Likely/hopefully next year we will see Siri improved with "real time spatial processing" and "interactive predictability" and "involved human communication".

Now back to how Siri presently is:
Me: Hey Siri It's dark so turn on the lights.
Siri: Playing Turn on the Lights by the rap group Future.
Me: Hey Siri stop playing music and turn on the lights.
Siri: (proudly boast) I found this on the web.
 
Last edited:
what are you talking about?
Apple loves to trademark their own things under their own names....faceID, touchID etc...AI in their world is under the "Machine learning" trademark category
This topic can be closed
 
Last edited:
what are you talking about?
Apple loves to trademark their own things under their own names....faceID, touchID etc...AI in their world is under the "Machine learning" trademark category
This topic can be closed
Machine learning is not an Apple term
In any capacity. It was around long long long before Apple started using it
 
  • Like
Reactions: Janichsan
what are you talking about?
Apple loves to trademark their own things under their own names....faceID, touchID etc...AI in their world is under the "Machine learning" trademark category
This topic can be closed
Machine Learning (ML) is actually the correct term for what most of the public companies term as Artificial Intelligence (AI). And as @Cloudyskies22 pointed out - it is not an Apple term. Machine Learning is actually a subset of Artificial Intelligence, but tends to be limited to just recognising patterns from data. When you build ML models, there are both assisted and unassisted learning to generate a model that is then used to basically categorize or recognize what the data provides (example of assisted which is more common - you have a human sit there and they see a picture for example and they tell the computer it is a cat, from all that human decisions - the computer builds an ML Model). Driverless car systems are basically the same, humans are telling the cars what things are and it builds models from it. Artificial Intelligence goes beyond that to analysing and contextualizing observed data and make decisions based on that.

I don't know any company that does not like (need) to trademark stuff - in fact, they would be stupid if they did not. Apple has thousands, so does Microsoft and Google.
 
Last edited:
  • Like
Reactions: Tagbert
AI is the future and Apple is severely lacking in this department. There’s no point in digital glasses if there’s no information to display

I don’t want to wait 5 seconds for Siri to tell me it can’t read the barcode on the cereal I’m buying
Seems like AI is already starting to become quite a dirty word, and I suspect a lot of the "Apple is doomed because they can't do AI properly" claims will soon begin to age pretty poorly.
 
  • Like
Reactions: Tagbert
Machine Learning (ML) is actually the correct term for what most of the public companies term as Artificial Intelligence (AI). And as @Cloudyskies22 pointed out - it is not an Apple term. Machine Learning is actually a subset of Artificial Intelligence, but tends to be limited to just recognising patterns from data. When you build ML models, there are both assisted and unassisted learning to generate a model that is then used to basically categorize or recognize what the data provides (example of assisted which is more common - you have a human sit there and they see a picture for example and they tell the computer it is a cat, from all that human decisions - the computer builds an ML Model). Driverless car systems are basically the same, humans are telling the cars what things are and it builds models from it. Artificial Intelligence goes beyond that to analysing and contextualizing observed data and make decisions based on that.

I don't know any company that does not like (need) to trademark stuff - in fact, they would be stupid if they did not. Apple has thousands, so does Microsoft and Google.

Good points on the product.

However, the discussion here (not just in your post) is not quite exact in terms of the various concepts and relations in the milieu of AI/ML. I don't really want to ruffle feathers here, but as a long time researcher in this field, I just want to help paint a more accurate picture of what we do, and set the record straight.

AI (artificial intelligence) refers to the natural scientific and engineering disciplines that correspond to discovery and replication of human and other biological systems' abilities. AI is often discussed in terms of its various application sub-domains such as computer vision, NLP, robotics, etc. Those who work primarily in the various application doamins are probably behind those technologies from AI/ML that most people normally interact with. AI in its modern form (in about the last 15 years) primarily relies on various techniques in ML, but there are still certain applications where a more traditional types of non-ML techniques, such as formal logic (e.g. syntactic parsing in language) still play a major role.

ML (machine learning) is the theoretical (and for the most part mathematical) discipline that lies behind many of modern AI's capabilities. While the primary application of ML is in AI, there are a variety of other science and tech fields where ML also play some role, such as meteorology, agent based economics, simulation based physics, geostatistics, etc. In addition to mathematics, ML to a lesser degree also involves a certain amount of cognitive and neural science.

ML for the most part relies on, and is an outgrowth of three more fundamental types of mathematics:
  • frequentist and Bayesian statistics (kind of goes without saying that statistics and probability theory are important here),
  • numerical optimization (especially constrained convex optimization), which is critical any time when "training" of a model is involved
  • differential geometry (including linear algebra, which is a special case of diff geom) which is intimately involved in the basic representations within parametric space of ML models, any time you hear the reference to terms like "tensor" or "embedding" (meaning embedded manifold in ambient representation space) in AI/ML, it likely has to do with diff geom.
ML also involves to various degree other areas in mathematics; such as dynamical systems (such as in recurrent neural nets and certain types of reinforcement learning); stochastic processes (such as using stochastic differential equations in diffusion models, e.g. DALL-E that you might have heard of); computational semantics (such as embeddings used in data space representations); and a bunch more.

AI/ML is actually a very diverse set of things that can be sliced and diced in many different way. The most important ways to distinguish different forms of ML probably are:
  • the type of relation between the data and the model in the training procedure, sometimes referred to as the "setting", in other words how information from the data gets into the model. These include supervised, unsupervised, self-supervised, distributional filtering, normalizing flow, adversarial learning, reinforcement learning, guided diffusion, imitation learning, GA/ES (genetic algorithms evolutionary strategies), and a number of others.
  • the type of flow of information through the structure of the model during inference. The two best known types are discriminative and generative models, although these are just the two extremes, there are various others in between such as parametric ensemble.
  • the information density of the model within parametric space. Here again there are two very commonly seen types, which are point-estimate/frequentist, vs Bayesian/probablistically-coherent. But these are really just two within a larger continuum of options.
  • the form of representation within model/parametric-space. It would be either parametric, meaning that the number of numerical parameters of the model is pre-determined at the beginning of the training process; or would be non-parametric, meaning that the number of parameters can vary, and usually grow throughout the training process (so the model can vary structurally as it gains knowledge from data).
What you have referred to in "you have a human sit there and they see a picture for example and they tell the computer it is a cat, from all that human decisions - the computer builds an ML Model", is a very specific type of ML system that is
  • supervised (in how information from data is integrated into the model, in this case you mentioned this always treating human response as the gold standard)
  • purely discriminative (in inference mode the information flows straight from input through a sequence of transformations to output)
  • parametric, since the type of model and its structure is fully determined prior to training base on your description
  • and likely frequentist, base on your description of a single model and just general knowledge these type of vision tasks do not involve Baysian systems
While your description is very good and correct, it is only a very narrow slice of what AI/ML is. This specific type of system is the simplest, and possibly the most common type in the industry. So it is unsurprising that it gets brought up all the time. But a lot of people often do not realize that there are many other variants, some are also quite often used in real world scenarios.

The other example application, driverless cars (or planes, helicoptors, other verhicles for that matter), are usually a very different type of ML system. Some of the simpler types of self-driving cars, such as those that run on mostly fixed route or only drive on highways, generally are of the simpler types such as using a regulator setting (e.g. linear quadratic regulator / LQR) as the control system, and some convolutional/residual neural nets + a filtering setting (e.g. Kalman filter) for the perception and sensor integration. Those typically are very far from the full capabilities of a human driver/pilot. Those systems that are much closer to human ability in these applications typically has a reinforcement learning based control system; but that is not very close to being realistically deployable in real life by a long shot.

Again, I don't want to bore you guys, or get into a whole a lot of minutiae. But there are so many misconceptions and gross simplifications out there right now about AI/ML, that I just want to bring a bit more clarity, and to help inform folks in these discussions, here in this forum. I actually primarily do research in reinforcement learning, among various other forms of AI/ML, so would be happy to answer things, time permitting from my busy schedule.
 
Last edited:
Again, I don't want to bore you guys, or get into a whole a lot of minutiae. But there are so many misconceptions and gross simplifications out there right now about AI/ML,
This is so true. Unfortunately, the field is very complex, and hard to understand. I'm not sure I followed all the details of your explanation, though I greatly appreciate your efforts at bringing clarity to this discussion.

When I saw the WWDC presentation of Vision Pro, I thought it was chock full of AI/ML. Things like tracking your eyes to determine what you are looking at, scanning your eyes and displaying a representation of them on the front visor, creating a persona to display to other people in FaceTime, sensing when someone approaches you and letting them "break through" into your immersive environment -- all these things are based on AI/ML technologies, aren't they?

But some people think of AI as just language interaction -- things like saying "Siri, take me to Yankee Stadium," and it gives you directions to the stadium. I think Apple is in fact working on speech recognition, as reflected in features like Live Speech and Live Captions. But they haven't gotten around to improving Siri interactions. I suspect because the underlying technologies needed to improve that aren't quite there yet. If Apple wanted to let Siri send our spoken queries to ChatGPT and speak back the responses to us, it could. But the responses would include lots of inaccurate information, and wouldn't be real-time. If there is a big accident on the route to Yankee Stadium and you need to take an alternate route, ChatGPT won't know that.

Is Apple working on a language interaction model that solves those problems? Who knows? But I'm sure Tim Cook means it when he says he intends to be "thoughtful and deliberate" in how Apple uses AI/ML technology.
 
  • Like
Reactions: HardBall
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.