Edge AI for Beginners

nivter · 2025-10-13T01:08:08 1760317688

This is far from what I expected. There is not much related to quantization, pruning, common architectures, precision or benchmarking. For those interested in this topic, I would recommend content from MIT HAN Lab.

keyle · 2025-10-13T02:45:59 1760323559

Can you provide links or more information?

soumendrak · 2025-10-13T03:11:04 1760325064

May be this one: https://hanlab.mit.edu/courses/2024-fall-65940

btown · 2025-10-12T22:10:28 1760307028

It seems this is focused on on-device computation - as distinct from, say, Cloudflare's definition of the "edge" as a smart CDN with an ability to run arbitrary code and AI models in geographically distributed data centers (https://workers.cloudflare.com/).

Per Microsoft's definition in https://github.com/microsoft/edgeai-for-beginners/blob/main/...:

> EdgeAI represents a paradigm shift in artificial intelligence deployment, bringing AI capabilities directly to edge devices rather than relying solely on cloud-based processing. This approach enables AI models to run locally on devices with limited computational resources, providing real-time inference capabilities without requiring constant internet connectivity.

(This isn't necessarily just Microsoft's definition - https://www.redhat.com/en/topics/edge-computing/what-is-edge... from 2023 defines edge computing as on-device as well, and is cited in https://en.wikipedia.org/wiki/Edge_computing#cite_note-35)

I suppose that the definition "edge is anything except a central data center" is consistent between these two approaches, and there's overlap in needing reliable ways to deploy code to less-trusted/less-centrally-controlled environments... but it certainly muddies the techniques involved.

At this rate of term overloading, the next thing you know we'll be using the word "edgy" to describe teenagers or something...

bigger_cheese · 2025-10-13T02:44:04 1760323444

I work at an industrial plant, we use "edge" to refer to something inside the production network.

As an example the control system network is air-gapped so to use ML for instrument control or similar the model needs to run on some type of "edge" compute device inside the production network all of the inferencing would need to happen locally (i.e. not in the cloud).

pclmulqdq · 2025-10-12T22:13:18 1760307198

Yeah, Cloudflare is in the minority with their definition of "edge."

vlovich123 · 2025-10-12T23:11:26 1760310686

No, edge is just poorly defined. Plenty of companies call their servers “edge” because they’re collocated with ISPs. Even ISPs when they talk about edge compute aren’t talking about your laptop but about compute in their colo.

notatoad · 2025-10-13T01:08:47 1760317727

edge just means as close to the user as you can get.

microsoft's edge is closer to the user than cloudflare's edge or an ISP's edge because microsoft runs your laptop.

disqard · 2025-10-13T02:23:04 1760322184

Wow, they really do have an edge over the competition there...

echelon · 2025-10-12T22:59:03 1760309943

In GPU compute land, "edge" means on the consumer device. The latency of delivery is negligible in comparison to the wall clock compute demands, so it doesn't make much sense to park your GPUs near the consumer.

IoT is "edge".

The only place I've seen "edge" used otherwise is in delivery of large files, e.g. ISP-colocated video delivery.

globalnode · 2025-10-12T23:42:14 1760312534

micro-edge?, medge, wedge, xedge...

davnicwil · 2025-10-12T23:10:56 1760310656

maybe a decent definition could be compute as close to the user latency-wise as practically possible while having full access to the necessary data.

For certain things this will be able to go as far as the device if you're only ever operating on data the user fully owns, other things will need data centers still but just decentralised and closer to the user via fancier architectures ala the Cloudflare model.

rocauc · 2025-10-12T23:13:15 1760310795

One of the most common uses for edge AI not listed in this course is computer vision. You similarly want real-time inference for processing video. Another open source project that makes it easy to use SOTA vision models on the edge is inference: https://github.com/roboflow/inference

yalogin · 2025-10-12T21:50:56 1760305856

Isn’t edge AI just a way to deploy AI to meet product requirements? What is special about this course? Is Microsoft trying to sell this as a service? If so what is the revenue model and hardware used?

jbrooks84 · 2025-10-13T01:30:57 1760319057

This was made by AI

alansaber · 2025-10-12T21:09:03 1760303343

Always cool to see SLM support from a big company, albeit for inference

fishmicrowaver · 2025-10-12T21:54:31 1760306071

MS GitHub seems to be featuring a lot of beginners courses all at the same time. Wonder if they're just pumping them out with AI at this point.

geraldwhen · 2025-10-12T22:02:14 1760306534

Seems to be. There’s little chance this was written by a human.

nurettin · 2025-10-13T03:04:04 1760324644

There's little chance this was even seen by a human.

tdhz77 · 2025-10-12T22:23:22 1760307802

Not comfortable with the phrase edge ai.

TZubiri · 2025-10-12T22:28:35 1760308115

Google has a similar product with Vertex

iJohnDoe · 2025-10-12T22:36:27 1760308587

What are the best Small Language Models (SLMs) these days?

jerpint · 2025-10-13T01:50:42 1760320242

Best is very subjective depends what you want it to do and if you want to fine tune and how big you consider small

gl-prod · 2025-10-12T22:16:27 1760307387

It's funny that they used AI to translate into other languages, because the Arabic cover image is just gibberish.

thenthenthen · 2025-10-13T01:00:02 1760317202

Oh this is hilarious, it is like they used Google Lens like method of translating (overlay the translation, you can see the text blocks). In the Dutch one, the cpu AI text just reads: ‘een’ aka ‘a’ in English

layoric · 2025-10-13T01:47:03 1760320023

Interestingly the French is completely different.

https://github.com/microsoft/edgeai-for-beginners/blob/main/...

flexagoon · 2025-10-12T23:30:41 1760311841

In Russian, the cover image says "Al" (with an L) instead of AI, and on the little CPU icon in the corner "AI" just got replaced with "A".

Edit: seems like it's like that in most languages lol, at least those with a latin script

gl-prod · 2025-10-12T23:36:08 1760312168

It looks like a box with new text inserted over the original image

bn-l · 2025-10-12T21:08:48 1760303328

They are really embracing ai! I can feel them all around even. Above me. Below me.

blibble · 2025-10-13T02:51:56 1760323916

given how bad their software has been historically

imagine how much worse it will be soon, given everything they seem to be outputting now is entirely generated slop

liamkearney · 2025-10-12T22:57:35 1760309855

TL;DR

This is a course on how to use Microsoft compute to maximise their profits

tkzed49 · 2025-10-12T23:43:19 1760312599

Too long for you to read? It's about running AI on local devices

rmccrear · 2025-10-12T22:53:27 1760309607

I clicked hoping the models would be available in the “Edge” browser.

doctoboggan · 2025-10-12T22:33:21 1760308401

The very first sentence:

> Welcome to EdgeAI for Beginners – your comprehensive...

Em dash and the word "comprehensive", nearly 100% proof the document was written by AI.

I use AI daily for my job, so I am not against its use, but recently if I detect some prose is written by AI it's hard for me to finish it. The written word is supposed to be a window into someone's thoughts, and it feels almost like a broken social contract to substitute an AI's "thoughts" here instead.

AI generated prose should be labeled as such, it's the decent thing to do.

lxgr · 2025-10-12T22:38:50 1760308730

Or just by somebody that knows how to use English punctuation properly.

Is it so hard to believe that there are some people in the world capable of hitting option + “-“ on their keyboard (or simply let their editor do it for them)?

doctoboggan · 2025-10-12T22:46:42 1760309202

I said em dash _and_ the word comprehensive. If you work with LLM generated text enough it gets very easy to see the telltale signs. The emojis at the start of each row in the table are also a dead giveaway.

I am guessing you are one of those people who used em dashes before LLMs came out and are now bitter they are an indicator of LLMs. If that's the case, I am sorry for the situation you find yourself in.

accoil · 2025-10-12T22:53:43 1760309623

If it makes a difference: it's an en dash used in the readme.

I've been wondering why LLMs seem to prefer the em dash over en dash as I feel like en (or hyphen) is used more frequently in modern text.

schrodinger · 2025-10-13T03:02:52 1760324572

In my experience the em dash is still correctly used, the modern style has just evolved to put a space around it.

So:

* fragment a—fragment b (em dash, no space) = traditional

* fragment a — fragment B (em dash with spaces) = modern

* fragment a -- fragment b (two hyphens) = acceptable sub when you can’t get a proper em to render

But en-dashes are for numeric ranges…

cal85 · 2025-10-12T22:56:05 1760309765

It's not an em-dash, it's an en-dash, which is rare in LLM output. Also just stop being insufferable.

username223 · 2025-10-13T02:27:44 1760322464

> The emojis at the start of each row in the table are also a dead giveaway.

What's up with the green checks, red Xs, rockets, and other stupid emoji in AI slop? Is it an artifact from the cheapest place to do RLHF?

keyle · 2025-10-13T02:46:41 1760323601

Doesn't a word document essentially convert dashes to emdashes?

oofbey · 2025-10-13T01:11:35 1760317895

You forget that MS Word loves to substitute things like em dashes in where you don’t want them. The “auto correct” to those directional quotation marks that every compiler barfs on used to be a real peeve with I was forced to use MS junk.

username223 · 2025-10-13T02:18:58 1760321938

> AI generated prose should be labeled as such, it's the decent thing to do.

The decent thing to do is to prefix the slop with the prompt, so humans don't waste their time reading it.

Legend2440 · 2025-10-13T01:05:44 1760317544

I don’t really care if it was.

It’s also documentation for an AI product, so I’d kinda expect them to be eating their own dogfood here.