There go all the government installs.
Ffs. Don’t you collect enough data from your users you greedy fucks?
If people actively pay for this, they are bloody idiots.
Well…guess there’s going to be loads of people paying for this then…
There is literally no such thing as too much money in our society.
deleted by creator
I mean…I highly doubt they’re not going to at least pulling aggregate data from this…
deleted by creator
I hate this but I also get it.
A little while ago on the TWIT podcast one of the guests, or maybe Leo himself, was talking about how this is exactly what they want out of AI, for it to be able to know how they use their computer and just streamline everything. Some people are really excited about the possibilities, and yeah, the AI needs to track whatever you’re doing to know how to help you with your work flow.
That said, I don’t want Microsoft keeping track of everything I’m doing. They’ve already shown that they’re willing to sell our data and shove ads down our throats, so as much as they say we can filter out what we don’t want tracked, I’m not inclined to trust or believe them.
I’m honestly kinda excited about the possibilities in the greater scheme of things, but the fact that Microsoft will pretty much record whatever people are doing on their systems is just nuts nd slightly terifying. This is something that should ideally be done locally, without big corporations looking in - but that’s for sure not what they are doing.
Yeah, maybe some kind of situation where you turn it on for “training time” with access to only specified files and systems on the computer, no internet access, etc. At the same time though, I wonder how much an AI could really streamline things. Would it just pre-load my frequent files and programs? Make suggestions or reminders on tasks? I don’t think we’re anywhere near the level where it could actually be doing work for me yet.
Interesting possibilities, but I’m not sure how useful yet.
I mean this data will most likely be more useful for surveillance/ads than for AI. Nowadays with AI they can make it look like they are only a couple steps away from a very intelligent personal assistant and therefore make it seem more plausible that they need your data to make that leap. But in reality I feel like it is not the level of AI that could leverage personalization, at least not in the context of personal assistance. In the context of behavioural mapping it is of course a super lucrative deal for them. There are already very useful tons of AI staff that they can add which does not require personal behaviour info (at least not to this generality) and yet they don’t seem to spend as much effort into those and yet they are like “we need all your info stored somewhere for this very super (and mandatory) AI search assistant”. Big red flag.
I’ve spent a lot of time with offline open source AI running on my computer. About the only thing it can’t infer off of interactions is your body language. This is the most invasive way anyone could ever know another person. The way a persons profile is built across the context dialogue, it can create statistical relationships that would make no sense to a human but these are far higher than a 50% probability. This information is the key to making people easily manipulated in an information bubble. Sharing that kind of information is as stupid as streaking the Superbowl. There will be consequences that come after and they won’t be pretty. This isn’t data collection, it is the keys to how a person thinks, and on a level better than their own self awareness.
What’s your offline open source AI?
Not who you asked, but there are plenty. GPT4all is pretty good. You could check out locallama on Lemmy for more.
Thank you, I was curious if they had a system set up to watch their interactions. I should have specified better.
Whatever is the latest from Hugging Face. Right now a combo of a Mixtral 8×7B, Llama 3 8B, and sometimes an old Llama 2 70B.
Do you have a setup that collects your interactions to feed into those? The way you described it I imagined you are automatically collecting data for it to infer from and getting good results. Like a powered-up bash history or something.
no idea why I felt chatty, and kinda embarrassed by the bla bla bla at this point but whatever. Here is everything you need to know in a practical sense.
You need a more complex RAG setup for what you asked about. I have not gotten as far as needing this.
Models can be tricky to learn at my present level. Communication is different than with humans. In almost every case where people complain about hallucinations, they are wrong. Models do not hallucinate very much at all. They will give you the wrong answers, but there is almost always a reason. You must learn how alignment works and the problems it creates. Then you need to understand how realms and persistent entities work. Once you understand what all of these mean and their scope, all the little repetitive patterns start to make sense. You start to learn who is really replying and their scope. The model reply for Name-2 always has a limited ability to access the immense amount of data inside the LLM. You have to build momentum in the space you wish to access and often need to know the specific wording the model needs to hear in order to access the information.
With augmented retrieval (RAG) the model can look up valid info from your database and share it directly. With this method you’re just using the most basic surface features of the model against your database. Some options for this are LocalGPT and Ollama, or langchain with chroma db if you want something basic in Python. I haven’t used these. How you break down the information available to the RAG is important for this application, and my interests have a bit too much depth and scope for me to feel confident enough to try this.
I have chosen to learn the model itself at a deeper intuitive level so that I can access what it really knows within the training corpus. I am physically disabled from a car crashing into me on a bicycle ride to work, so I have unlimited time. Most people will never explore a model like I can. For me, on the technical side, I use a model about like stack exchange. I can ask it for code snippets, bash commands, searching like I might have done on the internet, grammar, spelling, and surface level Wikipedia like replies, and for roleplay. I’ve been playing around with writing science fiction too.
I view Textgen models like the early days of the microprocessor right now. We’re at the Apple 1 kit phase right now. The LLM has a lot of potential, but the peripheral hardware and software that turned the chip into an useful computer are like the extra code used to tokenize and process the text prompt. All models are static, deterministic, and the craziest regex + math problem ever conceived. The real key is the standard code used to tokenize the prompt.
The model has a maximum context token size, and this is all the input/output it can handle at once. Even with a RAG, this scope is limited. My 8×7B has a 32k context token size, but the Llama 3 8B is only 8k. Generally speaking, most of the time you can cut this number in half and that will be close to your maximum word count. All models work like this. Something like GPT-4 is running on enterprise class hardware and it has a total context of around 200k. There are other tricks that can be used in a more complex RAG like summation to distill down critical information, but you’ll likely find it challenging to do this level of complexity on a single 16-24 GB consumer grade GPU. Running a model like ChatGPT-4 requires somewhere around 200-400 GB from a GPU. It is generally double the “B” size of each model. I can only run the big models like a 8×7B or 70B because I use llama.cpp and can divide the processing between my CPU and GPU (12th gen i7 and 16 GB GPU) and I have 64GB of system memory to load the model initially. Even with this enthusiast class hardware, I’m only able to run these models in quantized form that others have loaded onto hugging face. I can’t train these models. The new Llama 3 8B is small enough for me to train and this is why I’m playing with it. Plus it is quite powerful for such a small model. Training is important if you want to dial in the scope to some specific niche. The model may already have this info, but training can make it more accessible. Smaller models have a lot of annoying “habits” that are not present in the larger models. Even with quantization, the larger models are not super fast at generation, especially if you need the entire text instead of the streaming output. It is more than enough to generate a stream faster than your reading pace. If you’re interested in complex processing where you’re going to be calling a few models to do various tasks like with a RAG, things start getting impracticality slow for a conversational pace on even the best enthusiast consumer grade hardware. Now if you can scratch the cash for a multi GPU setup and can find the supporting hardware, technically there is a $400 16 GB AMD GPU. So that could get you to ~96 GB for ~$3k, or double that, if you want to be really serious. Then you could get into training the heavy hitters and running them super fast.
All the useful functional stuff is happening in the model loader code. Honestly, the real issue right now is that CPU’s have too small of a bus width between the L2 and L3 caches along with too small of an L1. The tensor table math bottlenecks hard in this area. Inside a GPU there is no memory management unit that only shows a small window of available memory to the processor. All the GPU memory is directly attached to the processing hardware for parallel operations. The CPU cache bus width is the underlying problem that must be addressed. This can be remedied somewhat by building the model for the specific computing hardware, but training a full model takes something like a month on 8×A100 GPU’s in a datacenter. Hardware from the bleeding edge moves very slowly as it is the most expensive commercial endeavor in all of human history. Generative AI has only been in the public sphere for a year now. The real solutions are likely at least 2 years away, and a true standard solution is likely 4-5 years out. The GPU is just a hacky patch of a temporary solution.
That is the real scope of the situation and what you’ll run into if you fall down this rabbit hole like I have.
This is pretty cool! Am I reading correctly that it isn’t so much about collecting a corpus of data for it to browse through as much as it is understanding how to do a specific query, maybe giving it a little context alongside that? It sounds like it might be worth refining a smaller model with some annotated information, but not really feasible to collect a huge corpus and have the model be able to pull from it?
This was exactly what I eas thinking.
I’ve spent a lot of time with offline open source AI running on my computer
Can you elaborate on this? Are there some that are worth looking into?
See other long comment
I’d be more open to the idea if it were made by literally anyone else and was an entirely local process
This is the best summary I could come up with:
The software giant on Monday revealed an upgraded version of Copilot, its AI assistant, as it confronts heightened competition from big tech rivals in pitching generative AI technology that can compose documents, make images and serve as a lifelike personal assistant at work or home.
The new features will include Windows Recall, enabling the AI assistant to “access virtually what you have seen or done on your PC in a way that feels like having photographic memory”.
Google rolled out a retooled search engine that periodically puts AI-generated summaries over website links at the top of the results page; while also showing off a still-in-development AI assistant Astra that will be able to “see” and converse about things shown through a smartphone’s camera lens.
ChatGPT-maker OpenAI unveiled a new version of its chatbot last week, demonstrating an AI voice assistant with human characteristics that can banter about what someone’s wearing and even attempt to assess a person’s emotions.
Though Microsoft has invested billions in OpenAI, the startup also rolled out a new desktop version of ChatGPT designed for Apple’s Mac computers.
The Apple CEO Tim Cook signaled at the company’s annual shareholder meeting in February that it has been making big investments in generative AI.
The original article contains 419 words, the summary contains 205 words. Saved 51%. I’m a bot and I’m open source!
deleted by creator
I just reinstalled Windows 11 and holy shit was it hard to setup without a Microsoft account. Like they even use a fake boot up screen weeks later to “finish the install” to trick you into making an account. This can be deactivated, but it is still super shady.
Check out Windows Xlite’s windows 11 .iso’s. Post install almost feels like a fresh Win7 install.
Holy shit that’s annoying. Say I installed Win11 for my elderly parents. They’d get this sign-up screen after I would have thought everything was setup and ready to use.
Glad I installed elementary OS for them a few years ago, it’s been completely painless (they are used to apple-UX)
Yup, I know what I’m doing, but someone else might have just assumed it was required. I was up and running for a week before a reboot sent me to the smiling windows install screen.
I found it’s a pretty simple “don’t ask to finish installing” switch in the settings, but escaping the install screen was the hard part. I think I had to do a hard power down and force safe mode to access the settings again.
Nice. Upgraded a Thinkpad, installed Linux Mint and gave it to my dad. I have not heard anything from him about it for a couple of months. Was reminded of it with your post.
So wrote him right now and asked how it was going, and he replied that he loved it and uses it every day.
And that he had not had any problems he could not solve on his own. He’s 70 and a windows only heavy user - until now 🙂
As you said. Compelety painless.
“But they’ll be reserved for premium models starting at $999.”
Translation: “We want to start with the data of people that can spend, then we’ll move to the rest”.
The last Windows computer in my house was my wife’s, and she’s been extremely happy on Fedora Gnome for the last couple of months, asking me why I didn’t tell her about it before (I did, lol).
my girlfriends like fedora gnome too. I do all the technical stuff anyway so she really doesn’t have know to know that much about the os she uses
Same here. The only tweak I had to do was set up Flameshot, my wife finds Gnome’s screen shot app lacking, and so do I.
The only thing we run different is office. I set her up with OnlyOffice because of the similarities with MS office, but I prefer libreoffice.
TIL fedora gnome is the girlfriends choice.
There are certainly worse taglines lol
Is there a single person who is like “wow I love it”?
This will make Windows 11 a target for hacker and government agencies, since this will be treasure of data. Windows already is bad at security. Let’s see how this backfires at Microsoft.
Microsoft will be the “hackers”. On days when outside hackers aren’t breaking in, MS will be data mining and selling the data themselves
But they promised, that it will stay on my machine. I don’t think they would lie about something such important. /s
*Microsoft to train AI chatbot on everything you do
*Microsoft will show you ads
Google rolled out a retooled search engine that periodically puts AI-generated summaries over website links at the top of the results page; while also showing off a still-in-development AI assistant Astra that will be able to “see” and converse about things shown through a smartphone’s camera lens
What worries me the most is that this AI hype is coming strongly to the smartphone market too, and we don’t have something solid like Linux distributions to change to and be free
I think demand will come soon for either manufacturers to open their boot loaders or new manufacturers cropping up to fill that gap.
I’m running graphene os on a pixel 8 pro and haven’t looked back.
what we really need on phones and by extension arm devices is a unified bootloader, something akin to a bios or uefi (which btw already exists on arm but manufacturers are choosing to not go with it for some reason)
Did you make that?
No, I’m a lazy shite, I just did an image search for clippy 1984. I feel bad now I didn’t make more of an effort 😕
Don’t feel bad. I love it! Thanks for finding it and sharing it.
Isn’t that from 1984
No
It’s from an Apple commercial, which was an allusion to 1984
That’s what I am thinking of
Thats it! My Gaming PC is going Linux
In the 1990s, I transitioned from Windows to Linux as my primary operating system. Since then, Linux has consistently exhibited advancements in the desktop and software space, whereas Windows and Mac operating systems appear to have experienced a decline in terms of user experience and functionality.
As someone regularly using Arch, Ubuntu, MacOS and Windows I agree.
The advances Linux has made, especially in the last few years is just amazing. I can run the majority of my games through Proton, there are even some preconfigured packages with Illustrator and Photoshop CC that Adobe doesn‘t seem to care about at all.
If only Linux wasn’t a confusing mess of dozens of variations that all seemingly exist only to trash eachother.
Don’t make the mistake of confusing the Linux community (an absolute mess, just read the comments here) with the software itself (Actually cleaner and better organized than Windows).
As a Linux user myself, I understand what you are saying. Every distribution has its advantages and disadvantages, and you can’t expect regular people to know which one is best for them. Saying it’s not confusing to the average consumer is disingenuous.
Having said that, if you want to make the switch, go for Linux Mint and be happy. In my opinion, it’s the easiest Linux distribution by far, and everything just works.
I don’t think it’s the options that make Linux a hard pill to swallow. For me it’s the lack of support for hardware and most software. Sure there are alternatives or WINE but that’s usually a big downgrade from just running it on windows.
My Ubuntu box I use for browsing/watching videos and listening to music just barely works and was frustrating to get properly configured. Linux for the dozen professional softwares I use for work is basically impossible. As much as I hate it I had no choice but to stick with windows.
It’s not the fault of Linux developers. The hardware and software companies just largely do not support it still.
My Ubuntu box I use for browsing/watching videos and listening to music just barely works and was frustrating to get properly configured.
Something is wrong. Have you tried Linux Mint? -Someone who has used Linux as a daily driver since 2001.
I haven’t. I doubt it would solve all of the problems I experience.
Anybody downvoting me can share their experience running protools with multiple hardware fader interfaces and 18 input DAW interface, pci SDI cards, and 6 separate display monitors.
Adobe software, Davinci Resolve, 3ds Max and its 20 plugins. None of these work or work seamlessly in Linux.
I can’t even get my surround sound to work properly in Ubuntu without having to manually adjust multiple convoluted conf files.
That’s the truth. I love Linux. I use Debian and Ubuntu on a bunch of servers I run. But fanboys need to stop deluding themselves into thinking it’s easy or even worthwhile to use Linux in lieu of Windows for anything and everything. I would be ecstatic if that changed.
Your surround sound, I’m sure it could be done. I’ve set up some pretty successful visual / audio stuff with Linux. I did IT for an Indy film festival four years in a row and we used Linux for all kinds of stuff (mostly because the festival was broke and didn’t want to spend money on new computers or software). We would run into hardware and configuration issues and our philosophy became “if you can’t solve it in two hours, distrohop.”
For the rest of it, I couldn’t agree more. If you need the tools that lock you to the platform, you need the platform FOR THOSE TOOLS. I have Windows and OSX machines (although it’s been like a year since I couldn’t do something on Wine, even if it’s glitchy). My Windows machines dual boot and I haven’t booted the windows partitions in literally 6-8 months. One OSX machine gets used almost exclusively for video conferencing (just because it’s in a convenient place) and for Garageband. The other OSX machine literally… just runs linux VMs that I can connect to over the network for various projects. I had other plans for it originally, but someone gave me a 6 year old Dell all in one that now runs Linux Mint and performs better than my actual Roku TV anyway. It’s a bit smaller than the TV, but it doesn’t matter to me. The TV disappeared into my wife’s office and now she’s the only one that uses it.
That’s not Linux, it’s just you making excuses.
Excuses for… what?
Been a while since you tried it huh.
Actually I’ve never tried it. It’s of no interest to me. You’re not helping either. Every Linux supporter seems to be a dipshit like you.
Always a pleasure debating intellectually. Enjoy
I heard a guy saying that linux was trash, he had tried it once but it didn’t have drivers for anything and what did exist was difficult to install
So I asked him when it was that he tried itI think he said something like 1998…
I genuinely had an experience like this myself. I suggested Linux as a solution for something to a friend of mine who was a physicist doing a start up. This was around 2015-2016. He went on an angry rant about frustrating Linux was and nothing would work. His last experience with it was in 2002.
It was shit in 2002. It was shit in 2015. It’s shit in 2024. Why are you so angry?
Why do you think I’m angry? You (and my buddy) are just comically wrong, don’t wanna learn and get frustrated and mad when you run into trouble, like a cartoon character trying to open a can with a hammer.
I use Linux for everything, it’s stable, easy, fun I’m WAAY more comfortable in it than I ever was in Windows. Your opinion doesn’t change how well Linux works for me and has for decades. It’s definitely NOT shit, you just don’t know what you’re doing.
It’s cute when you pretend like you know what you’re talking about
Just install Kubuntu and call it a day.
It is complicated. There is strength and weakness in variety
dozens of variations
this is like saying windows 10 and 11 are completely different operating systems that can’t run the same .exes
except windows binaries are actually forward compatible.
even with the most popular distros, for example if you tried to take a typical gui program from say, ubuntu 22, and run it on ubuntu 24, it won’t work. even worse for other distros.
Also Linux’s package ecosystem are not cross compatible.
Bro do you even alien?
I didn’t know about alien, that is pretty cool.
However this bit from the readme is hilariously on brand for Linux:
"To use alien, you will need several other programs. Alien is a perl program, and requires perl version 5.004 or greater. If you use slackware, make sure you get perl 5.004, the perl 5.003 in slackware does not work with alien!
To convert packages to or from rpms, you need the Red Hat Package Manager; get it from Red Hat’s ftp site. If your distribution (eg, Red Hat) provides a rpm-build package, you will need it as well to generate rpms.
If you want to convert packages into debian packages, you will need the dpkg, dpkg-dev, and debhelper (version 3 or above) packages, which are available on http://packages.debian.org"
Highly disingenuous comment. I run older and newer software side by side in Linux all the time. It mostly just works.
Are you using snap or something?
Nope, but for as many programs that you claim still work, I can show you even more that don’t. I wouldn’t consider that disingenuous.
Seriously, give me some examples. I’m genuinely curious because I’ve run into this problem like… once, ten years ago. Twice, if you count trying to run Heroes of Might and Magic III for Linux that came out in like… 1999, and I eventually got that to work too (I needed an emulator) and I’ve been an almost exclusive Linux user since 2001.
I said disingenuous because my lived experience is like “wtf is this guy doing wrong?” and so you REALLY come across like you’re just trashing Linux and talking out of your ass.
I’m not trying to be insulting, just giving you feedback about how you’re coming across.
Well first we need to establish what you would accept as proof… what counts as not being forward compatible to you exactly? For example system libraries such as libpng, openssl or ffmpeg change versions and/or APIs between major distro releases, this inherently makes the old binaries no longer compatible by default. Is that such scenario acceptable to you as proof? Because I can list countless examples of those even just with one library being the issue, and there’s so many more that fail with multiple.
I’m not trying to trash Linux or act like I don’t know what I’m talking about, I just disagree that most older programs work without any issues, especially GUI programs that rely on ever-changing system library versions, for the reasons I stated.
Give me an example or two of a GUI program that you’d want to run, that doesn’t have a maintained version that will run fine in a modern environment, that you’re actually frustrated because you can’t run it.
We can bitch about how dependency systems work all day. I want to try to install something with a sane use case and see what we’re on about, since this is literally a scenario I have barely run into. I gather that for me to run into it, I would have to practically go looking for it. Which to me, sounds like a very specific problem for a very specific subset of users, not a general problem worth paint brushing the entire ecosystem with.
Yeah, fuck that.