Microsoft's Office suite is a staple of productivity tools, with millions of users entering sensitive personal and company data into Excel and Word. According to @nixCraft, an author at Cyberciti.biz, Microsoft has left its "Connected Experiences" feature enabled by default, reportedly allowing user-generated content to be used to train the company's AI models. Because the feature is on by default, data from Word and Excel files may be used in AI development unless users manually opt out. Such a default raises security concerns, especially for businesses and government workers relying on Microsoft Office for proprietary work, since it could allow documents such as articles, government data, and other confidential files to be included in AI training, creating ethical and legal challenges around consent and intellectual property.
Disabling the feature requires going to: File > Options > Trust Center > Trust Center Settings > Privacy Options > Privacy Settings > Optional Connected Experiences, and unchecking the box. Beyond the unnecessarily long opt-out process, the European Union's GDPR, with which Microsoft complies, requires settings like this to be opt-in rather than opt-out by default. An enabled-by-default setting would directly contradict GDPR, which could prompt an investigation from the EU. Microsoft has yet to confirm whether user content is actively being used to train its AI models. However, its Services Agreement includes a clause granting the company a "worldwide and royalty-free intellectual property license" to use user-generated content for purposes such as improving Microsoft products. The controversy is not new, as more and more companies leverage user data for AI development, often without explicit consent.
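For managed environments, administrators can control this setting centrally via Group Policy or the registry instead of clicking through each client. A minimal sketch as a .reg fragment, assuming Microsoft's documented privacy-controls policy key for Microsoft 365 Apps (the path and the `controllerconnectedservicesenabled` value name come from Microsoft's policy documentation, not from this article; verify against your Office version before deploying):

```reg
Windows Registry Editor Version 5.00

; Disable "optional connected experiences" for the current user.
; Per Microsoft's privacy-controls policy documentation: 1 = enabled, 2 = disabled.
; Applies to Microsoft 365 Apps / Office 16.0 builds; other versions may differ.
[HKEY_CURRENT_USER\Software\Policies\Microsoft\office\16.0\common\privacy]
"controllerconnectedservicesenabled"=dword:00000002
```

Because this is a policy key, it also greys out the corresponding checkbox in the Trust Center so individual users cannot re-enable the feature themselves.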
For current LLMs, the data they are trained on is the key to distinguishing them from competitors. Quality data is the prize, and with a unique dataset like the one Microsoft has access to, an AI model could outperform the competition by a mile in tasks like writing and basic reasoning. Especially with sensitive data not available to the public, Microsoft could extend its AI lead. However, LLMs are not immune to leaking parts of their training data, so a skilled professional could potentially extract it. For now, users who wish to protect their intellectual property are advised to review their settings carefully.
Update Nov 26th 08:00 UTC: Microsoft reached out to us via email and confirmed:
Statement from Microsoft: "Microsoft does not use customer data from Microsoft 365 consumer and commercial applications to train large language models. Additionally, the Connected Services setting has no connection to how Microsoft trains large language models."
Connected Experiences allows users to search and download online content to enhance their documents. This includes templates, images, 3D models, videos, and reference materials. Examples include Microsoft Office templates and PowerPoint QuickStarter presentations. Microsoft has also provided a table of what Connected Experiences downloads, which you can see below:
View at TechPowerUp Main Site | Source