More

itsibitzi · 2025-09-15T11:23:04 1757935384

What tool or ecosystem does this well, in your opinion?

taminka · 2025-09-15T12:57:34 1757941054

any language that has a standardised build system (virtually every language nowadays?), but doesn't have a centralised package repository, such that including a dependency is seamless, but takes a bit of time and intent

i like how zig does this, and the creator of odin has a whole talk where he basically uses the same arguments as my original comment to reason why odin doesn't have a package manager

zoobab · 2025-09-15T14:14:26 1757945666

"a standardised build system (virtually every language nowadays?)"

Python packages still manage poorly dependencies that are in another lang like C or C++.

taminka · 2025-09-16T13:53:01 1758030781

that's two different languages, they don't have have a standardised build system across them

itsibitzi · 2025-06-09T21:07:47 1749503267

SecureDrop is great and we still will be using it at the Guardian for the foreseeable future. At the very least just to support sources who want to blow the whistle but don't use our app.

In terms of how it's different. We attain anonymity without requiring a user to install Tor Browser, which we think is significant. Building this feature into our news app lowers the barrier of entry for non-technical sources quite significantly, and we think helps them achieve good OPSEC basically by default.

CoverDrop (aka Secure Messaging) has a few limitations right now that we'll be working to overcome in the next few months. Primarily that we don't support document upload due to the fact that our protocol only sends a few KB per day. Right now a journalist has the option to pivot the user onto another platform e.g. Signal. This is already better since the journalist can assess the quality of, and risks posted to, the source before giving their Signal number.

The current plan to improve this within the CoverDrop system is to allow a journalist to assess the risk posted to a source and, if they deem it acceptable, send them a invite link to upload documents which the client will encrypt with their keys before sending. This affects anonymity of course so we'll be investigating ways in which we can do this while doing our best to keep the source anonymous. There are a few techniques we could use here, for example making the document drop look like an encrypted email attachment being sent to a GMail account. I like this[1] paper as an example of an approach we could take that is censorship resistant.

Another limitation is that the anonymity of our system is largely predicated on the large install base of our app. In the UK/US/AU we have a pretty large install base so the anonymity properties provided by the protocol are nice, but if another smaller news agency were to pick up our tech as it stands right now then they wouldn't have this property. That said, in practice just having our plausibly deniable storage approach is a pretty big improvement over other whistleblowing approaches (PGP, Tor based, etc), even if you're the only person in the set of possible sources using the app.

[1] https://petsymposium.org/popets/2022/popets-2022-0068.pdf

itsibitzi · 2025-06-09T15:23:15 1749482595

Tech lead on the project here.

I would certainly recommend that readers not use a work phone, not only for the reasons you've stated but also that a lot of work devices use mobile device management software which is functionally spyware. To your point, dealing with having a very small anonymity set is tricky regardless of the technology used.

We do go to great lengths to make usage of the app to blow the whistle plausibly deniable. Data is segmented to "public" and "secret" repositories, where any secret data is stored within a fixed-sized/encrypted vault protected by a KDF technique that was developed by one of the team in Cambridge (https://eprint.iacr.org/2023/1792.pdf)

But of course, all this could be for nothing if you've just got corporate spyware on your device.

This is certainly something we've talked about internally but I've double checked the in-app FAQs and I think we could be more clear about recommending users not use on a work device, especially with MDM. We'll get that updated as soon as possible. Thanks!

-- edit

I should add that we do some basic detection devices that have been rooted or are in debug mode and issue a warning to the user before they continue. I'd be interested in what we can do to detect MDM software but I fear it might become a cat-and-mouse game so it's preferable that folks not use their work devices at all.

mdhb · 2025-06-09T15:31:02 1749483062

Yeah besides that bit of feedback, I think the project is brilliant and actually has a lot of nice parts to it that go way beyond the technical aspects but really show a sophisticated understanding of what you actually want out of a real life somebody might end up seriously harmed if this goes wrong covert communications system so kudos to you and the team on that!

Edit: you might want to consider putting that warning about work devices in the app itself right before someone pushes forward with making potentially life changing decisions and doesn’t rely on them reading an FAQ. I see you already have an onboarding flow in place. It would be really simple to make that the first screen of it.

itsibitzi · 2025-06-09T16:05:45 1749485145

I agree, it should certainly be front-and-centre, either the landing page or the on-boarding carousel.

I'll see if we can get something together before the next app release. Thanks again!

mdhb · 2025-06-09T16:16:05 1749485765

Happy hunting! Hope you’re able to deliver some really nice scoops safely with this in the future. It was actually really refreshing to see a news organisation take this seriously beyond just “here’s my signal”

irq-1 · 2025-06-09T20:20:10 1749500410

Any plans for spam? Does the app have a device id or account, so you can disable them?

If the message is encrypted for the reporter and they're the only ones who can read it, what does the organization do to manage this? Are passwords for private keys saved with the org, or are the keys saved with multiple accounts? What do you do when someone forgets their password?

Cool app; just encryption management when it comes to human users must have lots of trade-offs.

itsibitzi · 2025-06-09T20:40:21 1749501621

On spam:

We’ve got some basic filtering for full on DoS type attacks already.

The difficulty here is that a user can produce a reasonable amount of spam from a spread of IP addresses which would be disruptive to our journalist users but below threshold to be considered a DoS attack.

It’s tricky because we can’t have anything that could link a given message to a given user as that would break anonymity.

We’ve got some ideas with anonymous credentials from app attentions for the more long term. E.g. if you’re expected to submit 1 message an hour from your queue you can request 24 single use tokens from the API by performing an attestation that you’re running a genuine app. You then spend these as you send messages. We don’t have a full spec for this right now such that it can be fully anonymous but that’s the general idea.

There’s also some possible spam detection we can do in the journalist GUI which we’re interested in exploring. Right now the spam control is quite basic (muting) but the message rate is low due to the threshold mixer anyways so not so bad.

On key management:

Each journalist has an encrypted vault which requires a key derived from a password. If this password is lost and the journalist has no backup then it’s game over. We need to regenerate their identity in the key hierarchy as if they were a new user and messages they’ve not seen are lost, there is no way to pick up those sources again.

We have some plans on using MLS as an inter-journalist protocol which should enable having multiple actual humans per journalist/desk listed in the app. That would depend on the journalists agreeing to have their vault be shared of course. Once multiple humans are backing a single vault then the risk of password loss becomes smaller as if one journalist loses their password the other journalist should be able to share their back messages to them.

itsibitzi · 2025-06-09T15:01:40 1749481300

Project lead here, the Secure Messaging part of the app is accessible without a subscription through the main menu.

k310 · 2025-06-09T15:54:03 1749484443

THANKS! The app store always makes it difficult or impossible to discern what features are available via IAP/subscription rather than the free version.

Your feedback is much appreciated.

itsibitzi · 2025-05-29T12:28:47 1748521727

Out of interest, how did you stream the video data to the frontend?

platevoltage · 2025-05-29T17:58:29 1748541509

I used MediaStream, which is part of the standard Web API

https://developer.mozilla.org/en-US/docs/Web/API/MediaStream

itsibitzi · 2025-03-10T11:24:43 1741605883

The elite Silicon Valley venture capitalist class has made it pretty clear which side they're on.

nonrandomstring · 2025-03-10T12:09:57 1741608597

They pick their sides for profitable expedience. I expect they would betray those sides just as fast for profitable expedience. US, EU, it's all one big world, and business never prospers in the long-run under the heel of a boot.

chillingeffect · 2025-03-10T12:10:05 1741608605

Is it a monoculture? obviously theres Thiel and his clan. But are there other powerful elements?

itsibitzi · on Jan 15, 2025

As someone who works in the news industry I find it pretty sad that we've just capitulated to big tech on this one. There are countless examples of AI summaries getting things catastrophically wrong, but I guess Google has long since decided that pushing AI was more important than accurate or relevant results, as can also be seen with their search results that simply omit parts of your query.

I can only hope this data is being incorporated in some way that makes hallucinations less likely.

nerdjon · on Jan 15, 2025

Unfortunately this has just been the reality over the last couple years. People just ignore the hallucination problem (or try to say it isn't a big deal). And yet we have seen time and time again examples of these models being given something, told to summarize it, and still hallucinate important details. So you can't even make the argument that its data is flawed or something.

These models will interject information from their training whether or it is relevant or not. This is just due to the nature of how these models work.

Anyone trying to argue that it doesn't happen that often or anything is missing the key problem. Sure it may be right most of the time, but all that does is build a false sense of security and eventually you stop double checking or clicking through to a source. Whether it is a search result, manipulating data, or whatever.

This is made infinitely worse when these summaries are one and done, a single user is going to see the output and no one else will see it to fact check. It isn't like an article being wrong that everyone reading it is reading the same article, can then comment that something is wrong, it get updated, and so on and so forth. That feedback loop is non-existent with these models

umvi · on Jan 15, 2025

> Anyone trying to argue that it doesn't happen that often or anything is missing the key problem. Sure it may be right most of the time, but all that does is build a false sense of security and eventually you stop double checking or clicking through to a source. Whether it is a search result, manipulating data, or whatever.

Same problem existed before AI summaries.

"Briefly stated, the Gell-Mann Amnesia effect is as follows. You open the newspaper to an article on some subject you know well. In Murray's case, physics. In mine, show business. You read the article and see the journalist has absolutely no understanding of either the facts or the issues. Often, the article is so wrong it actually presents the story backward—reversing cause and effect. I call these the "wet streets cause rain" stories. Paper's full of them.

In any case, you read with exasperation or amusement the multiple errors in a story, and then turn the page to national or international affairs, and read as if the rest of the newspaper was somehow more accurate about Palestine than the baloney you just read. You turn the page, and forget what you know."

– Michael Crichton (1942-2008)

extr · on Jan 15, 2025

Hallucinations are not a big problem with SOTA models at this point, especially grounded against an actual news article.

dismalaf · on Jan 15, 2025

> I can only hope this data is being incorporated in some way that makes hallucinations less likely.

The key word is "real-time". LLMs can't be trained in realtime, so it's obviously going to call an API that pulls up and reads from AP news, just like their search engine.

notatoad · on Jan 15, 2025

i don't think you can assume that - "real time" in this context could just mean they feed every article into their training system as soon as it's published.

iamjackg · on Jan 15, 2025

That seems more unlikely to me -- training is not free and takes a long time, so it would not result in "[enhancing] the usefulness of results displayed in the Gemini app" and it being "particularly helpful to our users looking for up-to-date information."

Fine-tuning, which is cheaper and faster, has been proven to not be a good solution to "teach" models new facts.

I think what's most likely here is that Gemini will have access to a form of RAG based on a database of AP articles that gets updated in real-time as new articles are published.

CuriouslyC · on Jan 15, 2025

They can't deploy that fast and people want to pin model version so it's not feasible anyhow.

summerlight · on Jan 15, 2025

If there's any company who can afford "real-time LLM training" at this moment, I'm 100% sure they will win this AI race since they probably have at least ~10x compute compared to competitors. Of course, no one can do that right now.

dismalaf · on Jan 15, 2025

Have you ever asked an LLM what time it is? It takes months to train them...

tomrod · on Jan 15, 2025

But it can be trained to access basic, limited APIs to get current information.

dismalaf · on Jan 15, 2025

Yes, which is literally what I suggested it does in my original comment.

tomrod · on Jan 15, 2025

<3 thanks for calling out what I missed. I didn't realize you were supporting an earlier comment on the chain.

scarface_74 · on Jan 15, 2025

The examples that have made news were with iOS. iOS doesn’t really do a summary of the content. It just tries to do a summary of the headline.

The on device model that it uses is also literally 1% the size of the large models like Gemini

paxys · on Jan 15, 2025

The news industry capitulated to big tech the moment it got reliant on big tech for the majority of its revenue. The entire media landscape today is the direct result of that.

asdff · on Jan 15, 2025

Take it a step back further, and you will see that the media landscape capitulated to Big Anything a long time ago. For probably generations now, if we consider people like william randolf hearst and other newspaper men.

micromacrofoot · on Jan 15, 2025

It's responsibility laundering — AI can say whatever they want and they can shrug it off by saying bots are sometimes unreliable

asdff · on Jan 15, 2025

Uhh, has your head been in the sand? Look at the average output of your industry without ai. It gets things wrong. It misleads. It hallucinates. It has incentives that fundamentally differ from what the readership seeks in news. The fact that your industry took so readily to the technology to output ever more garbage says it all about the state of the industry vs any condemnation of the fundamental technology.

onlyrealcuzzo · on Jan 15, 2025

Gemini is the leading model with the lowest hallucination rate: https://www.visualcapitalist.com/ranked-ai-models-with-the-l...

I would expect that number to go down from 1.3% to below 1% over the course of the year.

There's always a chance what you're reading is wrong - due to purposeful deception, negligence, or accident.

Realistically, hardly anything is 100% accurate besides math.

itsibitzi · on Jan 15, 2025

I think people really don't understand the effort, care and risk that goes into producing quality reporting.

I work with investigative reporters on stories that take many months to produce. Every time we receive a leak there is an extensive process of proving public interest before we can even start looking at the material. Once we can see it in we have to be extremely careful with everything we note down to make sure that our work isn't seen as prejudiced if legal discovery happens. We're constantly going back and forth with our editorial legal team to make sure what we're saying is fair and accurate. And in the end, the people we're reporting are given a chance to refute any of the facts we're about to present. Any mistakes can result in legal action that can ruin the lives of reporters and shut down companies.

Now, imagine I were to go to a reporter who has spent 6 months working on a story about, for example, a high profile celebrity sexually assaulted multiple women, how the royal family hides their wealth and are exempt from laws, or how multinational corporations use legal loopholes to avoid paying taxes, and said, "oh, 1% of people reading this will likely be given some totally made up details".

Given that stories often have more than a million impressions, this would lead tens of thousands of people with potentially libellous "hallucinations".

It simply should not be allowed.

LLMs have their place, for sure, but presenting the news is not it.

tokioyoyo · on Jan 15, 2025

Although I agree with every single sentence you've said, we've seen in the past decade how only very small percentage of people actually care about the content of the news. Everyone just discusses and gets their information from the headlines, so this is a natural consequence of "let's just summarize it to a couple of sentences since nobody reads it anyways".

simonw · on Jan 15, 2025

The Gemini models themselves may score well on this, but Google's feature implementations are a whole other thing. AI Overviews frequently take untrustworthy search results (like a fan fiction plot outline for Encanto 2) and turn those into confidently incorrect answers. https://simonwillison.net/2024/Dec/29/encanto-2/

kccqzy · on Jan 15, 2025

And doesn't bringing in The Associated Press solve this problem? No need for the AI to decide what is trustworthy or not. For the vast majority of people everything The Associated Press publishes is trustworthy.

stusmall · on Jan 15, 2025

1.3% isn't great. I'd rather just go, and pay, directly to trusted news sources. Everyone has different tolerance for falsehoods and priorities I guess.

jonas21 · on Jan 15, 2025

What's the error rate for human journalists? Based on my experience, I'd guess it's much higher than 1.3%.

stusmall · on Jan 15, 2025

As others have already pointed out, feeding these new articles aren't magically going to make them any more accurate. These hallucinations are going to be on top of any errors in the data sources.

I'm not replying to point that out, I think others have done a better job. It's mostly that this conversation made me think of this classic Babbage quote that I've always enjoyed.

"On two occasions I have been asked, – 'Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?' ... I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question"

marsavar · on Jan 15, 2025

Except when that happens, a clarification is almost always added at the bottom of the article ("This article was amended on [date]. An earlier version said xxx" or some variation thereof). You're not gonna get a second push notification from an AI summary saying "Oopsies, the previous notification was wrong". Once it's out, it's out, and that sort of damage is difficult to repair.

onlyrealcuzzo · on Jan 15, 2025

Yes, but that's going to be on top of the ~1.3% hallucination rate (largely, there's always some very small chance it hallucinates the truth when the article had it wrong - but basically not worth considering).

marsavar · on Jan 15, 2025

Anything other than 0% is borderline immoral. Imagine sending a push notification to somebody's phone with a completely made-up headline summary. Even if it happens once in a hundred times, that's too much. Things like that slowly but surely erode trust and make it harder and harder to trust anything that's generated by AI, especially when it comes to news, where trustworthiness is essential, and probably the main reason people pay for news. See for example https://www.bbc.co.uk/news/articles/cge93de21n0o

anomaly_ · on Jan 15, 2025

This is a ridiculous standard. News headlines at the moment would have an error rate wildly above 1.3%. The articles about Apple having trouble with LLM headlines is that the on-device model is weak and it's trying to compress too much into too few characters. I'd guess the chance of Gemini incorrectly summarising an article to be almost 0%.

scarface_74 · on Jan 15, 2025

Have you ever read a news article on a subject where you have expertise and knew it was inaccurate? The news is probably more inaccurate than you think.

I bet you think the news is accurate all other times. It’s called “Gell-Mann Amnesia”

baq · on Jan 15, 2025

You’d have to pay quite a bit to get journalists to answer your questions specifically.

The whole isn’t about generating news articles, it’s about getting the model up to date on facts so it can synthesize a newspaper for you. I’d say it’s a way to get journalists to be journalists again instead of clickbait composers - as long as the model doesn’t inject clickbait there itself. I don’t trust Google to not do it sometime, but they aren’t doing it now and the infrastructure is being made for others to consume when Gemini suffers from inevitable enshittification.

stusmall · on Jan 15, 2025

> You’d have to pay quite a bit to get journalists to answer your questions specifically.

This isn't what I meant. I pay directly for subscriptions/donations to news organizations that employee journalists that do this original reporting. I don't want a middle man that just messes it up. This goes for LLMs and for free news sites that don't do much more than summarize original reporting. I've seen more than a few times where they inject opinions, mess up facts or put focus on what was originally a small side point in the article.

peanutz454 · on Jan 15, 2025

> There's always a chance what you're reading is wrong - due to purposeful deception, negligence, or accident.

I am quite certain my personal hallucinations level is more than 1.3%, obviously we want our machines to be better than us, but my doctor once said folic acid is not a vitamin.

sandspar · on Jan 15, 2025

Journalism is dying, killed by its own excesses and by the internet. Google is offering it life support. The other option is death.

contagiousflow · on Jan 15, 2025

And what are we going to feed into the models without the journalism?

asdff · on Jan 15, 2025

Abrogate the direct primary inputs. Weather sensors. Wildfire cameras. Police scanners. Court proceedings. Changes to ordinances. New LLC filings. Bankrupcies. Birth records. Death records. The whole corpus of society that is automatically logged and used as the primary data for people to then perform research or develop journalism upon.

That is what you siphon up. And in output you can mad lib out an article just like those johnny on the spot AP reporters do anyhow, filling in the skeleton article about a death or an attack or a banquet or award show with the relevant input concerning the event. LLM isn't even used for finding this input but to just adjust the boilerplate, perhaps to tailor news specifically to the reader's own inclinations based on engagement with other articles collected via fingerprinting.

contagiousflow · on Jan 16, 2025

I recommend you find some journalists who's work you find impressive and ask yourself what types of passive inputs could have written pieces like that.

asdff · on Jan 19, 2025

Those represent what 1% of the bulk output of the field if I had to guess? By volume most news is wire service a la terse AP reports that get reposted everywhere. And they have to be terse because it's breaking news and there is no time to opine beyond reporting the inputs mainly as they are in shortform.

baq · on Jan 15, 2025

Exactly the right question. Journalism has been given a lifeline, a way out of the attention economy.

great_tankard · on Jan 15, 2025

You might want to look into why "journalism is dying" and whether Google (and Facebook) had anything to do with it.

Mainsail · on Jan 15, 2025

Theoretical question. What is replacing it? Is it this? Is it something else? Nothing? Curious to peoples thoughts on this.

mattlondon · on Jan 15, 2025

Potentially social media to a certain degree, at least for raw "news" of what is happening.

Of course that willm be shit, but there we are.

airstrike · on Jan 15, 2025

I'd wager 95% of what we call journalism today could simply disappear with no replacement and the world would be better off.

spankalee · on Jan 15, 2025

"killed by its own excesses"?

sandspar · on Jan 16, 2025

Doubling and tripling and quadrupling down on behaviors that most consumers wish they'd stop doing. You used to work at Google so you must be familiar with how groupthink operates.

itsibitzi · on Dec 4, 2024

I think we can be pretty confident that he wasn't shot because an AI product wasn't accurate.

itsibitzi · on Oct 14, 2024

I would agree and expand on this and say such hyper-luddites tend to make picking up new technology a self-fulfilling bad idea. Even if you can present a fantastic business case for something they don't want to learn new things so if you do introduce it they will refuse to learn and progress will suffer as a result.

itsibitzi · on Jan 4, 2024

Disclosure: I'm an engineer working at the Guardian on investigations tools.

In your opinion, what can newspapers and their technology teams do to improve the process of whistleblowing. Any perspective is really interesting!

zthrowaway79875 · on Jan 4, 2024

I appreciate the work you are doing although there is nothing in my experience that I think can assist you. I went through legal channels and it is starting to have the desired effect. The most important thing you can do as a newspaper is be a newspaper. That way I can give my story and shame them publicly as a last resort if needed. Your work will be invaluable for others I have no doubt!

loceng · on Jan 4, 2024

I guess Julian Assange did with Wikileaks was the correct technology and process - and why the establishment has scared everyone away from replicating what he did by his most poor treatment.

So, do what Assange did.

cdibona · on Jan 4, 2024

Assange worked with Rusbridger, Guardian's editor in chief at the time, I think. So .... the guardian should go to the guardian and get their help! :-)

lcnPylGDnU4H9OF · on Jan 4, 2024

It's worth noting that The Guardian's response to Assange's persecution left a lot to be desired[0-2] from a whistle-blowing perspective, especially against a powerful organization. That's not to say one should avoid a journalist publication for telling their story, just that it can backfire in its own way if (presumably) that publication starts to get heat from law man.

0: https://www.jonathan-cook.net/blog/2016-02-05/no-fair-hearin...

1: https://www.jonathan-cook.net/blog/2018-11-28/guardian-vilif...

2: https://www.jonathan-cook.net/blog/2020-09-26/guardian-assan...

loceng · on Jan 4, 2024

So it's a human courage problem, then.