A True History of the Internet

Conservation Laws

2026-05-20T07:22:04.003-07:00

As we see the Giant Killer Asteroid of AI plummet towards the Home of the Human Race, it becomes clear that the laws of thermodynamics were incomplete.

As well as the conservation of energy, momentum, mass and spin (oh, ok, angular momentum), we must now add the conservation of Intelligence.

This is made obvious by the ease with which obviously stupid AIs are able to fool humans in the imitation game. The AI did not have to be as smart as the person, it only had to meet them half way.

This works both at the indivual level, and at the group level, and at the academic level (people can use an AI to write a terrible paper that human reviewers will accept) and the investor level (venture capitalists fall over each other to fund companies whose products will lead to the ends of the earth).

This is the existential threat to humanity. Not that the AI will wish to eliminate us as competition - it won't need to, because once it gets anywhere near that level, we will all dimmer than a Toc H Lamp

It is already too late to put an end to this.

the internet family's least famous kin

2026-05-08T05:21:00.000-07:00

People talk about the "Father of the Internet" or the "Grandfather of the Internet", but no-one mentions the strange uncle of the Internet, or the fairy godmother of the Internet. Nor do they speak (except perhaps on the dark web) of the evil twin of the Internet, or the secret love child of the Internet.

Where are the kissing cousins of the Internet, or the mother-in-law of the Internet? What happened to the wastrel son of the Internet? Indeed, the whole bastard offspring of the Internet have been totally erased from public discourse.

The maiden aunts of the Internet should be given credit, as well as the confirmed bachelor great uncle.

And of course, the Giddy Aunt of the Internet.

fencing the non rivalrous data commons, and the sovereign internet that never really was

2026-04-26T01:48:00.000-07:00

Lovely idea, the declarations of independence of cyberspace notwithstanding, the vision of a new politics is not standing up well to the greatfirewalls and random total isolation of states (yes, old fashioned states, as defined by geographic boundaries, including space and maritime, like china, iran, north korea, but now lots of little old places in Europe)....it isn't that hard to find all the ingress/egress points (including satellite updownlinks etc) and plug them after all...

Meanwhile, the data commons that was the World Wide Web is being rivalled to death by hyperscalers, starting way back with google (search and click through) but now with AI.

Economicists love talk about data as non-rivalrous, because of their naive model of zero copy cost of bits, so anyone can "take" a copy, but the source is also left behind so there's no "loss" of value to the source.

Howver, attention (via meta data, provenance, attribution, even accounting/payment)....is very rivalrous - i only have two eyeballs, and one brain and so many hours each day, so it matters which copy I look at.

When the hyperscalers "add value" to the data, they subtract it from the originators. Worse, by trying to multiply the value (by combining), without recompense to the content creators, they undermine the motives of people to contribute any content. So the consequence is the death of Metcalfe's law (the value of the network is the square of the number of nodes -or at least is super-linear in the number of nodes, as any node that is added is both a consumer and a producer so the adding one more node to n, adds n+1 to the net value, not just 1). This means the hyperscalers' long term business plan is doubly dead. Their value cannot actually multiple once they trained on the common crawl - it will increase (at best) sub-linearly. In fact, as it de-motivates people from contributing any new content, the hyperscalers value will fall. We don't want to be data serfs to the AI overlords.

So AI wasn't an existential threat to humans, but it is an existential threat to knowledge.

Computer Science Fiction

2026-04-22T01:55:00.000-07:00

I was randomly thinking about stories that had no actual computing technology dimension, but illustrated a fundamental idea from Computer Science in some accessible way - so here's a few examples to be getting on with...

naming

bootstrapping

indirection

some of my random acts of digital and literary london history

2026-03-14T05:46:00.000-07:00

Bits:-

I was recently trawling through some bits of my past and recall that I was taken to program a DEC PDP-8 (in an outbuilding of the Royal Free hospital) in the 1960s by an enlightened maths teacher - we had to learn machine code (actual machine code, because we had to programme the machine on paper tape and so we learned the binary for the instructions....

A bit later in the 1970s i did some work for my cousin at the London hospital on a database he was building for a surgeon there who did knee&hip replacements and wanted to do some stats on how long each prosthetic type lasted...this was programmed in Algol-68 and involved punch cards and use of a remote ICL 2980 computer down the mile end road at Queen Mary College.

Not long after that, I got a real job at North London Polytechnic (up on Holloway Rd) in the math&CS department, where we ran a DEC-10 and I got to do some Fortran and Cobol (as well as Algol-60) but with the glory of a real glass tty (terminal) and screen editor (SOS)....also used to go to their teacher training outfit on Prince-of-Wales road to setup a modem link for people learning a bit about computers there to use the DEC -10 remotely...

Moving to the "modern" era, in 1981 I was writing C Code on a PDP-11 (using the DED screen/picture editor) and cross loading code to a DEC LSI 11/23...at UCL down on Gower St...

Words:-

Through the same 3 decades, my family (based mainly in Parliament Hill Fields, then later in Camden town) had a sequence of literary/political friends, so as a kid I was playing in Stella Gibbons house (with her grandson), or else with Benji Webb on Highgate hill - at some point we had a holiday with ponies and caravans near their grandmother's (the famous Beatrice Webb of the Fabians etc). Then I recall my father asking me if I knew who Ivor Cutler was, as my dad went drinking with him in local pubs in Mornington Crescent. We also knew Beryl Bainbridge (we were at this point living in Arlington Road and she was round the corner in Albert Street surrounded by eccentrics and a large cloud of cigarette smoke.

from cypberspace to necropolis

2026-02-07T00:26:00.000-08:00

The Internet was dismantled as a space for humanity, first by the loss of community binding through person-to-person trust and group dynamics -

blame blockchain/cryptocurrencies, but they are just one symptom of the alienation, that built environments like cities, then suburbs, then oneline social media and the web finally perfected.

As with crypto-currencies, so with AI, but this was foreseen by Mbemebe in the analysis in the great book on Necropolitics

I was the Son of Sam, I contain multitudes, Now I am become Death, the destroyer of worlds.

As with commons, land was enclosed, by Barons, by Kings, by Nation States - to keep people outbut camps and campfirewalls to keep people and ideas in and so with trade agreements, and the flow of information and knowledge, and so with the Interweb.

The Zombie Apocalypse is here and it is AI Voodoo, just as Neuromancer pressaged. And the zombies are the camp police. And we are interned. stoned. maculate, and soon not even dead.

p.s. a little machine learning is a dangerous thing...

fabless industries aren't exactly new

2026-01-27T01:41:00.000-08:00

I'm reading Apple in China (see), which is interesting at the detail level (basically how they trained up with the help of Taiwan literally millions of skilled people to produce all their tech - it is now hardly surprising that china doesn't need the US's help any more to make new stuff...)

But the author seems to think that outsourcing tech manufacturing was something terribly new and clever.

Don't get me started on clothing and the east india company and the british empire...

Closer to home for the USA, however, is a much more instructive story - the electric guitar:

(Also lots of other instruments- remember Yamaha flutes were jolly good an way cheaper than Gemeinhardt)

but gibeon and fender both started manufacturing in mexico and japan then later in indonesia - sometimes trying to "brand" things different(ly) so western prejudice about "lower build quality" from those foreigners was offset by calling things Epiphone or Squier (actually its a bit more complex than that, but you get the idea)...

But in reality the outsorced products weren't just a whole lot cheaper. I own a japanese made custom shop fender strat and a 1982 squier tele (possibly/probably made in korea) - both are a very very good - better than instruments i've tried at 5-10 times the price (the tele was off a friend but if you look at the time they were £150. the strat was used so hard to know what new cost was but probably £1500.

And nowadays, i'd still buy a squier or epiphone (i had a beautiful 335 for a while) or just bite the bullet and buy an ibanez (actually, just did - £300 - fantastic quality - though i did recently buy a G&L Tribute (fretless) bass which was made in the USA and was very sensibly priced (unde £500).

Interestingly, just reading innovation in real places too which tells a similar story about bike manufacturing moving from US (schwinn et al) to Taiwan (Giant) for exactly the same set of reasons....

Anyhow, what Apple did (hollowing themselves out, bleeding to death) was absolutely nothing new. It wasn't even rock'n'roll.

Mind your Ps and Qs, and the K, M, G, and Ts will look after themselves....

2026-01-18T02:23:00.000-08:00

M I remember back in the early 1980s Sun Microsystems (old unix workstation maker of fine machines) said that the breakthrough was 3Ms - Megabyte of memory, MegaHz processor speed and Megabits of networking (I seem to recall 4Mb, 4MHz and 3Mbps) - prior to this, of course, we'd had 64Kbytes of RAM in PDP11s and 64kbps of network speed and some Khz process clocks).
G A decade later, Craig Partridge published a great book on Gigabit networking, and storage and processor speeds were, indeed, at least heading for Gbytes and GHz.
T It took a bit longer, but it is certainly possible to get Tbps or close (800Mpbs) of networking, and Tbytes of storage (RAM is still a bit pricey for that to be common, but in cluster compute its a thing, but on my laptop, Tbyte SSD is totall affordable). While Moore's law ran out of steam fairly recently, so an individual core might still be GHz, I can have a lot of cores, so total processing throughput (including CPU and GPUs) might easily look like THz.
P It's interesting to speculate what tech will look like for Petabits a second, but the optical fiber photonics folks definitely can do that and there are other transmission technologies which are fast and affordable. In storage, there seems to be no obvious reason - in fact if either of glass or synthetic DNA storage get on the market (and that is not science fiction) then Petabyte storage (at least write once/appeand) could easily be affordable soon too. So then there's processing speed - i think this will probably not happen in a simple way, partly because it probably isn't necessary in personal devices, but I could be wrong - I don't think it would come from neuromorphic hardware because that doesn't have to be fast, it is just massively more efficient than using tensor processors to do operations on neural network graph data structures.
Q - so what about quantum computing? While they don't really work yet, and also are not exactly going to fit in your new rayban specs or wrist-borne device, they might help speed things up (in some possibly rather limited, cases), however...

Many of those hyperscale companies led by personalities are guilty of massive hubris. Because they are hugely successful in one domain, their leaders assume they can repeat that in arbitrary other areas, and I think this has a massively negative effect on anyone else making progress (or even just getting funding) - 4 examples of over-claiming that have happened multiple times in recent years...

Quantum Supremacy - multiple times there have been announcements that embarassingly jumped the gun on people actually making a working quantum computer that actually a) has enough Q bits to be useful and b) has Q bits that don't get overwhealmed by noise or decoherence so fast they are not even a ghost in the machine.
Neural Interfaces - there have been beautiful experiments with bio-feedback systems that are now being used to treat various neurological conditions (and problems with phyisical consequences like Parkinsons) - but having an every day mind-computer interface is still science fiction really. Despite various people jumping the gun.
Metaverse- we've (I myself included) being doing decades of work in immersive virtual reality. It was a staple of cyberpunk SF books 30+ years back (and more). But it is still a mess outside of games. While augmented reality as a tool (e.g. for repairing things or even surgery) will totally be a thing, I think the metaverse (as per Snow Crash, at least, let alone Neuromancer or Altered Carbon or even just the old Star Trek Holodeck) is not with us yet affordably. I think this is more a failure of use case/compelling application than actual technology, however.
AGI - all the money in the world is being spent on better predictive text, but the same people getting the money promise AGI. Hah. Bunch of Marketroids (that's not an insult - they are very good at it given the levels of investment they are getting compared to the total paucity of actual usefulness of their tech beyond prompting governments to worry about their energy grids).

UK government introducing digital id by stealth - without proper inclusion or safeguards as far as I can see...

2025-12-04T02:49:00.000-08:00

If you are a director of a company in England, you will recently have received a letter requiring you to register for this: https://www.gov.uk/using-your-gov-uk-one-login to be able to continue to do some things (e.g. mandatory for directors of companies). https://www.gov.uk/guidance/verifying-your-identity-for-companies-house

When i did this, the website said that you could do this at a post office

if you only had paper documentation (e.g. birth cerfiticates, paper passports etc)

and could use notaries to verify some of the documents.

However, it seems that this alternative may not have ever worked properly,

since now, if you look at the companies house web site guidance,

you have to use the digital service first to input your document details online..

So talking to someone else more recently , they said that this meant they had to step down as a company

director since they did not wish (or perhaps could not) use a digital id upload -

Is this legal? The government is effectively ruling out people from some things in

society if they cannot use the digital service - it appears that companies house forces this, as when you look at the link for verifying your id via the postoffice,

This seems inequitable at least, and possibly completely a massive security danger,

given the risk of leakage of photo id which many online organisations have so

publically experiences in recent years....

Also, the behavoural/liveness checks on the photo-id look naive - when many biometric systems are moving to multi-modal (e.g. face+movement, + speech / speaker recognition when saying random given phrase or face + movement + fingerprint or iris etc etc)

So is the uk government a) introducing digital id by stealth? and worse b)

introducing a centralised, insecure system and c) ignoring any requirements

for inclusivity? and d) without any public debate?

Cloud versus Edge - I think the jury is still out - sort of...

2025-11-20T00:02:00.000-08:00

With recent Cloudflare and AWS (and previous meta outage) it comes down to one simple tradeoff :

On the one side is everyone running an Internet service saves some money by paying a Cloud provider to run the infrastructure for them -

So the cloud outfit get to amortize a lot of costs over all the customers by having a small number of big data centers (numbered in thousands) instead of millions of enterprise computing services run by every tom, dick harry, tescos, twitter, openai, slack, signal, ticketmaster etc etc (all actual examples of people who lost service during aws and cloudflare outages)

As Spidey says "with great power comes great responsibility". So cloud providers not only provision carefully,

but they do actually provide some levels of fault tolerance by providing redundant servers,

and even run some consistency protocols to make sure of a customer's service needs very high availability,

so long as a majority of the duplicate servers are running, the service is ok - this can operate globally,

so even if there's a whole country disconnected (e.g. international fiber cut, or national grid outage, both also real events in recent years), the rest of the world can move on ok...

But it would seem that they don't fully apply this distributed, replicated, fault tolerant/high availability,

possibly somewhat even decentralised thinking to the implementation of their own internal necessary infrastructure - so in AWS and Cloudflare case, the error was central - someone in AWS didn't consider a particular pattern of performance that meant a DNS config (their design is 95% sane) led to a slow server updating DNS and overwriting more recent entries, causing customer services that had needed those new entries to be unable to find them In the Cloudflare case,

a centrally managed configuration file grew in one overnight update by twice the size, exceeding cloudflare services maximum file size constraint (this is actually rather sad in terms of being fairly esasy to prevent by normal system checking/validation processes. The AWS one is slightly more subtle, but not much more. in fact, earlier outages in replicated/distributed services (actually at cloudflare earlier) took PhD level thinking to come up with long term solution - see this paper for one example: Examining Raft’s behaviour during partial network failures

https://dl.acm.org/doi/pdf/10.1145/3447851.3458739

The cloudflare example is also a little reminiscent of the Crowdstrike outage, but that wasn't cloud - crowdstrike has a rulebase for its firewall productsm and microsoft windows is required to allow thir parties to install firewall products (ven though a modern microsoft OS firewall is actually good) - crowdstrike had a bug in a new rule base so when all the windows machines using that product updated their rules, the firewall code (inside the OS, allowed in by microsoft due to anti-monopoly rules) read a broken file, which caused an undetected code bug to triger an exception, and, due to oversight the crowdstrike software engineers had not put in an exception handler, which would have led to a safe exit of that code, so instead the exceptin caused an OS crash (i.e. bluescreen!)...i this case, the central error affected millions of edge systems directly. - and due to the way the s/w update worked, needed a lot of manual intervention by many many people in many organisations..

In a non-cloud setup, you'd have natural levels of diversity in all the millions different enterprise deployments (even if just different versions of things running) so outages would typically be restricted to particular services, but in the cloud setup, an infrastructure outage takes down thousands of enterprises....(I think AWS reckoned about 8000 large cusomters - not sure about Cloudflare but estimates are they run about 25% of the Internet ecosystem's defenses)...

background

AWS explainer

https://aws.amazon.com/message/101925/

Cloudflare explainer

https://blog.cloudflare.com/18-november-2025-outage/

Replication failure during partial network outages

https://dl.acm.org/doi/pdf/10.1145/3447851.3458739

UK government very useful report on data center sustainabilty (has lots of useful statistics):

https://researchbriefings.files.parliament.uk/documents/CBP-10315/CBP-10315.pdf

To be fair to the cloud service folks at AWS and Cloudflare, they found, fixed and publically reported the problems in under a day, so the concentration of resources in cloud also meant a concentration of highly paid, really expert people who can troubleshoot a problem, and once its fixed, the deployment is also quick. on the other hand, a decentralised setup (more like the frowdstrike example) can also be deployed fairly fast if they had been slightly more careful about their s/w update process....

So i'd say clould v. edge, at the moment hard to pick which is more resilient, which is cheaper

prisoner of cellular blockheaded thinking #9

2025-11-13T01:16:00.000-08:00

So the cellular industry is often lauded for its planning and general coherent approach to the world.

Let's remind people how inaccurate that is.

From the get go, Bell Labs & parent AT&T decided, having invented cellular telephony, that there was no market for it. (repeating the famous IBM error that "the world will only need 3 computers, 2 in America and maybe one in England).

Later, people invened the bluetooth stack (yech - serial line emulation, modem control tones, and no sensible mesh mode for ages)

Then blackberry and nokia tried to copy apple's iphone ideas and almost totally tanked what had previously been incredibly smart and succesful industries.

They they caused the fixed telephone service to run out of telephone numbers multiple times.

Most recently, car owners and smart meter owners are finding that the ability to remote access said vehicles and devices is being turned off because providers won't be running edge, gprs (2.5G) Or even 3G pretty soon - so because there's no backwards compatibility until much later generations, goodbye to all those useful services.

And what is 6G actually for, remind me?

My wifi still works ok :-) on IPv4 (only 46 years old) and IPv6 (20+ years old...

AI versus Humanity

2025-10-31T07:17:00.000-07:00

humantity is too stupid to build an AI that would threaten its existence (humanity, not AI's existence...we could easily build a self-destructive AI - see below).

the main reason is logistical. we are rubbish at supply chains - food comes across the planet out of season, for rch folks but bypasses many people on the way who are suffering shortages. we make computing and communications devices that depend on rare earths only available in war zones or our adversaries land.

we throw away working stuff.

Any AI will have to build itself reliable supply chains for replacement parts, software maintenance and energy. To do that, it will need an army of robots (in the origianl sense of robot, Karel Capek's obedient tireless servants). But any humans spotting such a reliable supply chain will immediately take it over and steal from it, ratehr than rely on their own rubbish production line. Capitalism and natural selection mean a race to the bottom - humanity's inferiority will be AIs downfall. The seeds of our digital overlords demise are built in, due to the inherent contradictions in the rules of engagement.

zero inbox wars

2025-10-10T02:15:00.000-07:00

i've had a zero inbox policy since first getting e-mail (late 1970s) - having moved through various systems (roughly once a decade), recently landed on fastmail (which I very very much recommend- extremely fast, but also very easy to migrate to, integrate with other mail systems and calendars (!) and very good support).

so throughout various systems, I've used different tools for managing incoming, and also archives - i have kept all e-mail to/from/cc:d me since 1976:-)

at some point, everything is esentially kept in a bunch of directories (folders) organised with a small (<=3) levels of hierarchy - somewhat like internet name space (.edu .com etc) - with names of people (students) or projects or personal (money, health, house, transport etc)....

not sure what fastmail uses behind the scenes but seems to scale well and has nice rule system for automatically processing incoming too....plus I really like how it interacts with other mail systems ( I have to maintain several outlook accounts for some places I work, and at least one doesn't allow forwarding, but fastmail can pretend to be a client, and make the mail look like it was just got through imap etc)....

anyhow, through various stages (cambridge's ownbrew, then exchange baed system, gmail, and now fastmail) have seen a steady decrease in spam - really very little getting through at all these days (maybe one a day) and very low false positives too ...

but what's left has two things steadily increasing

1/ academic "spam" - e.g. calls for papers, invitations to review, offers to publish my work "for free" (like why would I ever pay?) etc

2/ mandarin - not reading or speaking any version of chinese, I'm assuming this is actually more of 1/ but just for chinese events and publications...

I'm looking for a two stage LLM to deal with those two cases - one "translate", two see if it is relevant - could train the model (or refine/fine tune) based on my own publications or conferences i'm on programme committee for...

maybe a student project!

Anyhow, in my current fastmail setup with 5G of mail, there are only 3 messages in the inbox...soon to be 0.

How to reform a National AI Institute?

2025-07-23T03:21:00.000-07:00

The Reform Club

A lot of people have been busy recently writing plans for the Turing Institute most of which revolve around criticising the pace and direction of changes that it has been attempting over the past 2 years, and several culminating in the trivial proposal to put all the eggs in one basket (defense and security) and use the big stick that the EPSRC has of deciding whether to continue the block core funding for the next 4+ years to "make it so". This got up to ministerial level.

That isn't reform. That's simply asset stripping. Not that the asset isn't a good one - the defense&security program has always been strong (at least as much as we can publically know) - other work had variable mileage, although commercial partners from banking/commerce, pharmaceuticals et al, keep coming back with more money, which suggests that they liked what they got (with their real world hard-nosed attitide to funds, especially in the UK where R&D spend is so typically low from industry). We're talking £1M per annum per partner levels.

Also the Turing managed to hire very strong people, both as postdoc researchers and as research engineers, in the face of fierce competition from the hyperscale companies (that all have significant bases in the UK, e.g. Google&Deepmind based here, Microsoft, Amazon in Cambridge, Meta and X in London, OpenAI has a London lab, etc etc) - as well as quite a reasonable set of opportunities in UK universities in places significantly more affordable than London (or the South in general) - so presumably, the interesting jobs had challenges, and an open type of work, that attracted people who had a choice.

How not to reform a National AI Institute?

Make it defense. You will lose a large fraction of the other money, and the majority of staff will not be allowed to work here. As with medicine, the UK does not have capacity to provide AI expertise at the level needed from UK citizens alone. Those left with real clue will often find it easier to work at one of those companies too. Especially since salaries start at 3 to 4 times as much.

You will lose all the cross-fertilization of ideas that happen in one domain into others especially with the Turing's unique engineeing team acting across all the possible projects, acting as a conduit for tools and techniques.
You will lose all the visitor programmes including the PhD visitors, which benefits students from all over the UK, many of whom would not be allowed to visit.
You'd lose access to much interesting data, which would not be allowed to be given to people that can't transparently say what they will do with it. Legally.
You wouldnt have a "national institute" - you'd just have another defense lab. It might be very good, but no-one would know. In fact, how come there isn't already one, e.g. in Cheltenham? They have plenty of funds?

What's my alternative?

To be honest, I don't have one. The nearest thing I have is what the Turing has managed to do in weather prediction (see Nature papers on Aardvaak and Aurora), and what we did (still do) in Finance & Economics with some very nice work in explainable AI and fraud detection, and synthetic data, which have multiple applications across many domains. Likewise the engineering teams work on data safe havens which is useful in aforesaid finance, but also in practical privacy preserving machine learning in healthcare and any other sensitive domains. And recent work on lean language models. There are quite a few other things one could mention like that.

You can't predict where good or useful ideas will come from. Who knew giving a ton of money to CERN would lead to the World Wide Web? Who knew a Dutch free OS (minix) would incentivise a Finnish gradute student to write Linux (the OS platform on most the cloud, half the smart phones out there)? Who knew that some small TV company (the BBC) request for a simple low cost computer would lead to the founding of ARM (that has more chips in the world than Intel or anyone else - again in your mobile device)? Who knew this neat little paper on Attention is all you need, would lead to all the harsh language about peoples' failure to predict the importance of LLMs (hey some of those people predicted that blockchain might not be a good idea:-) Who knew?

And who knows how to reform a national AI institute?

persistent technology design mistakes that go on giving

2025-07-22T22:23:00.000-07:00

History of technology is littered with design mistakes. Quite a few are mercifully shortlived, and some, deservedly, just don't even see the light of day at all (sadly in some cases as they might be useful lessons in what not to do:-)

some mistakes aren't mistakes - one famous one was the choice between VHS and Betamax video tape formats - this was actually a tradeoff in cost/price/distribution and quality - in the end it didn't really matter.

Others somehow survive, grow and persist, and persist in having negative consequences for decades -

In Computer Science, these are things like the C++ language....enough has been written about that and alternative histories (if only Algol 60, or if only Objective C or if only people had written better compilers and libraries for functional languages earlier) -

In Communications Systems, two examples I'd pick: Bluetooth, and Edge.

An early workshop (25+ years ago) in california had presentations on bluetooth and explained why they had made it look like a serial lines (think COM3 on a windows box), stemming from everything lookling like circuits, or telephones to the folks who made this up. And Edge was made reliable (local retransmissions) to mask wireless packet loss, when everything should just "look like an ethernet", as I think Craig Partride put it. The cellular data folks eventually got it (and have done a number of other things much better than wifi), but bluetooth's horribleness persists, partly because the broken architecture was baked in to early software stacks, and is very very hard to persuade people to just ditch. In the IoT world, this led to a plethora of other wireless technologies (zigbee, lorawan etc) at least partly so people could avoid that fate, although there were other goals too.

Anyhow, we avoided being stuck with X.25, but we are stuck with QWERTY. We avoided being stuck with VHS, but we are still stuck with IPv4.

a brief history of asking for forgiveness versus permission - the napsterisation of AI...

2025-07-12T05:23:00.000-07:00

Back in the day, the Jesuits used to say that it was better to ask forgiveness than permission. I think that this may refer to the idea that people may have committed minor errors without knowing what they did was wrong, so they were less blameworthy, especially if, after the event, when the priest or other wise person explained to them the error of their ways, they recanted and were forgiven. To ask permission implies that the answer might be "no".

So now we are in a. world where people are being paid to run stuff like this legitimised botnet, effectively becoming part of a P2P file sharing world. Once upon a time (a generation ago, or almost infinitely in the past) if you ran a thing like this (the Napsterised Internet) you would get sharp letters from lawyers or even just be fined by the copyright infringement police.

Post Napster, Google acquired Youtube and took an interesting step...they basically took the Jesuit line, with a vengance - the trick was that Google went and did massive deals with all the large copyright owners (actually paying quite serious money) and then if you or I uploaded something already covered by that agreement, then no problem. If we uplaoded something not yet covered by the agreement, Google had an offer - they could offer advertising revenue, or possibly market research (popularity metrics), or as a last resort, take down the content.

While the large copyright owners have not been the best of friends to the artists who actually create stuff, this was at least semi-legitimate (I'm not a lawyer, obviously, but it seems to follow the aforesaid Jesuit model, and that has history behind it:-)

Now we have all those GenAI tools trained on a lot of content that is avaialble on the non pay-walled Internet - this does not mean it isn't copyrighted. The AI/LLM companies are notably trying to claim fair use type arguments (which search engines 20 years ago most notably did not) - the difference may reflect a change of culture, a shift in legal interpretation (of say fair use) or perhaps simply a shift in power (AIs owned by companies that have a larger market cap than the GDP of most countries).

At least one of those AIs is run by the aforesaid search engine company. But others are not, and don't necessarily have search, and certainly don't appear to have done the large deals for content with those big copyright owning companies...

So the game is afoot... ... ...

from dyson to dolby

2025-07-02T04:37:00.000-07:00

was at a thing in imperial college in their dyson center then went to the new cavendish (3.0) lab in cambridge's new dolby building, and got worried about the idea that these might both be about noise cancellation. obviousdly vacuum cleaners make sure that your vacuum is really really high quality so sound won't propagate at all, and dolby is all about boosting the signal and reducing the noise

but what happens if we combined these technologies, I hear you cry. Actually, I don't because that would be like the eponymous xenomorph after ripley kicks it out of the nostromo. All you can hear is the irritating sound track music.

LLMs: Social Engineering for Dummies

2025-06-16T07:19:00.000-07:00

Has anyone tried to get GenAI to build Cyberattacks based on fooling humans via cognitive biases etc?

I mean it seems like prompt engineering really good suggestions for non technical (i.e. psyops) would be a really good test, especially for agentic AI systems, to see if they had even a half baked theory of mind...

don't say you havn't been warmed...

autonomy & a commons...

2025-05-21T03:39:00.000-07:00

for entities to have autonomy (i.e. be agentic) they must operate in some sort of decision space that is disjoint from the other entities with which they interact - conversely, if their decisions are fully specified by those other entities, then they are by definition not autonomous..

we could call this an agentic commons

Binding Real Live Human to Virtual Identifier with data minimisation - Ф-ML-Id

2025-05-21T00:09:00.000-07:00

People use hashes of biometric data embedded in identifier (digital documents) so then the verifier can re-acquire the biometric and (hopefully, in a privacy preserving way) verify the person standing in front of the camera is a) live and b) the same as the one in the document - this is so 20th century

Why not have a behaviour that actually relates to the verifier's actual domain requirements - say they want to check this person is allowed to drive - perhaps for renting a car - so they could measure the person's driving style, which could also be stored in their digital driving license shard of their identity. This could be very robust - graduatlly improves in uniqueness -- and also stops people re-linking across different domains when people use the same feature/behaviours for multiple roles (face, being a common example) - we can also incorporate increasingly robust defenses against GenAI building deepfake people (via deepfake 3D faces or voices) -

The verifier would then be an agentic AI which basically has two things, a measurement model (classic ML) and a physics model (of what people can do) - so now we have Ф-ML-Id...

[lots of good background on Ф-M from the Turing's interest group]

The more borng application is in multimodal biometric systems tat use face+voice+lipsynch on given phrases to work out who someone is and that they are alive- same thing - physics model predicts face movement and voice audio given text input, then verifies against previously onboarded voice&Ф-M 3D face model.

In facial reconstruction (e.g. taking oliver cromwell's skull and building up a picture of what he actually looked like in person) there's a model of how flesh covers bone - this includes models of the way the stuff we are made of folds/hangs/ages etc - these models are from first principles. Unlike deep learning systems trwained on lots of labelled data to classify faces and extract features (eyes nose mouth etc) purely based on statistics and luck, these physics models are correct because that's the way the universe works. You can build hybrid, so called Ф-ML systems, which give you benefits of both the statistical approach, and the explainability of the physics model - the cool recent work also shows that the physics model lets you reduce the amount of data necessary for the statistical model - sometimes by 2 or 3 orders of magnitude, and retain the accuracy and recall of the stats model.

In the world of biometric id, there are requirements from applications (many use cases are with a human waiting for verification of some attribute) that mean you want fast, accurate and efficient models that can run on affordable devices in tolerable time.

You also want to be future proof against deep fake, and also against adversaries with access to complex system wide attacks.

I claim that having these underlying real world explanatory component, alongside the statisitcally acquired twin, will be more resilient, and might even let you cope with things like kids and aging and other changes, as well as allowing you to verify attributes other than the standard biometrics of face, fingerprint, iris etc, in robust ways which provide better domain specific data minimisation.

Reflections on Distrusting Decentralization

2025-05-16T06:12:00.000-07:00

with apologies to ken thompson

I keep hearing a lot about how decentrlised systems can solve the massive loss of trust we are witnessing in large scale central organisations (technological, like hyperscale companies, and social. like national governements, and even economic, like banks).

I call 100% bullshit on this

(said in Natasha Lyonne's great voice, maybe we could ask her to do an audio book of Frankfurt's book on bullshit, or even graeber's book on bullshit jobs).

not that many people havn't lost trust in central agencies. that's been a factor of life for millenia - probably amongst many creatures, not just humans - i imagine the first deer to get hunted by humans suddenly found another threat to add to their collection. and tribes and nations invaded suddenly, or betrayed by "friends", "family", states in treaties etc. or banks going broke. or rivers and mines drying up.

sure, central things can lose trust, and, as the cliche has it, they will find it much harder to regain.

(though notice, people don't say often that it will never get regained, just that it is a lot harder to gain than to lose).

so what about decentralised systems and trust?

well, the answer is built into that oxymoron. a "system" isn't decentralised. it represents, at minimum, an agreement to follow a set of rules. Who makes that agreement? Answer, the participants. They buy into the API, the protocol, the rules of engamgent and behaviour. And they can renege on it. They can stop obeying the law, they can start a run on the bank, they can over-graze the commons, they can get together and gang up on a smaller group, they can go amok, or form a mob, or start a revolution.

At some point the decentralized system must have some central component, even if it is virtual.

So why would I trust such a system more than a central system? I don't think I should. The problem is that there are no coordination points which means if I am in that minority, even just one person being ostracized, I have no redress, no recourse to recompense. There's no resource I can point to, to offset the misuse of decentralized power by the unruly mob.

In syndics (socially anarchic systems), there are meta-rules of engagement that are supposed to mitigate misbehaviour. for example, you are not supposed to engage with people (nodes in the net) with whom you have no overlapping interests (i.e. no resource conflict, no need to enage). If you do, then the metarule makes that mishaviour in everyone's interest. Now there should be a people's court to try the misbehavioung group and decide on a suitable redress (which might be ostracism) - sounds tricky? yup. did it ever work? Maybe for a short time in Catalonia a long long time ago.

How would that work in a distributed communications (or energy) system? Not well if you ask me. we only have one "space" for comms, and we only have one planet. There's got to be some root of trust (or multiple roots is fine), so you can anchor redress (for example).

Of course, you can build a hierarchy, which at the leaves looks like a decentralised system, but really, what you have is federated.

uncanny cycle, hype valley - you choose

2025-05-04T02:50:00.000-07:00

The cliche of the tech world is to trot out the infamous Gartner Hype Cycle and no-where is this more prevalent than in AI, post Chat-GPT (to be fair, post "Attention is all you need").

But the other slightly less worn hype is the phase that embodied (or perhaps even virtual) AI is said to go through, which is the Uncanny Valley

So where do these two curves collide, eh? Right about now...

zero trust

2025-04-28T01:35:00.000-07:00

I'm pretty sure this sort of thing happens because of bad parenting - people that don't trust anything come up with the idea of distributed systems that have no need of any anchor for trust anywhere.

so i have lots of problems with this - starting from systems - and the classic ken thompson reflections on trusting trust. These deecntealization extremists have to confront that they run software on hardware, and even if they build their own hardware and write their own software, they probably use an OS and a Compiler from somewhere else. However, it gets worse. Why should we trust the actual zero knowledge protocols they use? who has verified them, and how? why do we trust those verfication tools (peoples' brains too). And worse still. Why should we trust this new fangled idea zero. The Romans and Greeks and ancient mesopotamians got along fine without it.

No. I have zero trust in zero trust.

powerless trio

2025-04-26T05:55:00.000-07:00

Three things we have that are between 50 and 100+ years old, still work, and do not require electricity - classic portable type writer from Remington and a Singer 66K sewing machine from 1917

and an HMV wind up record player from the 1930s....

when civilisation collapses, we'll still be able to write letters, fix clothes and listen to some old 78s!!

Artificial Intelligences as Trusted Third Parties (AI as TTPs)

2025-04-22T04:16:00.000-07:00

AI as TTPs is a recent posting by Bruce Schneier who has impeccable security credentials.

However, I'm not convinced that the paper he is highlighting is as groundbreaking as he is.

The authors of the paper also have great track records and include AI, but I think they're missing something basic that means that a single or group of TCMEs ("Trusted Capable Model Environment") can't actually do anything different than any other computation, subject to basic privacy controls (e.g. access control authorisation, auditing, encryption of data at rest, in transit, during computation (e.g. using FHE and TEEs etc etc).

But also:

a) visible communication in/out of the computation - i.e. information flow control

b) control over specificity of that data (i.e. differential privacy - can you tell if an individual record is present or not, to put it. crudely)

c) secure multiparty computations and zero knowledge systems

which the paper compares and contrast with their new TCME notion. However, I think the dimensions they use for comparison are a bit of a stretch.

The main problem I think is that the TCME seems to be indistinguishable from any other trusted program.

Any shared secret between models (e.g. federated or decentralised learning) is just the same for AI/ML as for any other algorithm. Perhaps the intersection of probability distributions looks a bit different to juse being able to say "the richest person is A" without knowing how rich A (B, or C) actually is - but in the end, the distribution has some moments and can be described by some number of those more or less precisely - a distribution of distributions can be aggregated with more or less precision or uncertainty (e.g. respecting differential privacy, and some widest level, or preventing set membership inference at the finest grain) - the model itself can be protected from outside model inversion attacks by various schemes, but I don't see what TTP function is provided that isn't just a different mix of existing techniques for providing trust.