What Apple and Google Are Doing to Your Push Notifications

Apple and Google have turned push notifications into a curated, AI‑driven experience. The article traces the evolution from simple delivery services to on‑device summarisation, ranking and rewriting, compares the push pipeline to email’s earlier transformation, and offers practical advice for marketers facing opaque platform edits.

The hidden editor behind every push

When a developer sends a notification, the payload first travels through a cloud service owned by the device maker – APNs for iOS or FCM for Android. From the start those services could throttle or drop messages, but for most of the last decade they acted as a transparent pipe. Over the past five years the pipe has been refitted with on‑device language models that rewrite, group and sometimes suppress the very text the sender crafted.

What Apple and Google are doing to your push notifications | Jacques Corby-Tuech Notification summaries on Android

Why the pipe exists: battery and attention

In 2009 Scott Forstall warned that a phone could not keep a separate network poll open for every app. The answer was a single TLS channel to Apple’s servers, later mirrored by Google’s Cloud‑to‑Device Messaging, Google Cloud Messaging and finally Firebase Cloud Messaging in 2016. The design saved battery, but it also gave the platform a choke point where it could decide what the user actually sees.

Fifteen years of incremental control

Year	Platform	Key control mechanism
2017	Android 8 (Oreo)	Notification channels – developer‑declared importance levels that users can mute per channel
2021	iOS 15	Focus, Scheduled Summary, four‑level interruption taxonomy (passive, active, time‑sensitive, critical)
2022	Android 13	Runtime permission POST_NOTIFICATIONS – explicit user grant required
2024‑25	iOS 18 / Android 14	On‑device summarisation and ranking powered by Apple Intelligence and Gemini Nano

Each step shifted a slice of control from the sender to the user, and another slice to the platform’s opaque model. The platform’s motivation is not purely user‑centric; a tidy notification surface improves retention, reduces uninstalls and showcases the OS‑level AI.

Push follows the email playbook, but with fewer lights

Email providers have long run classifiers that decide whether a message lands in Primary, Promotions or Spam. When Apple introduced Mail Privacy Protection in iOS 15, the open‑pixel metric vanished, forcing marketers to rely on clicks and conversions. Push is now at a similar stage: the on‑device model decides whether a notification is shown, summarised, or hidden, and developers receive almost no telemetry about those decisions.

Key differences:

No universal receipt – APNs and FCM confirm only that they accepted the payload; they do not tell you if the device displayed it.
No persistent inbox – a notification disappears from the tray once dismissed, leaving no searchable archive.
Limited payload – only a few fields (title, body, image) are available for the model to work with.

How the on‑device models work

Apple’s on‑device foundation model (≈3 billion parameters) runs a summarisation adapter that receives the combined text of groupable notifications and emits a single line marked with a sparkle icon. The adapter is a small LoRA‑style module that can be swapped without retraining the base model. When the BBC complained about fabricated headlines, Apple disabled the feature for News apps, added italics to AI‑generated text and gave users a per‑app toggle.

Google’s counterpart, Gemini Nano, lives inside the AICore service introduced in Android 14. It shares weights across authorised apps, runs on the device’s NPU/GPU/CPU and uses Low‑Rank Adaptation to specialise for tasks such as notification organising and smart reply.

A flow chart illustrating the architecture of an AI system, highlighting the flow of data and processing steps from the 'Client app' to the 'Service' component, including 'Input safety signals', 'Output safety signals', 'Weights' and 'Runtime' Gemini Nano architecture

Both stacks expose a hierarchy of decisions:

User‑controlled filters – Focus mode, Do‑Not‑Disturb, channel muting.
Platform ranking – learned per‑app scores that can promote or demote a notification.
Summarisation – if the notification is groupable, the model rewrites the title/body.
Delivery – the final payload reaches the lock screen or is dropped silently.

What users actually do with notifications

Empirical work from the CHI and UbiComp communities shows a consistent pattern: most notifications are ignored, a minority trigger immediate action, and the perceived value depends heavily on the source. Messaging alerts rank highest, promotional alerts rank lowest. A 2025 study of 800 billion pushes across 10 000 apps reported an average opt‑in rate of 61 %, down from 85 % a year earlier, after Android 13 made the permission explicit.

When users disable a channel or a summary, the underlying text reappears unchanged – the model only intervenes when the default settings are active. Because most people never tweak those defaults, the summariser’s output becomes the de‑facto view of the notification.

Metric	What you actually know
Accepted by platform	APNs/FCM response code – payload reached the cloud service
Delivered to device	Occasionally reported by FCM callbacks; not reliable
Shown on screen	Only observable via SDK hooks (e.g., `UNNotificationServiceExtension` on iOS)
Open / tap	SDK‑recorded tap events – but these are a biased subset, filtered by the platform’s relevance model

No API tells you whether a notification was bundled into a summary, placed in Android’s Promotions section, or silently dropped because the device’s storage limit was hit. The only reliable “middle‑of‑funnel” signal on Android is the delete‑intent that fires when a user swipes a visible notification away – a narrow win that still leaves most edits invisible.

Writing for a model you can’t see

Because the summariser keeps factual nuggets and discards brand‑voice flourishes, a practical rule of thumb is:

Lead with the fact – “Your order ships tomorrow at 10 am” survives; “Great news! 🎉 Your order is on its way!” often does not.
Keep titles concise – the first few words are what the model extracts for the summary line.
Use structured data – where possible, expose key fields (price, ETA, count) so the model can surface them directly.
Avoid time‑sensitive level for marketing – iOS will prompt the user to disable it if taps are low, and the toggle is per‑app, not per‑campaign.

Shifting weight to owned surfaces

Push should be reserved for two scenarios that the platform cannot replace:

Re‑engagement of dormant users – only push reaches a device that hasn’t opened the app in weeks.
Transactional or time‑critical alerts – live activities, delivery updates, security codes.

All other communication (cross‑sell, education, onboarding) can be moved to in‑app inboxes, persistent message centres, or contextual UI elements that the developer fully controls and measures.

The future: notifications that act for you

Both Apple and Google are exposing their on‑device models to third‑party apps (Apple’s Foundation Models framework, Google’s ML Kit GenAI). The next logical step is for the model to act on a notification: dismiss it, open the app, or even draft a reply. Patents from Microsoft and Google already describe such “digital assistant” behavior, and early experiments in Siri and Gemini show prototypes of auto‑completion for common actions.

If agents can complete the user’s intent without a tap, the classic click‑through metric will lose relevance. Marketers will need to expose the underlying action (via App Intents on iOS or App Actions on Android) so the model can invoke it directly.

Practical checklist for 2026

Reserve push for essential, user‑initiated triggers – price‑drop alerts, back‑in‑stock notices, status updates.
Ask permission in‑context – after the user experiences value, not on first launch.
Segment aggressively – send only to users who have demonstrated interest; generic blasts are now filtered out by the platform’s relevance model.
Write fact‑first titles – test by truncating the title to 4‑5 words; if the meaning remains, the model will likely keep it.
Instrument every in‑app surface – capture render, dismiss and interaction events; treat push as the gateway, not the destination.
Monitor SDK‑level delivery – use UNNotificationServiceExtension or Android’s NotificationListenerService to confirm that a payload reached the OS.
Pool campaign data – because individual push metrics are noisy, aggregate over weeks and look for trends in downstream conversion rather than raw open rates.
Prepare for agent‑driven actions – expose key flows as App Intents/App Actions so future AI assistants can complete them.
Stay aware of platform changes – Apple may toggle summarisation per‑app; Android OEMs may ship custom summarisation stacks (e.g., Samsung One UI 8.5). Subscribe to the developer blogs of both ecosystems.
Accept the new reality – the channel is no longer a pure broadcast medium; it is a curated, AI‑mediated surface. Success will belong to those who align their messaging with the platform’s relevance criteria rather than trying to outrun it.

References (selected):

Apple Developer docs on APNs registration: https://developer.apple.com/documentation/usernotifications/registering-your-app-with-apns
Firebase Cloud Messaging guide: https://firebase.google.com/docs/cloud-messaging
Android notification channels: https://developer.android.com/develop/ui/views/notifications/channels
Batch guide to iOS time‑sensitive level: https://help.batch.com/en/articles/5543431-understanding-and-managing-ios-15-time-sensitive-interruption-level
Pushwoosh study on Android 13 opt‑in impact: https://www.pushwoosh.com/blog/android-13-opt-in-rates-research/
Batch 2025 push benchmark: https://batch.com/ressources/etudes/benchmark-notifications-push-crm-mobile
Apple Intelligence technical report (July 2025): https://arxiv.org/abs/2507.13575
Gemini Nano documentation: https://developer.android.com/ai/gemini-nano
Microsoft patent on notification augmentation: https://patents.google.com/patent/US11340963B2/en
Google patent on notification prioritisation: https://patents.google.com/patent/US8707201
CHI 2014 large‑scale mobile notification study: https://weberdo.com/publications/2014-Large-Scale-Assessment-of-Mobile-Notifications.pdf
UbiComp 2016 PrefMiner paper: https://pure-oai.bham.ac.uk/ws/files/29201286/PrefMiner_Mining_User_s_Preferences_for_Intelligent_Mobile_Notification_Managemen.pdf