Content-based indexing (as in IPFS) handles that. Plus the data gets cached in the starlink you locally accessed, in your example.
It might help to think about IPFS as a public CDN operating on an open specification protocol and an open-source implementation.