Words mean something and just because the result is similar doesn't mean it's the same thing. Creating a photorealistic painting doesn't make it a photograph just like a photograph of a painting is not a painting.
An AI tool could construct a virtual scene and then take a simulated photograph or make a simulated photorealistic painting. While I would prefer that people always use words like virtual or simulated to describe it, I think it’s likely that it will often get dropped. I’m thinking of the camera in metal gear solid. At some point people get used to something and start using less precise language.
That's as much as a photograph as someone manually creating a scene in blender and running it through the Pixar Render farm or whatever to get an ultra realistic scene - as the sibling comment stated what you have is a render
Similarly the camera in MGS would be a screenshot, as in a shot of the screen
It has virtual brushes. This predates AI by quite a lot. It's important to be careful to call it a virtual or simulated painting but perhaps not all the time. Maybe every single instance it's brought up but not always repeated – if you wrote a paper about virtual painting perhaps 10 of 100 occurrences of the term painting would come with a term like virtual or simulated.