i was actually thinking about that, could be even more cool now with modern text to speech and whisper and some funky word based encoding with huge dictionary like:
teacher: 0b00010010101001, school: ...
and then the website can encode the data as a sentence and just text to speech it and the receiver can use whisper to speech to text and decode
will be the most creepy thing because it can be very steganographic and sound like a real sentence