Yes. We are SOC 2 certified and strictly limit who can access that data and under what circumstances (only with the user's explicit permission). We definitely understand that this could be a deal breaker for some, but with current methods, we believe this approach will allow us to deliver more powerful search and greater value in the long run.
This means you store most data in plaintext in the index? Enough to reconstruct most of the content?