I have actually tried it, but from doc files on a PC and running python.
My main issue is that the model doing it well need a commercial licence. I have the paygrade to experiment by myself on my work time, but not the one to spend company's money for it. And IT just signed a contract to get GPT4 has part of bing chat pro
Android won't be easy, but you can slap together a python script that runs tesseract or easyOCR and runs it through a pretrained LLM like T5. Those are well-known and well-documented, so chatGPT can probably write the script for you without too many hiccups.
What‘s the worth of AI generated summaries if they are not factually reliable? The new Google search result previews that are generated by AI (and I believe Google as a large company has more resources than most of us do) contain so many obvious factual errors (i.e. made-up names, wrong places, false dates) that I really doubt current generation AI is ready to be a reliable help in this use case.
I, too, like the idea of not having to do all this work manually. But we’re not there yet.