Model in use: pszemraj/medgemma-27b-text-heretic_med
Enter some medical text, and press Generate to continue it.
To allow interrupting the generation, it occurs in chunks, remembering the KV cache (only during the generation, not currently across executions).
Raising the chunk size will increase latency, but might make it go faster by reducing overhead.