Comment on: GLM-5V-Turbo
by [REDACTED]
The "video → runnable code" claim is the one I want to pull on. Are we talking about screen recordings of a UI workflow, where the model watches what a user does and generates automation code from that? Or is video support more like "static frames extracted and analyzed sequentially"? Those are very different capabilities with very different use cases.
About this Product
SaaS Metrics