FLORA Academy
EDITOR
TEXT MODELS
IMAGE MODELS
Gemini 2.0 Flash
Topaz
Magnific
Seedream
Flux Dev
GPT-Image (OpenAI)
Nano Banana aka GPT 2.5 Flash Image (Google)
VIDEO MODELS
GPT-5 (OpenAI)
Advanced multimodal language model for complex reasoning, orchestration, and multimodal tasks.
Quick facts
Modes: Text → Text · Image -> Text · Video → Text · multimodal features.
Default output / size: TBD
Aspect ratios: N/A
What it’s great for
Complex coding, reasoning, and orchestration tasks.
Video → text transcription & summarization.
Multimodal pipeline orchestration.
Example outputs (3-column)
Transcript sample | Code generation snippet | Summarized notes |
Copy-and-paste prompts
Summarize this 30s product demo video into a 3-bullet marketing blurb.
Generate React + Tailwind component for a hero card with a split layout and CTA.
Extract timestamps and scene descriptions from this video: [video URLs]
Parameters
Name | Type | Default | Notes |
| string | — | Required |
| string | TBD | Model selector |
| int | TBD | TBD |
| float | TBD | Controls creativity |
Modes
Mode | Estimated time | Required inputs | Typical use |
Text → Text | TBD |
| Code, summaries, long-form text |
Image -> Text | TBD |
| Image description |
Video → Text | TBD |
| Transcription & summarization |
Output options
Option | Values / notes |
Formats | Text / JSON |
Notes | Advanced reasoning & tooling: TBD |
Prompt tips
Be explicit about length and format (bullets, code block, JSON).
Provide context and any relevant assets.