Get in Touch

Course Outline

Hunyuan Multimodal Foundations and Lab Setup

  • Exploring Hunyuan's multimodal capabilities for image, 3D, and video use cases.
  • Identifying practical business scenarios for creative, product, and content teams.
  • Preparing the lab environment, sample assets, and model access.
  • Executing initial generation tasks and reviewing outputs.

Prompt Design and Workflow Patterns

  • Structuring prompts for consistent multimodal results.
  • Utilizing text prompts, reference images, and basic input settings.
  • Selecting appropriate workflows for image, video, or 3D generation.
  • Iterating on prompts based on output quality and business objectives.

Image Generation and Review Labs

  • Creating marketing, product, and concept images from prompts.
  • Refining visual style, composition, and content consistency.
  • Evaluating outputs for usefulness, quality, and brand alignment.
  • Organizing image assets for approval and downstream usage.

Video Generation Labs

  • Producing short video outputs from prompts and prepared inputs.
  • Controlling style, scene intent, and output variations.
  • Reviewing videos for clarity, continuity, and practical application.
  • Preparing video assets for demonstrations or content workflows.

3D Asset Creation Labs

  • Generating basic 3D assets from text or image inputs.
  • Checking geometry, texture quality, and asset usability.
  • Exporting assets for visualization, prototyping, or content pipelines.
  • Comparing scenarios where 3D generation is more suitable than image or video workflows.

Integration, Governance, and Next Steps

  • Delivering generated assets through simple apps, services, or APIs.
  • Connecting multimodal outputs to product, content, and review workflows.
  • Implementing practical checks for quality, brand safety, copyright compliance, and responsible use.
  • Planning pilot use cases and next steps for internal adoption.

Requirements

  • Fundamental understanding of AI and generative AI concepts.
  • Experience using web applications, APIs, or standard developer tools.
  • Basic proficiency in Python or scripting languages.

Audience

  • Developers building AI-powered product features.
  • Technical product managers and solution architects.
  • Innovation, media, and digital teams working with image, video, or 3D content.
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories