IVQ 651-700


Section 66: Embedded LLMs & Local Inference (10 Questions)

  1. What are the tradeoffs of running LLMs in embedded systems vs. in the cloud?

  2. How would you deploy a quantized LLM on a mobile device?

  3. What’s the role of 4-bit or 8-bit quantization in edge deployment?

  4. How do you minimize memory footprint without degrading generation quality?

  5. What’s the difference between ONNX and GGUF for local model deployment?

  6. How would you cache results effectively for offline GenAI tools?

  7. How do local models like Phi-2 or TinyLlama compare to GPT-3.5 for constrained apps?

  8. How can you do RAG with a vector DB on-device?

  9. What architecture supports synchronized updates between edge and cloud models?

  10. How do you design fallback mechanisms for low-power GenAI clients?
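A strong answer to question 3 above can be grounded with a minimal sketch of symmetric per-tensor 8-bit quantization: weights are mapped to int8 with one scale factor, cutting memory 4x versus float32 at the cost of a bounded rounding error. All names here are illustrative, not from any specific runtime.

```python
# Symmetric per-tensor int8 quantization: w_q = round(w / scale),
# with scale chosen so the largest |w| maps to 127.

def quantize_int8(weights):
    """Return (int8 values, scale) for a list of float weights."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize_int8(q, scale):
    """Recover approximate float weights from int8 values."""
    return [v * scale for v in q]

if __name__ == "__main__":
    weights = [0.5, -1.2, 0.03, 0.99, -0.41]
    q, scale = quantize_int8(weights)
    restored = dequantize_int8(q, scale)
    max_err = max(abs(a - b) for a, b in zip(weights, restored))
    # Rounding error is bounded by half a quantization step.
    print(f"scale={scale:.5f} max_err={max_err:.5f}")
```

The same idea extends to 4-bit (question 3) by shrinking the integer range to [-7, 7], which roughly doubles the per-step error in exchange for another 2x memory saving.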


Section 67: Self-Improving & Adaptive Systems (10 Questions)

  1. What is meta-prompting and how can it help GenAI self-improve?

  2. How do LLMs assess the quality of their own outputs?

  3. What is self-refinement in the context of LLMs?

  4. How do you evaluate the effectiveness of self-critique loops in agents?

  5. How can a model rewrite its own prompt to better serve a user?

  6. What techniques allow models to learn from user outcomes without labeled data?

  7. How would you implement prompt evolution based on continuous feedback?

  8. What are policy-gradient style updates in self-improving agents?

  9. How do autonomous agents identify areas where they need external help?

  10. What are the safety concerns with fully self-improving GenAI systems?
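For question 3 above, a candidate can sketch the generate-critique-revise skeleton of self-refinement. The `generate`, `critique`, and `revise` callables below are hypothetical stand-ins for LLM calls; injecting them keeps the loop testable offline.

```python
# Self-refinement loop: draft an answer, ask a critic for feedback,
# revise until the critic is satisfied or the round budget runs out.

def self_refine(prompt, generate, critique, revise, max_rounds=3):
    """critique() returns None when satisfied, else a feedback string."""
    draft = generate(prompt)
    for _ in range(max_rounds):
        feedback = critique(draft)
        if feedback is None:  # critic accepts the draft
            return draft
        draft = revise(draft, feedback)
    return draft  # budget exhausted; return best effort

if __name__ == "__main__":
    # Toy stand-ins: the "critic" demands a trailing period.
    gen = lambda p: "hello"
    crit = lambda d: None if d.endswith(".") else "add a period"
    rev = lambda d, f: d + "."
    print(self_refine("greet", gen, crit, rev))  # -> hello.
```

The `max_rounds` budget is the key safety lever (question 10): without it, a self-improving loop has no guaranteed termination.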


Section 68: Domain-Specific Reasoning (10 Questions)

  1. How do you tune an LLM for high-stakes financial advice scenarios?

  2. What guardrails are needed for GenAI use in legal document drafting?

  3. How would you validate medical LLM output for diagnosis support?

  4. How do you design prompts for code generation in embedded systems vs. full-stack apps?

  5. How would you evaluate an LLM's ability to summarize legal case law?

  6. How can GenAI support pharma research documentation workflows?

  7. What evaluation methods are best for academic LLM assistants?

  8. How do you incorporate domain-specific taxonomies into GenAI models?

  9. What are typical risks of using GenAI in scientific writing?

  10. How do you extend GenAI to support patent drafting or technical IP filings?
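Question 8 above can be illustrated with a minimal sketch of taxonomy injection: terms detected in the user's question are expanded into their category paths and prepended to the prompt, steering the model toward controlled domain vocabulary. The taxonomy entries and template are invented for illustration.

```python
# Inject matching taxonomy paths into a prompt before calling the model.

TAXONOMY = {  # term -> path from broad category to subcategory (illustrative)
    "myocardial infarction": ["cardiology", "ischemic heart disease"],
    "atrial fibrillation": ["cardiology", "arrhythmia"],
}

def build_prompt(question, taxonomy):
    """Prepend taxonomy paths for every term mentioned in the question."""
    mentioned = {t: path for t, path in taxonomy.items()
                 if t in question.lower()}
    if not mentioned:
        return f"Question: {question}"
    lines = [f"- {term}: {' > '.join(path)}"
             for term, path in sorted(mentioned.items())]
    context = "Relevant taxonomy entries:\n" + "\n".join(lines)
    return f"{context}\n\nQuestion: {question}"
```

In a high-stakes setting (questions 1-3), the same lookup can run in reverse as a guardrail: reject or flag outputs that use terms outside the approved taxonomy.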


Section 69: Voice, Video & Multimodal GenAI (10 Questions)

  1. How does Whisper differ from traditional speech-to-text systems?

  2. How do you chain audio transcription with LLM summarization workflows?

  3. What are key considerations when adding TTS to a GenAI agent?

  4. How do you align visual cues with LLM-generated video scripts?

  5. What are good ways to caption real-time streams using GenAI?

  6. How do you create GenAI workflows that mix voice, gesture, and text inputs?

  7. What’s the role of VLMs (Vision Language Models) like Flamingo or GPT-4V?

  8. How do you prompt image-generation models like DALL·E or Midjourney consistently?

  9. What are video editing use cases where GenAI can save time or cost?

  10. How would you generate a narrated video from a multi-step document?
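For question 2 above, the core design is a chunked pipeline: transcribe fixed-length audio segments, keep their timestamps, then summarize the joined transcript. The `transcribe` and `summarize` callables are hypothetical stand-ins for, say, a Whisper call and an LLM call; injecting them keeps the pipeline testable offline.

```python
# Transcription -> summarization pipeline over fixed-length audio chunks.

def transcribe_and_summarize(audio_chunks, transcribe, summarize,
                             chunk_seconds=30):
    """Return ([(start_s, end_s, text), ...], summary) for the chunks."""
    segments = []
    for i, chunk in enumerate(audio_chunks):
        text = transcribe(chunk)
        start = i * chunk_seconds
        segments.append((start, start + chunk_seconds, text))
    transcript = " ".join(text for _, _, text in segments)
    return segments, summarize(transcript)
```

Keeping per-segment timestamps rather than only the flat transcript is what later enables captioning (question 5) and aligning visuals to the script (question 4).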


Section 70: Future Architectures & Emerging Ideas (10 Questions)

  1. What is the concept of “mixture of experts” and how does it scale LLMs?

  2. How does the Recurrent Memory Transformer differ from vanilla Transformers?

  3. What are state-space models (SSMs), and why are they promising?

  4. How do you compare Mamba, RWKV, and Transformer-based models in real-time use?

  5. What is FlashAttention and why does it matter for LLM performance?

  6. How do sparsity techniques reduce compute without reducing accuracy?

  7. What’s the role of retrieval-as-first-class-citizen in future GenAI stacks?

  8. What is the importance of modularity in model and system design?

  9. How will agent-based modeling evolve GenAI toward autonomy?

  10. What is your prediction on the convergence of LLMs and cognitive architectures?
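Question 1 above comes down to top-k gated routing: only k of N experts run per token, so parameter count scales with N while per-token compute scales with k. The sketch below uses plain functions as experts and fixed gate logits; a real MoE layer learns both.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of logits."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def moe_forward(x, experts, gate_logits, k=2):
    """Run only the top-k experts and mix their outputs by
    renormalized gate probabilities."""
    probs = softmax(gate_logits)
    top = sorted(range(len(experts)),
                 key=lambda i: probs[i], reverse=True)[:k]
    total = sum(probs[i] for i in top)  # renormalize over selected experts
    return sum(probs[i] / total * experts[i](x) for i in top)
```

With two equally weighted experts selected out of three, the output is just their average; the third expert contributes no compute at all, which is the scaling argument the question is probing for.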

