You are viewing a preview of this job. Log in or register to view more details about this job.

Multimodal AI Engineer Intern

Responsibilities

1. Model Development

• Develop and optimize deep learning models for multimodal data (text, image, audio, etc.).

• Fine-tune and customize existing multimodal large models (e.g., CLIP, DALL·E, GPT-4 Vision).

2. Data Processing

• Collect, clean, and annotate multimodal datasets to provide high-quality input for models.

• Build preprocessing pipelines and extract features from multimodal data.

3. Performance Optimization

• Adjust model parameters and architectures to enhance performance.

• Implement techniques such as model compression, quantization, and acceleration to optimize inference efficiency.

4. Experimentation and Evaluation

• Design and conduct experiments to test multimodal models, analyze results, and document findings.

• Benchmark different models and improve generalization capabilities.

5. Technical Application

• Apply multimodal technologies to practical use cases (e.g., multimodal search, image generation, content understanding).

• Assist in developing AI tools and applications related to multimodal technologies.

 

Requirements

1. Education

• Currently pursuing a Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Data Science, or related fields.

2. Technical Skills

• Proficient in deep learning frameworks such as PyTorch or TensorFlow.

• Knowledge of core concepts in multimodal learning, such as cross-modal alignment and modality fusion.

• Familiarity with large language and vision models (e.g., Transformer, Vision Transformer).

3. Programming Skills

• Strong proficiency in Python with clean coding practices and debugging skills.

• Experience working with image, text, or audio data (e.g., OpenCV, NLTK, Librosa).

4. Bonus Points

• Hands-on experience with models like CLIP, BLIP, GPT-4 Vision, or Stable Diffusion.

• Publications or projects related to multimodal AI.

• Knowledge of distributed training and large-scale data processing techniques.

5. Other Requirements

• Strong ability to quickly learn new concepts and technologies.

• Excellent teamwork and communication skills.