Singapore

Multimodal Reinforcement Learning Algorithm Researcher, Singapore

Multimodal Reinforcement Learning Algorithm Researcher, Singapore
Description
Business Unit
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers, TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia,TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
What The Role Entails
Conduct research on reinforcement learning algorithms for multimodal models, including diffusion models for image and video generation, autoregressive models for multimodal understanding, and cutting-edge unified multimodal frameworks. Design and develop reinforcement learning training frameworks and reward modeling strategies to enable efficient large-scale training, improve training stability, and address issues such as reward hacking. Explore next-generation reinforcement learning paradigms that enable more direct and efficient learning from environmental feedback.
Who We Look For
Bachelor's degree or above in Computer Science or related fields. Excellent research capabilities with publications in top conferences including ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, SIGGRAPH, etc. Strong engineering and programming skills, with experience in deep learning system implementation, model training and inference optimization, CPU/GPU acceleration, and distributed training and inference. Preference given to candidates with experience in diffusion models, autoregressive models, text-to-image / text-to-video generation. Preference given to candidates with participation experience in ACM/NOIP (Informatics Olympiad).
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Highlights
Safety Tips
Be careful: if it seems too good to be true, it most likely is.
1 / 10
More info about this ad

Multimodal Reinforcement Learning Algorithm Researcher has been posted in the Bishan Education & Training category on Locanto.

In this category, there are no other ads right now posted in Bishan.

Interested in more? Widen your search to view ads in nearby areas of Bishan. This includes Education & Training in Hougang, Orchard and Toa Payoh. There are more ads within a 15 km radius for this category. If you want to view those ads, click here.

Go to next ad