Shape training data for large-scale language models focused on scientific discovery and software development
Challenge advanced language models on software engineering topics in Chinese
Converse with models on technical scenarios and verify logical accuracy and coding fluency
Assess naturalness and correctness of Chinese language usage
Capture reproducible error traces and document failure modes
Suggest improvements to prompt engineering and evaluation metrics
Requirements
Fluent in Chinese
Expertise in algorithms, data structures, software architecture, frontend and backend development, cloud infrastructure, and systems programming
Knowledge of asynchronous programming, RESTful API integration, memory management, object-oriented design, secure coding practices, and debugging distributed systems
Ability to document failure modes and capture reproducible error traces
Experience with technical writing in Chinese or open-source contributions preferred
Clear, metacognitive communication and ability to “show your work”
Bachelor's, Master's, or PhD in computer science, software engineering, or closely related field preferred; real-world coding experience signals fit