my

Rui Wang

Sr. AI Algorithm Engineer

M.Eng. Arizona State University

About Me

Ever since I wrote my first program on a 386 computer using the LOGO language in elementary school, I have been fascinated by the field of computers. Driven by my passion and enthusiasm, I have dedicated my studies and career to this industry, witnessing every technological revolution along the way. Today, as humanity steps into the age of AI and technology reaches unprecedented heights, I hope to contribute my efforts and work alongside you to ignite a transformative era and create a vibrant chapter of hope.

I hold a Master of Engineering degree from Arizona State University and am an experienced Senior Large Model Engineer specializing in the development and application of large language models, including RAG applications, chatbots, and knowledge base systems. I am passionate about new technologies in the AI field and dedicated to applying cutting-edge techniques to solve real-world problems.

Key Expertise Areas

Large Model Development

Document Retrieval

Recommendation Systems

Big Data Processing

Python Programming

MLOps

Skills

Large Model Development

Proficient with the LangChain framework for large model development, including local Q&A systems, chatbots, knowledge bases, and other common RAG applications. Mastered techniques such as Chain processing, Agent intelligent agents, Function Calling/Tool Use.

Document Retrieval & NLP

Proficient in common NLP tasks such as document search, named entity recognition (NER), and sentiment analysis. Skilled in using search engines like Solr and ElasticSearch, with an understanding of the underlying Lucene search framework.

Recommendation Systems

Experienced in developing applications based on recommender systems using common algorithms. Knowledgeable in leveraging large models within recommender systems.

Big Data Processing

Proficient in data processing frameworks such as Numpy and Pandas. Experienced with the Spark distributed framework for data cleaning, extraction, analysis, and cluster tuning.

Programming & Frameworks

Proficient in Python programming and frameworks like FastAPI and Flask. Knowledgeable in databases such as MySQL, ClickHouse, and Redis. Experienced with the Linux operating system and Docker containers.

Model Deployment & MLOps

Experienced in deploying private reasoning services for common open-source models and in distributed model training and inference using vLLM, FastChat, and DeepSpeed. Experienced in MLOps workflows, including using TensorRT-LLM for large model compilation and deploying related services on Triton.

Data Visualization

Skilled in creating insightful data visualizations and dashboards using tools like Matplotlib, Seaborn, and Tableau to communicate complex data effectively.

Cloud Computing

Experience with major cloud platforms (AWS, Azure, GCP) for deploying and scaling AI applications, including serverless functions and container orchestration.

AI Ethics & Responsible AI

Understanding of AI ethics principles, fairness, accountability, and transparency in AI systems. Committed to developing responsible AI solutions.

Work Experience

Sr. AI Algorithm Engineer

Shanghai University AI Medical Joint Innovation Research and Development Center (Boai China Enterprise Group)

Jan 2024 - Present

  • Developed RAG-related Q&A systems and local knowledge base systems.
  • Responsible for data cleaning, fine-tuning, and training of large models.
  • Researched and applied the latest technologies in the AI field.
  • Conducted local deployment, evaluation, testing, and report generation for open-source large models.
  • Researched and deployed distributed inference service architectures for large models.

Sr. Search Algorithm Engineer

LexisNexis

Jun 2017 - Dec 2023

  • Conducted development work related to search engine business.
  • Trained search reordering models using machine learning (LTR) and optimized search results.
  • Developed NLP applications, including tokenizers, NER, and document recommendation.
  • Developed data visualization systems and scoring systems.
  • Tuned distributed systems such as Solr search engine clusters and Spark clusters.

Sr. Big Data Engineer

Wealink

Apr 2016 - May 2017

  • NLP analysis and processing of resume data.
  • Debugging data crawlers and designing high-concurrency website architectures.
  • Big Data analysis using Spark, report generation, and visualization.
  • Developed business logic for resume search using ElasticSearch.

Technology Partner

Hanyou Network

May 2015 - Apr 2016

  • Co-founded a mobile game platform project.
  • Led the technical department, responsible for R&D of product technical architecture.
  • Managed the technical team and overcame technical challenges.

Sr. Python Engineer

Kingnet

Mar 2012 - May 2015

  • Developed systems for domestic and international games.
  • Debugged and integrated game back-end interfaces.
  • Worked on the development of game website frontends.
  • Developed and analyzed game log collection systems.

Projects

Large Model Q&A System

Developed RAG Q&A systems and knowledge bases based on open-source large models, fine-tuned for specific industries.

Langchain RAG SFT

Search Ranking Optimization

Utilized machine learning (Learning to Rank - LTR) methods to train models and Rerank methods to optimize search results, enhancing search experience.

LTR Ranklib AB Testing

Legal Entity Identification

Performed NER identification for legal entities such as case numbers and company names, trained models, deployed recognition services, and compared results with large model recognition.

NER BERT GPT

User Behavior Analysis System

Modeled after ELK architecture, cleaned and processed front-end data into ElasticSearch and ClickHouse. Capable of handling user log data at the 10 billion record level.

ELK Spark ClickHouse

Hyperlink Cross-Referencing System

A system that handles cross-referencing of articles and builds links. Analyzes citation relationships among billions of documents and provides relevant recommended articles using NLP.

NLP Spark Big Data

AI-Powered Content Generation

Developed a tool leveraging generative AI models to assist in creating various types of content, including articles, summaries, and creative text.

Generative AI LLM FastAPI

GitHub Contributions

Below is a snapshot of my GitHub activity.

GitHub Contributions Graph

Contact

If you are interested in my work or have any collaboration inquiries, please feel free to contact me through the following channels.

You can also find me on the following platforms: