default search action
Wenxuan Wang 0001
Person information
- affiliation (PhD 2023): Chinese University of Hong Kong, Department of Computer Science and Engineering, Hong Kong
Other persons with the same name
- Wenxuan Wang — disambiguation page
- Wenxuan Wang 0002 — Chinese Academy of Sciences, Institute of Automation, Beijing, China (and 2 more)
- Wenxuan Wang 0003 — Fudan University, School of Computer Science, Shanghai Key Lab of Intelligent Information Processing, China
- Wenxuan Wang 0004 — Tsinghua University, School of Vehicle and Mobility, Beijing, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c21]Wenxuan Wang, Zhaopeng Tu, Chang Chen, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu:
All Languages Matter: On the Multilingual Safety of LLMs. ACL (Findings) 2024: 5865-5877 - [c20]Wenxuan Wang, Wenxiang Jiao, Jingyuan Huang, Ruyi Dai, Jen-tse Huang, Zhaopeng Tu, Michael R. Lyu:
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models. ACL (1) 2024: 6349-6384 - [c19]Youliang Yuan, Wenxuan Wang, Qingshuo Guo, Yiming Xiong, Chihao Shen, Pinjia He:
Does ChatGPT Know That It Does Not Know? Evaluating the Black-Box Calibration of ChatGPT. LREC/COLING 2024: 5191-5201 - [c18]Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu:
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models. EMNLP 2024: 2124-2155 - [c17]Jen-tse Huang, Wenxiang Jiao, Man Ho Lam, Eric John Li, Wenxuan Wang, Michael R. Lyu:
On the Reliability of Psychological Scales on Large Language Models. EMNLP 2024: 6152-6173 - [c16]Jen-tse Huang, Wenxuan Wang, Eric John Li, Man Ho Lam, Shujie Ren, Youliang Yuan, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu:
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs. ICLR 2024 - [c15]Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Pinjia He, Shuming Shi, Zhaopeng Tu:
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher. ICLR 2024 - [c14]Wenxuan Wang, Haonan Bai, Jen-tse Huang, Yuxuan Wan, Youliang Yuan, Haoyi Qiu, Nanyun Peng, Michael R. Lyu:
New Job, New Gender? Measuring the Social Bias in Image Generation Models. ACM Multimedia 2024: 3781-3789 - [i38]Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu:
A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models. CoRR abs/2401.00757 (2024) - [i37]Wenxuan Wang, Juluan Shi, Zhaopeng Tu, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu:
The Earth is Flat? Unveiling Factual Errors in Large Language Models. CoRR abs/2401.00761 (2024) - [i36]Wenxuan Wang, Haonan Bai, Jen-tse Huang, Yuxuan Wan, Youliang Yuan, Haoyi Qiu, Nanyun Peng, Michael R. Lyu:
New Job, New Gender? Measuring the Social Bias in Image Generation Models. CoRR abs/2401.00763 (2024) - [i35]Wenxuan Wang, Yihang Su, Jingyuan Huan, Jie Liu, Wenting Chen, Yudi Zhang, Cheng-Yi Li, Kao-Jung Chang, Xiaohan Xin, Linlin Shen, Michael R. Lyu:
Asclepius: A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models. CoRR abs/2402.11217 (2024) - [i34]Jen-tse Huang, Eric John Li, Man Ho Lam, Tian Liang, Wenxuan Wang, Youliang Yuan, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Michael R. Lyu:
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments. CoRR abs/2403.11807 (2024) - [i33]Man Tik Ng, Hui Tung Tse, Jen-tse Huang, Jingjing Li, Wenxuan Wang, Michael R. Lyu:
How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO. CoRR abs/2404.13957 (2024) - [i32]Chaozheng Wang, Zongjie Li, Cuiyun Gao, Wenxuan Wang, Ting Peng, Hailiang Huang, Yuetang Deng, Shuai Wang, Michael R. Lyu:
Exploring Multi-Lingual Bias of Large Code Models in Code Generation. CoRR abs/2404.19368 (2024) - [i31]Yuxuan Wan, Chaozheng Wang, Yi Dong, Wenxuan Wang, Shuqing Li, Yintong Huo, Michael R. Lyu:
Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach. CoRR abs/2406.16386 (2024) - [i30]Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu:
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training. CoRR abs/2407.09121 (2024) - [i29]Jen-tse Huang, Jiaxu Zhou, Tailin Jin, Xuhui Zhou, Zixi Chen, Wenxuan Wang, Youliang Yuan, Maarten Sap, Michael R. Lyu:
On the Resilience of Multi-Agent Systems with Malicious Agents. CoRR abs/2408.00989 (2024) - [i28]Wenxuan Wang, Juluan Shi, Chaozheng Wang, Cheryl Lee, Youliang Yuan, Jen-tse Huang, Michael R. Lyu:
Learning to Ask: When LLMs Meet Unclear Instruction. CoRR abs/2409.00557 (2024) - [i27]Chaozheng Wang, Shuzheng Gao, Cuiyun Gao, Wenxuan Wang, Chun Yong Chong, Shan Gao, Michael R. Lyu:
A Systematic Evaluation of Large Code Models in API Suggestion: When, Which, and How. CoRR abs/2409.13178 (2024) - [i26]Wenxuan Wang, Kuiyi Gao, Zihan Jia, Youliang Yuan, Jen-tse Huang, Qiuzhi Liu, Shuai Wang, Wenxiang Jiao, Zhaopeng Tu:
Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step. CoRR abs/2410.03869 (2024) - 2023
- [j1]Yun Peng, Shuqing Li, Wenwei Gu, Yichen Li, Wenxuan Wang, Cuiyun Gao, Michael R. Lyu:
Revisiting, Benchmarking and Exploring API Recommendation: How Far Are We? IEEE Trans. Software Eng. 49(4): 1876-1897 (2023) - [c13]Jianping Zhang, Jen-tse Huang, Wenxuan Wang, Yichen Li, Weibin Wu, Xiaosen Wang, Yuxin Su, Michael R. Lyu:
Improving the Transferability of Adversarial Samples by Path-Augmented Method. CVPR 2023: 8173-8182 - [c12]Wenxiang Jiao, Jen-tse Huang, Wenxuan Wang, Zhiwei He, Tian Liang, Xing Wang, Shuming Shi, Zhaopeng Tu:
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback. EMNLP (Findings) 2023: 15009-15020 - [c11]Wenxuan Wang, Jen-tse Huang, Weibin Wu, Jianping Zhang, Yizhan Huang, Shuqing Li, Pinjia He, Michael R. Lyu:
MTTM: Metamorphic Testing for Textual Content Moderation Software. ICSE 2023: 2387-2399 - [c10]Wenxuan Wang, Jingyuan Huang, Chang Chen, Jiazhen Gu, Jianping Zhang, Weibin Wu, Pinjia He, Michael R. Lyu:
Validating Multimedia Content Moderation Software via Semantic Fusion. ISSTA 2023: 576-588 - [c9]Shuzheng Gao, Xin-Cheng Wen, Cuiyun Gao, Wenxuan Wang, Hongyu Zhang, Michael R. Lyu:
What Makes Good In-Context Demonstrations for Code Intelligence Tasks with LLMs? ASE 2023: 761-773 - [c8]Yun Peng, Chaozheng Wang, Wenxuan Wang, Cuiyun Gao, Michael R. Lyu:
Generative Type Inference for Python. ASE 2023: 988-999 - [c7]Wenxuan Wang, Jingyuan Huang, Jen-tse Huang, Chang Chen, Jiazhen Gu, Pinjia He, Michael R. Lyu:
An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software. ASE 2023: 1339-1351 - [c6]Yuxuan Wan, Wenxuan Wang, Pinjia He, Jiazhen Gu, Haonan Bai, Michael R. Lyu:
BiasAsker: Measuring the Bias in Conversational AI System. ESEC/SIGSOFT FSE 2023: 515-527 - [d2]Yun Peng, Shuqing Li, Wenwei Gu, Yichen Li, Wenxuan Wang, Cuiyun Gao, Michael R. Lyu:
APIBench: A Benchmark Dataset for Evaluating API Recommendation Approaches in Python and Java. Version 2. Zenodo, 2023 [all versions] - [i25]Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Xing Wang, Zhaopeng Tu:
Is ChatGPT A Good Translator? A Preliminary Study. CoRR abs/2301.08745 (2023) - [i24]Wenxuan Wang, Jen-tse Huang, Weibin Wu, Jianping Zhang, Yizhan Huang, Shuqing Li, Pinjia He, Michael R. Lyu:
MTTM: Metamorphic Testing for Textual Content Moderation Software. CoRR abs/2302.05706 (2023) - [i23]Haoran Wu, Wenxuan Wang, Yuxuan Wan, Wenxiang Jiao, Michael R. Lyu:
ChatGPT or Grammarly? Evaluating ChatGPT on Grammatical Error Correction Benchmark. CoRR abs/2303.13648 (2023) - [i22]Jianping Zhang, Jen-tse Huang, Wenxuan Wang, Yichen Li, Weibin Wu, Xiaosen Wang, Yuxin Su, Michael R. Lyu:
Improving the Transferability of Adversarial Samples by Path-Augmented Method. CoRR abs/2303.15735 (2023) - [i21]Wenxiang Jiao, Jen-tse Huang, Wenxuan Wang, Xing Wang, Shuming Shi, Zhaopeng Tu:
ParroT: Translating During Chat Using Large Language Models. CoRR abs/2304.02426 (2023) - [i20]Shuzheng Gao, Xin-Cheng Wen, Cuiyun Gao, Wenxuan Wang, Michael R. Lyu:
Constructing Effective In-Context Demonstration for Code Intelligence Tasks: An Empirical Study. CoRR abs/2304.07575 (2023) - [i19]Yuxuan Wan, Wenxuan Wang, Pinjia He, Jiazhen Gu, Haonan Bai, Michael R. Lyu:
BiasAsker: Measuring the Bias in Conversational AI System. CoRR abs/2305.12434 (2023) - [i18]Wenxuan Wang, Jingyuan Huang, Chang Chen, Jiazhen Gu, Jianping Zhang, Weibin Wu, Pinjia He, Michael R. Lyu:
Validating Multimedia Content Moderation Software via Semantic Fusion. CoRR abs/2305.13623 (2023) - [i17]Jen-tse Huang, Wenxuan Wang, Man Ho Lam, Eric John Li, Wenxiang Jiao, Michael R. Lyu:
ChatGPT an ENFJ, Bard an ISTJ: Empirical Study on Personalities of Large Language Models. CoRR abs/2305.19926 (2023) - [i16]Yun Peng, Chaozheng Wang, Wenxuan Wang, Cuiyun Gao, Michael R. Lyu:
Generative Type Inference for Python. CoRR abs/2307.09163 (2023) - [i15]Jen-tse Huang, Man Ho Lam, Eric John Li, Shujie Ren, Wenxuan Wang, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu:
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench. CoRR abs/2308.03656 (2023) - [i14]Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Pinjia He, Shuming Shi, Zhaopeng Tu:
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher. CoRR abs/2308.06463 (2023) - [i13]Wenxuan Wang, Jingyuan Huang, Jen-tse Huang, Chang Chen, Jiazhen Gu, Pinjia He, Michael R. Lyu:
An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software. CoRR abs/2308.09810 (2023) - [i12]Wenxuan Wang, Zhaopeng Tu, Chang Chen, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu:
All Languages Matter: On the Multilingual Safety of Large Language Models. CoRR abs/2310.00905 (2023) - [i11]Jen-tse Huang, Wenxuan Wang, Eric John Li, Man Ho Lam, Shujie Ren, Youliang Yuan, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu:
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench. CoRR abs/2310.01386 (2023) - [i10]Wenxuan Wang, Wenxiang Jiao, Jingyuan Huang, Ruyi Dai, Jen-tse Huang, Zhaopeng Tu, Michael R. Lyu:
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models. CoRR abs/2310.12481 (2023) - [i9]Tian Liang, Zhiwei He, Jen-tse Huang, Wenxuan Wang, Wenxiang Jiao, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi, Xing Wang:
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models. CoRR abs/2310.20499 (2023) - 2022
- [c5]Wenxuan Wang, Wenxiang Jiao, Yongchang Hao, Xing Wang, Shuming Shi, Zhaopeng Tu, Michael R. Lyu:
Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation. ACL (1) 2022: 2591-2600 - [c4]Jianping Zhang, Weibin Wu, Jen-tse Huang, Yizhan Huang, Wenxuan Wang, Yuxin Su, Michael R. Lyu:
Improving Adversarial Transferability via Neuron Attribution-based Attacks. CVPR 2022: 14973-14982 - [c3]Jen-tse Huang, Jianping Zhang, Wenxuan Wang, Pinjia He, Yuxin Su, Michael R. Lyu:
AEON: a method for automatic evaluation of NLP test cases. ISSTA 2022: 202-214 - [c2]Wenxiang Jiao, Zhaopeng Tu, Jiarui Li, Wenxuan Wang, Jen-tse Huang, Shuming Shi:
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages. WMT 2022: 1049-1056 - [i8]Wenxuan Wang, Wenxiang Jiao, Yongchang Hao, Xing Wang, Shuming Shi, Zhaopeng Tu, Michael R. Lyu:
Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation. CoRR abs/2203.08442 (2022) - [i7]Jianping Zhang, Weibin Wu, Jen-tse Huang, Yizhan Huang, Wenxuan Wang, Yuxin Su, Michael R. Lyu:
Improving Adversarial Transferability via Neuron Attribution-Based Attacks. CoRR abs/2204.00008 (2022) - [i6]Jen-tse Huang, Jianping Zhang, Wenxuan Wang, Pinjia He, Yuxin Su, Michael R. Lyu:
AEON: A Method for Automatic Evaluation of NLP Test Cases. CoRR abs/2205.06439 (2022) - [i5]Wenxuan Wang, Wenxiang Jiao, Shuo Wang, Zhaopeng Tu, Michael R. Lyu:
Understanding and Mitigating the Uncertainty in Zero-Shot Translation. CoRR abs/2205.10068 (2022) - [i4]Wenxiang Jiao, Zhaopeng Tu, Jiarui Li, Wenxuan Wang, Jen-tse Huang, Shuming Shi:
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages. CoRR abs/2210.09644 (2022) - 2021
- [d1]Yun Peng, Shuqing Li, Wenwei Gu, Yichen Li, Wenxuan Wang, Cuiyun Gao, Michael R. Lyu:
APIBench: A Benchmark Dataset for Evaluating API Recommendation Approaches in Python and Java. Version v1.0. Zenodo, 2021 [all versions] - [i3]Shuo Wang, Zhaopeng Tu, Zhixing Tan, Wenxuan Wang, Maosong Sun, Yang Liu:
Language Models are Good Translators. CoRR abs/2106.13627 (2021) - [i2]Yun Peng, Shuqing Li, Wenwei Gu, Yichen Li, Wenxuan Wang, Cuiyun Gao, Michael R. Lyu:
Revisiting, Benchmarking and Exploring API Recommendation: How Far Are We? CoRR abs/2112.12653 (2021) - 2020
- [c1]Wenxuan Wang, Zhaopeng Tu:
Rethinking the Value of Transformer Components. COLING 2020: 6019-6029 - [i1]Wenxuan Wang, Zhaopeng Tu:
Rethinking the Value of Transformer Components. CoRR abs/2011.03803 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 19:33 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint