The Information Engineering Lab IELab @ UQ | The University of Queensland

2024

Does Vec2Text Pose a New Corpus Poisoning Threat?
Shengyao Zhuang, Bevan Koopman, Guido Zuccon
arXiv  ·  10 Oct 2024  ·  arXiv:2410.06628
Embark on DenseQuest: A System for Selecting the Best Dense Retriever for a Custom Collection
Ekaterina Khramtsova, Teerapong Leelanupab, Shengyao Zhuang, Mahsa Baktashmotlagh, Guido Zuccon
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  10 Jul 2024  ·  10.1145/3626772.3657674
FeB4RAG: Evaluating Federated Search in the Context of Retrieval Augmented Generation
Shuai Wang, Ekaterina Khramtsova, Shengyao Zhuang, Guido Zuccon
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  10 Jul 2024  ·  10.1145/3626772.3657853
Dense Retrieval with Continuous Explicit Feedback for Systematic Review Screening Prioritisation
Xinyu Mao, Shengyao Zhuang, Bevan Koopman, Guido Zuccon
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  10 Jul 2024  ·  10.1145/3626772.3657921
Large Language Models Based Stemming for Information Retrieval: Promises, Pitfalls and Failures
Shuai Wang, Shengyao Zhuang, Guido Zuccon
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  10 Jul 2024  ·  10.1145/3626772.3657949
A Setwise Approach for Effective and Highly Efficient Zero-shot Ranking with Large Language Models
Shengyao Zhuang, Honglei Zhuang, Bevan Koopman, Guido Zuccon
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  10 Jul 2024  ·  10.1145/3626772.3657813
Leveraging LLMs for Unsupervised Dense Retriever Ranking
Ekaterina Khramtsova, Shengyao Zhuang, Mahsa Baktashmotlagh, Guido Zuccon
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  10 Jul 2024  ·  10.1145/3626772.3657798
Revisiting Document Expansion and Filtering for Effective First-Stage Retrieval
Watheq Mansour, Shengyao Zhuang, Guido Zuccon, Joel Mackenzie
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  10 Jul 2024  ·  10.1145/3626772.3657850
The Impact of Auxiliary Patient Data on Automated Chest X-Ray Report Generation and How to Incorporate It
Aaron Nicolson, Shengyao Zhuang, Jason Dowling, Bevan Koopman
arXiv  ·  21 Jun 2024  ·  arXiv:2406.13181
An Investigation of Prompt Variations for Zero-shot LLM-based Rankers
Shuoqi Sun, Shengyao Zhuang, Shuai Wang, Guido Zuccon
arXiv  ·  21 Jun 2024  ·  arXiv:2406.14117
A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking
Ferdinand Schlatt, Maik Fröbe, Harrisen Scells, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Benno Stein, Martin Potthast, Matthias Hagen
arXiv  ·  18 Jun 2024  ·  arXiv:2405.07920
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders
Ferdinand Schlatt, Maik Fröbe, Harrisen Scells, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Benno Stein, Martin Potthast, Matthias Hagen
arXiv  ·  18 Jun 2024  ·  arXiv:2404.06912
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon
arXiv  ·  18 Jun 2024  ·  arXiv:2404.18424
Large Language Models for Stemming: Promises, Pitfalls and Failures
Shuai Wang, Shengyao Zhuang, Guido Zuccon
arXiv  ·  20 Feb 2024  ·  arXiv:2402.11757
ReSLLM: Large Language Models are Strong Resource Selectors for Federated Search
Shuai Wang, Shengyao Zhuang, Bevan Koopman, Guido Zuccon
arXiv  ·  01 Feb 2024  ·  arXiv:2401.17645
Zero-Shot Generative Large Language Models for Systematic Review Screening Automation
Shuai Wang, Harrisen Scells, Shengyao Zhuang, Martin Potthast, Bevan Koopman, Guido Zuccon
Lecture Notes in Computer Science  ·  01 Jan 2024  ·  10.1007/978-3-031-56027-9_25

2023

Selecting which Dense Retriever to use for Zero-Shot Search
Ekaterina Khramtsova, Shengyao Zhuang, Mahsa Baktashmotlagh, Xi Wang, Guido Zuccon
Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region  ·  26 Nov 2023  ·  10.1145/3624918.3625330
Typos-aware Bottlenecked Pre-Training for Robust Dense Retrieval
Shengyao Zhuang, Linjun Shou, Jian Pei, Ming Gong, Houxing Ren, Guido Zuccon, Daxin Jiang
Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region  ·  26 Nov 2023  ·  10.1145/3624918.3625324
Exploring the Representation Power of SPLADE Models
Joel Mackenzie, Shengyao Zhuang, Guido Zuccon
Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval  ·  09 Aug 2023  ·  10.1145/3578337.3605129
Beyond CO2 Emissions: The Overlooked Impact of Water Consumption of Information Retrieval Models
Guido Zuccon, Harrisen Scells, Shengyao Zhuang
Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval  ·  09 Aug 2023  ·  10.1145/3578337.3605121
Augmenting Passage Representations with Query Generation for Enhanced Cross-Lingual Dense Retrieval
Shengyao Zhuang, Linjun Shou, Guido Zuccon
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  18 Jul 2023  ·  10.1145/3539618.3591952
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, Guido Zuccon, Daxin Jiang
arXiv  ·  10 Jul 2023  ·  arXiv:2206.10128
AgAsk: an agent to help answer farmer’s questions from scientific documents
Bevan Koopman, Ahmed Mourad, Hang Li, Anton van der Vegt, Shengyao Zhuang, Simon Gibson, Yash Dang, David Lawrence, Guido Zuccon
International Journal on Digital Libraries  ·  19 Jun 2023  ·  10.1007/s00799-023-00369-y
Pseudo Relevance Feedback with Deep Language Models and Dense Retrievers: Successes and Pitfalls
Hang Li, Ahmed Mourad, Shengyao Zhuang, Bevan Koopman, Guido Zuccon
ACM Transactions on Information Systems  ·  10 Apr 2023  ·  10.1145/3570724
Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking
Shengyao Zhuang, Bing Liu, Bevan Koopman, Guido Zuccon
Findings of the Association for Computational Linguistics: EMNLP 2023  ·  01 Jan 2023  ·  10.18653/v1/2023.findings-emnlp.590

2022

Robustness of Neural Rankers to Typos: A Comparative Study
Shengyao Zhuang, Xinyu Mao, Guido Zuccon
Proceedings of the 26th Australasian Document Computing Symposium  ·  15 Dec 2022  ·  10.1145/3572960.3572981
Pseudo-Relevance Feedback with Dense Retrievers in Pyserini
Hang Li, Shengyao Zhuang, Xueguang Ma, Jimmy Lin, Guido Zuccon
Proceedings of the 26th Australasian Document Computing Symposium  ·  15 Dec 2022  ·  10.1145/3572960.3572982
Reinforcement online learning to rank with unbiased reward shaping
Shengyao Zhuang, Zhihao Qiao, Guido Zuccon
Information Retrieval Journal  ·  04 Aug 2022  ·  10.1007/s10791-022-09413-y
Reduce, Reuse, Recycle
Harrisen Scells, Shengyao Zhuang, Guido Zuccon
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  06 Jul 2022  ·  10.1145/3477495.3531766
Implicit Feedback for Dense Passage Retrieval
Shengyao Zhuang, Hang Li, Guido Zuccon
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  06 Jul 2022  ·  10.1145/3477495.3531994
Asyncval
Shengyao Zhuang, Guido Zuccon
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  06 Jul 2022  ·  10.1145/3477495.3531658
To Interpolate or not to Interpolate
Hang Li, Shuai Wang, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  06 Jul 2022  ·  10.1145/3477495.3531884
CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos
Shengyao Zhuang, Guido Zuccon
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  06 Jul 2022  ·  10.1145/3477495.3531951
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study
Hang Li, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon
Advances in Information Retrieval  ·  01 Jan 2022  ·  https://doi.org/10.1007/978-3-030-99736-6_40

2021

Fast Passage Re-ranking with Contextualized Exact Term Matching and Efficient Passage Expansion
Shengyao Zhuang, Guido Zuccon
arXiv  ·  14 Sep 2021  ·  arXiv:2108.08513
BERT-based Dense Retrievers Require Interpolation with BM25 for Effective Passage Retrieval
Shuai Wang, Shengyao Zhuang, Guido Zuccon
Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval  ·  11 Jul 2021  ·  10.1145/3471158.3472233
TILDE: Term Independent Likelihood moDEl for Passage Re-ranking
Shengyao Zhuang, Guido Zuccon
Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  11 Jul 2021  ·  10.1145/3404835.3462922
How do Online Learning to Rank Methods Adapt to Changes of Intent?
Shengyao Zhuang, Guido Zuccon
Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval  ·  11 Jul 2021  ·  10.1145/3404835.3462937
Effective and Privacy-preserving Federated Online Learning to Rank
Shuyi Wang, Bing Liu, Shengyao Zhuang, Guido Zuccon
Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval  ·  11 Jul 2021  ·  10.1145/3471158.3472236
Deep Query Likelihood Model for Information Retrieval
Shengyao Zhuang, Hang Li, Guido Zuccon
Advances in Information Retrieval  ·  01 Jan 2021  ·  https://doi.org/10.1007/978-3-030-72240-1_49
Federated Online Learning to Rank with Evolution Strategies: A Reproducibility Study
Shuyi Wang, Shengyao Zhuang, Guido Zuccon
Lecture Notes in Computer Science  ·  01 Jan 2021  ·  10.1007/978-3-030-72240-1_10
Dealing with Typos for BERT-based Passage Retrieval and Ranking
Shengyao Zhuang, Guido Zuccon
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing  ·  01 Jan 2021  ·  10.18653/v1/2021.emnlp-main.225

2020

Counterfactual Online Learning to Rank
Shengyao Zhuang, Guido Zuccon
Lecture Notes in Computer Science  ·  01 Jan 2020  ·  10.1007/978-3-030-45439-5_28

Search for Shengyao Zhuang (Arvin)'s papers on the Research page