Better search in mainland China

5/31/12 | 9:00:00 AM


Over the past couple years, we’ve had a lot of feedback that Google Search from mainland China can be inconsistent and unreliable. It depends on the search query and browser, but users are regularly getting error messages like “This webpage is not available” or “The connection was reset.” And when that happens, people typically cannot use Google again for a minute or more. This video shows what’s happening:

We’ve taken a long, hard look at our systems and have not found any problems. However, after digging into user reports, we’ve noticed that these interruptions are closely correlated with searches for a particular subset of queries.

So starting today we’ll notify users in mainland China when they enter a keyword that may cause connection issues. By prompting people to revise their queries, we hope to reduce these disruptions and improve our user experience from mainland China. Of course, if users want to press ahead with their original queries they can carry on.

In order to figure out which keywords are causing problems, a team of engineers in the U.S. reviewed the 350,000 most popular search queries in China. In their research, they looked at multiple signals to identify the disruptive queries, and from there they identified specific terms at the root of the issue.

We’ve observed that many of the terms triggering error messages are simple everyday Chinese characters, which can have different meanings in different contexts. For example a search for the single character [] (Jiāng, a common surname that also means “river”) causes a problem on its own, but is also part of other common searches like [丽] (Lijiang, the name of a city in Yunnan Province), [锦之星] (the Jinjiang Star hotel chain), and [苏移动] (Jiangsu Mobile, a mobile phone service). Likewise, searching for [] (Zhōu, another common surname that also means “week”) triggers an error message, so including this character in other searches—like [杰伦] (Jay Chou, the Taiwanese pop star), [星驰] (Stephen Chow, a popular comedian from Hong Kong), or any publication that includes the word “week”—would also be problematic.

Now, when a user types in a common term like [长] (Yangtze River) from China, Google highlights the problem term [] as they type, and when they press “enter” a drop-down menu appears beneath the search box:

Notices will appear matching the user’s language settings.

To learn more, users can click on the “interruption” link, which takes them to this help center article. They can continue with their original query (which will likely lead to an error message), or click “Edit search terms,” which will remove the highlighted characters and prompt users to try other search terms:

In order to avoid connection problems, users can refine their searches without the problem keywords. For example, instead of searching for [长], they could search for [changjiang]—which also means Yangtze River, but is written using pinyin, the system used to transliterate Chinese characters into Latin script. This won’t cause a timeout, but will still generate search results related to the Yangtze River.

We’ve said before that we want as many people in the world as possible to have access to our services. Our hope is that these written notifications will help improve the search experience in mainland China. If you’re outside China and are curious to see what the notifications look like, you can visit this link to try it out.

Note: To read this blog post in Chinese, see this PDF.