Published the "Transparency Report" 2018 edition showing the detailed breakdown of URLs that Google deleted due to "forgotten rights"



May 2014European Court of Justice, It is possible to request deletion of information about itself from search results of Google etc.Right to be forgottenThe ruling was approved to admit. Therefore, since we began accepting information deletion after the judgment, Google regularly renews a "transparency report" that summarizes the breakdown of URLs excluded from the search result index. The contents of the report summarizing the information up to February 2018 were updated.

Transparency Report
https://transparencyreport.google.com/eu-privacy/overview

Updating our "right to be forgotten" Transparency Report
https://blog.google/topics/google-europe/updating-our-right-be-forgotten-transparency-report/

The number of URLs excluded from search results by Google as of February 2018 is 6,448,76. The number of applications for exclusion was 2,433,727.


This graph shows the transition of cumulative number of exclusion applications and cumulative number of excluded URLs. The red line shows the number of exclusion requests, the number of blue lines excluded, you can see that the extreme change has not occurred, except immediately after the start of reception. Both exclusion requests and excluded URLs continue to increase on the right side, but we can also see that extreme changes have not occurred, except immediately after the start of reception.


Pie chart showing "excluded URL (red)" and "URL not excluded" (blue) among "URLs judged by Google". Since the URL which is excluded after the application is excluded is 43.3%, it seems that there are many URLs which have not been judged for reasons such as not meeting the application requirements etc.


88.7% applied for excluding "private person" (blue).


The breakdown of 11.3% not included in "private person" is about 40% for juveniles (blue), about 20% for companies (red) and government officials (yellow), 14% for private public figures (green) It is becoming.


About 50% of the site categories to which the excluded URL belonged was "other" (blue), which was the largest number. After that, it continues in the order of "directory site" (red), "news site" (yellow), "social media" (green), "other" (ash). The difference between the two "other" is not explained in the report.


When categorizing reasons for exclusion application, the most frequent was "information is insufficient" (blue · 24.7%). Below, "Occupation information" "Others" "No name exists" "My work" "Crime" continues.


Also, if you look at why there are many reasons for exclusion by category of website, for example, it seems that there were many applications for "occupational information" at "government site".


This is a graph that shows the number actually excluded from exclusion applications. On average, directory sites and social media exceed 50%.


For whatever reason, it is shown in the graph as to whether it is easy to be excluded, and almost 100% excluded is a red "name not found". Probably celebrity's nameSource codeIt can be inferred that an act of trying to earn the number of accesses is done by putting it inside. The second is "the information is insufficient" (yellow), the third is "highly confidential personal information" (black), excluding applications around this excluded with a probability close to 90%.


Specific cases are also disclosed about Google's response to the application, such as "application for exclusion on articles of cases reported in the past from defendants tried as trials in the case of domestic violence and were not guilty" It is.


Below is a list of URLs excluded by Google that are excluded by a high percentage of exclusion requestsDomain nameA list is shown. Here we are using Facebook and TwitterSNSYou can see that there are many domains.

in Note,   Web Service, Posted by darkhorse_log