Wikia Search is a new, wiki-inspired search-engine project that attempts to create a transparent set of ranking algorithms that fight spam and promote the good stuff to the top. This is in contrast to Google, Yahoo, and other search engines, which treat their ranking algorithms as trade secrets and as high-risk tactics that have to be guarded from spammers.
The idea of a ranking algorithm is that it produces "good results" -- that it returns the best, most relevant results based on the user's search terms. We have a notion that the traditional search engine algorithm is "neutral" -- that it lacks an editorial bias and simply works to fulfill some mathematical destiny, embodying some Platonic ideal of "relevance." Compare this to an "inorganic" paid search result of the sort that AltaVista used to sell.
But ranking algorithms are editorial: they embody the biases, hopes, beliefs and hypotheses of the programmers who write and design them. What's more, a tiny handful of search engines effectively control the prominence and viability of the majority of the information in the world.
And those search engines use secret ranking systems to systematically block enormous swaths of information on the grounds that it is spam, is malware, or uses deceptive "optimization" techniques. The list of block-ees is never published, nor are the criteria for blocking. This is done in the name of security, on the grounds that spammers and malware hackers are slowed down by the secrecy.
But "security through obscurity" is widely discredited in information security circles. Obscurity stops dumb attackers from getting through, but it lets the smart attackers clobber you because the smart defenders can't see how your system works and point out its flaws.
Seen in this light, it's positively bizarre: a few companies' secret editorial criteria are used to control what information we see, and those companies defend their secrecy in the name of security-through-obscurity? Yikes!
The Wikia Search project has assembled the basic technologies for a search engine, including a search application, search algorithm and Web crawler. The project will allow technology enthusiasts to help filter sites and rank search results, using a community model akin to that of Wikipedia.
The idea is to challenge the established players by offering a search service that is more transparent to end users, meaning they can see how search results are arrived at. Jimmy Wales, Wikia's co-founder, has described Yahoo and Google as opaque services that don't explain how they produce their results.
(Disclosure: Jimmy Wales and I are writing a book together about a related subject.)