📕 subnode [[@maya/altsearch]] in 📚 node [[altsearch]]

I'm the daughter of a degree-holding librarian daughter of a degree-holding librarian. Library and information science is one of the coolest fields techie folks could benefit from scratching into. Since I'm into altweb-type projects, let's talk altsearch!

examples

dato.rss

It's a search engine for content that's in an RSS feed1 I am not honestly sure exactly what the point is of this but I love indie search, I love RSS, I am gonna mess around with it until I find stuff to use it for. Weirdly, not a lot of personal sites in the index? Seems like it'd be perfect for that. (also not sure what the "positive" and "negative" tags in the results are)

daniel's network

  • holy shit what a VIBE
  • pulls a site's background image to display behind its results
  • currently very neocities-centric
    • this is good; neocities is often quite neglected by other projects like this
  • still checking it out to see quality of results, but the energy here is so good I'm willing to give it all the time in the world to iterate it was shut down :(

wiby.me / wiby.org

  • refuses pages with "intrusive" ads
  • "Pages must be simple in design"
  • "Pages should not use much scripts/css for cosmetic effect."
  • I'm so ideologically opposed to this I don't use it

searchmysite

  • self hostable!
  • provides search as an API to indexed sites
  • promotes ad-free pages
  • domain: "independent" and "personal"
  • I wish I could figure out how to kick back a few bucks to this -- the content is still kinda sparse, but I really like it both ideologically and in practice

ninfex

  • has a link-aggregator-like upvoting mechanism
  • newest page seems real tech-heavy
  • I haven't used it much, still need to try it out more

lieu

  • indexes2 the semi-closed corpus of sites in the merveilles3 webring
  • search as a tool to facilitate cross-pollination of ideas within a community
    • feels like it is most useful for keeping up with the times by rooting around in all the members' stuff that doesn't show up on an RSS feed
  • mostly, the sites I already follow popped up when I searched things, but it seems like a nice way for someone to get the vibe of the scene

"edge" (search.marginalia.nu)

  • "If you are looking for fact, this is almost certainly the wrong tool. If you are looking for serendipity, you're on the right track. When was the last time you just stumbled onto something interesting, by the way?"
    • it does well at this
  • favors old content: "The hypothesis is something akin to the Lindy-effect: If a webpage has been around for a long time, then odds are it has fundamental redeeming quality that has motivated keeping it around all for that time."
    • this is strictly delusional in a world where there is an ongoing cost to maintain URLs' presence (domain fees) and host content
  • "The more markup it has in relation to its text, the lower it will score. Each script tag is punished."
    • this includes such "evils" as markup for accessibility and alt text
  • I don't agree ideologically at all with the way this is done, but it does succeed at surfacing good old content

million short

  • lets you specify what threshold of popularity (by rank) to exclude
  • I haven't submitted my site because they want a phone number. hell no.
  • good results across a wide variety of queries

teclis

  • I haven't exactly figured out what the sweet spot is for this one. It wants submissions of particular articles, not sites, and it seeded its index with stuff like hacker news and reddit, so it isn't exactly my scene, but let me know if you have good results with it.

ideas on which I have no intention of acting

  • I want lazy-loaded screenshot previews of search results by their links. This incentivises the world to fight against gwalb-powered content collapse
    • maybe reward results where the searched-for content lands above-the-fold?
  • two tiers of results, one powered by submissions, one powered by a million-short-like approach?
  • my web ethic (obviously) is that expressive/artistic CSS and JS is great, with the accessbility and bandwidth caveat that, wherever possible, stuff should Just Work with CSS and JS off.
    • is there a way to detect that kind of "Just Works" to reward/penalize based on it? sounds real hard, but also like there might be good heuristics
  1. synecdoche that includes Atom

  2. It excludes some, so it's a subset.

  3. Merveilles is sort of a school of art/tech and sort of an online community and sort of a Mastodon instance.

📖 stoas
⥱ context