
Comment by gbear605

16 hours ago

I’m not sure why a model needs to be hosted in order to make network calls?

Is there a library of good tools for LLMs to call? I have to imagine the bot-detection avoidance mechanisms are a major engineering effort and not likely to work out of the box with a simple harness and a random local LLM.

  • Even the hosted ones are blocked from searching certain sites. For example, Claude is banned from searching Reddit:

    `Error: "The following domains are not accessible to our user agent: ['reddit.com']."`

  • If your volume is low enough, it should be pretty fine. It can just piggyback on your personal browser cookies for Cloudflare.

  • Tavily, Exa, Firecrawl, Perplexity, and Linkup are all tools for agents to search the web.

    I’ve been building a harness for the past few months, and it supports them all out of the box with an API key.

    • Be warned, though:

      firecrawl: "if you post content or intellectual property within the Services or give us Feedback about the Services, you hereby grant to us a worldwide, irrevocable, non-exclusive, royalty-free license to use, reproduce, modify, publish, translate and distribute any content that you submit in any form [...] You also grant to us the right to sub-license these rights"

      exa: "Query Data is used to improve our products and technology, including by training and fine-tuning models that power our Services"

      perplexity: "Perplexity may retain, copy, distribute and otherwise use Search Data for its lawful business purposes, including the improvement and development of products and services."

      linkup: "Client grants Linkup a worldwide right to use, reproduce and modify the Client Data, including prompts, for the purposes of providing, maintaining, developing, training"

      tavily: "we may use certain portions of your query data to improve our responses to future queries"..."We may share your query data with third-party search index providers (e.g., Google)"

    • Kagi also has an API. People who hate ads are probably the same folk that should be paying for Kagi. That's the sane alternative world where companies respect their users.
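
For anyone wondering what "supports them all out of the box with an API key" might look like in a harness, here is a minimal sketch. It is not any real harness's code and it does not call any real provider API: `SearchTool`, `SearchResult`, `fake_backend`, and the `<PROVIDER>_API_KEY` environment variable convention are all hypothetical names made up for illustration. It also shows a simple domain blocklist, similar in spirit to the "not accessible to our user agent" error quoted above.

```python
import os
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Optional
from urllib.parse import urlparse


@dataclass
class SearchResult:
    title: str
    url: str


# A backend is any callable taking (query, api_key) and returning results.
# Real provider clients would be wrapped to fit this shape (assumption).
Backend = Callable[[str, str], List[SearchResult]]


class SearchTool:
    """Hypothetical provider-agnostic search tool for an LLM harness."""

    def __init__(self, blocked_domains: Iterable[str] = ()) -> None:
        self._backends: Dict[str, Backend] = {}
        self._blocked = {d.lower() for d in blocked_domains}

    def register(self, name: str, backend: Backend) -> None:
        self._backends[name] = backend

    def _is_blocked(self, url: str) -> bool:
        # Match the domain itself and any subdomain of it.
        host = (urlparse(url).hostname or "").lower()
        return any(host == d or host.endswith("." + d) for d in self._blocked)

    def search(self, provider: str, query: str,
               api_key: Optional[str] = None) -> List[SearchResult]:
        if provider not in self._backends:
            raise KeyError(f"unknown provider: {provider}")
        # Hypothetical convention: fall back to e.g. FAKE_API_KEY in the env.
        key = api_key or os.environ.get(f"{provider.upper()}_API_KEY")
        if not key:
            raise RuntimeError(f"no API key configured for {provider}")
        results = self._backends[provider](query, key)
        return [r for r in results if not self._is_blocked(r.url)]


def fake_backend(query: str, api_key: str) -> List[SearchResult]:
    # Stand-in for a real provider client; returns canned results.
    return [
        SearchResult("Example", "https://example.com/?q=" + query),
        SearchResult("Thread", "https://www.reddit.com/r/test"),
    ]


tool = SearchTool(blocked_domains=["reddit.com"])
tool.register("fake", fake_backend)
results = tool.search("fake", "llm", api_key="k")
```

Keeping each provider behind the same `(query, api_key)` shape means swapping one for another is a one-line change, which matters if the terms quoted above are a dealbreaker for a particular provider.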
