karlis-repsons wrote:As I saw in my yacy page, there are only 50 senior peers, is that right? I think is very little, have you been thinking about what holds people from participating more actively? Maybe summarised results?
Noob here.
When I started to run yacy today, I wasn't expecting to see a peer count as low as 89. But rather unexpectedly, the 'freeworld' still works, albeit with none to very few results.
Within the recent climate -- where search engine giants aren't considered all benevolent entities anymore -- I got the impression that freeworld has a chance. Assuming it scales well.
Does it?- Even if it's java, there must be a lot of admins who would love to run yacy on their servers. I'll try to persuade a few. But current penetration is too low: they just haven't heard of yacy, or haven't thought of a reason to run it. However, there are lots of reasons!

- I haven't even downloaded the windows version, so this could be totally irrelevant: Yacy could be stripped down to something like the seti@home screen-saver. That would be a good marketing perspective to attract curious altruistics. Disk space is very cheap nowadays. Port forwarding could be optional. I don't think proxy cache is an issue, people would like to use yacy without it, or take the risk instead, which is little.
- I could be over-emphasizing this but yacy -- with a little bit larger freeworld -- is a viable option as open web search API. Developers wouldn't need to worry about a particular company's marketing strategies (like when Google discontinued their SOAP search API) and limitations (number of searches per day, limited api calls, etc).
(I guess this is a good point to mention the LACK OF DOCUMENTATION though.)These were the first things that came to my mind when I read your post. Surely none would be new ideas...
EDIT: Another thing tangentially related to the C++ talk: (And yes, what's up with these DHT people and their Javaphilia?) The most important issue about such a network seems to be persistency of information. I guess if data doesn't fall off the network that easily (i.e. more redundancy), freeworld would be much more usable. It should be fairly easy to implement a server with bare storage and DHT functionality (along with the search interface), with weak configurability, from scratch. Without crawling, memory footprint and network activity would be miniscule, and installation pretty easy. OTOH, while a standalone cache is possible, a standalone crawler isn't, so this might introduce additional development overhead. All in all, I think freeworld needs much more redundant storage than crawlers, as crawling is useless if the data falls off. Is this worth discussing?
(Another question would be about how yacy works, which I'm totally ignorant of: Would a totally redundant node disconnecting be harmful in any way? If not, a lot of desktop users, who are not using their computers 24/7 could run caching only reduncancy nodes to the benefit of the network.)