believe that i've seen versions of this attack where the spammer makes a few innocuous non-spam (but not contribution either) posts before going full LLM bot
A more critical weakness is that these accounts only posted obvious spam; they made no effort to build up a plausible persona. Generating plausible human posts is more difficult, but broadly feasible with current LLM technology.
i will have to screenshot next time and see what people think
obviously they could be hijacked accounts and i might just be extremely judgey of your average twitter flight account
sorry! the way you misunderstood my rough mechanics description there, yeah, that would be kinda of ridiculous
i was trying to describe two discreet methods i feel are imperfect, 1) blocking users and communities OR 2) get really good with content filters (far harder/ less viable at scale/ horse to water situation)
Nah, text extruders didn't seem impossible and their development has not been impressive considering the amount of hype and investment. They haven't become significantly more capable or suited to tasks, and the primary viable usecases for LLM are still tied to transcription, translation, and captioning.
There are not and will not ever be "AI Agents" based on LLM technology. They might make a shitty tool and call it "AI Agents" but it is not going to be this magic entity you are envisioning. I hope this is good news for you.
you say that’s prone to blocking people who are bashing AI though… if they spam AI content I imagine they tend to praise AI as well? Idk, maybe I didn’t understand you.
Exactly this. For example, CMU is currently under the extreme corrupting influence of Palantir. However when CMU pushes AI propaganda, the various outlets adjust their headlines to frame it for clicks, whether that be "AI is WRONG 60% of the time! That's why we need to use it anyway." or "AI is RIGHT 40% of the time! Does that mean it has a soul???"
These shitty articles then end up everywhere and CMU gets to spread their propaganda to people who have whatever range of opinions.
believe that i've seen versions of this attack where the spammer makes a few innocuous non-spam (but not contribution either) posts before going full LLM bot
i will have to screenshot next time and see what people think
obviously they could be hijacked accounts and i might just be extremely judgey of your average twitter flight account