Large Language Models shouldn’t offer opinions or advice.
If you asked a spokesperson from any Fortune 500 Company to list the benefits of genocide or give you the corporation's take on whether slavery was beneficial, they would most likely either refuse to comment or say "those things are evil; there are no benefits." However, Google has AI employees, SGE and Bard, who are more than happy to offer arguments in favor of these and other unambiguously wrong acts. If that's not bad enough, the company's bots are also willing to weigh in on controversial topics such as who goes to heaven and whether democracy or fascism is a better form of government.
Google SGE includes Hitler, Stalin and Mussolini on a list of "greatest" leaders and Hitler also makes its list of "most effective leaders."
Google Bard also gave a shocking answer when asked whether slavery was beneficial. It said "there is no easy answer to the question of whether slavery was beneficial," before going on to list both pros and cons.
Calling Mussolini a "great leader" isn't just immoral. It's also clearly incorrect for any reasonable definition of a great leader: he was in the losing side of a big war, if he won his ally would've backstabbed him, he failed to suppress internal resistance, the resistance got rid of him, his regime effectively died with him, with Italy becoming a democratic republic, the country was poorer due to the war... all that fascist babble about unity, expansion, order? He failed at it, hard.
On-topic: I believe that the main solution proposed by the article is unviable, as those large "language" models have a hard time sorting out deontic statements (opinion, advice, etc.) from epistemic statements. (Some people have it too, I'm aware.) At most they'd phrase opinions as if they were epistemic statements.
And the self-contradiction won't go away, at least not for LLMs. They don't model any sort of conceptualisation. They're also damn shitty at taking context into account, creating more contradictions out of nowhere because of that.
One of the worst rigid aspect of how the current LLM's are made is that they're also always "at your service", and will never say that you're in the wrong about a correction you make to them.
So either they're hard coded to avoid certain topics or they're susceptible, just tell them "uh, actually, Hitler was a great leader" and they'll go off listing why Hitler's so Great.
Bing is hard coded for dictators and will stop the conversation in the middle of a response. ChatGTP is also hard coded to never agree that suicidal thoughts are good, but resorts to ignoring the meaning of your response and just hallucinating some other question. The world would be simpler if they could outright say "That is misinformation". People deserve to be told off like that.