sorry, but i keep seeing posts of this nature so i need to clarify: we've known LMs are invertible for TWO YEARS. i showed this during my PhD. the quoted paper adds some sophisticated extensions, but "Language Model Inversion" (Morris et al., ICLR 2024) did it first :)
Alex Imas · Oct 29, 10:59
Holy s*&t. This paper is insane. You can recover input text from an LLM through inversion. Huge implications for how we understand these models, as well as for things like privacy.
- you can recover prompts from outputs alone, given enough sampling time
- you can recover them faster by binary-searching the API if it allows a 'logit bias' parameter (see the toy sketch below)
- there's a cool extension in (Finlayson et al., 2024): you can recover the *last layer of the model itself*
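to make the logit-bias point concrete, here's a minimal toy sketch of the binary search idea. everything is simulated: `FakeAPI` is a made-up stand-in for a greedy-decoding endpoint that only returns the argmax token but accepts a logit_bias dict, not any real client library. the point is just that biasing a token until it becomes the top choice tells you its logit relative to the original top token.

```python
import numpy as np

class FakeAPI:
    """Hides a logit vector; only reveals the argmax token id,
    like a greedy-decoding API that accepts a logit_bias parameter."""
    def __init__(self, logits):
        self._logits = np.asarray(logits, dtype=float)

    def top_token(self, logit_bias=None):
        biased = self._logits.copy()
        for tok, b in (logit_bias or {}).items():
            biased[tok] += b
        return int(np.argmax(biased))

def recover_relative_logit(api, target, lo=0.0, hi=100.0, iters=40):
    """Binary-search the smallest bias that makes `target` the argmax.
    That bias equals logit(top) - logit(target), so we recover the target's
    logit relative to the top token without ever seeing logprobs.
    `hi` just needs to exceed the largest plausible logit gap."""
    top = api.top_token()
    if target == top:
        return 0.0
    for _ in range(iters):
        mid = (lo + hi) / 2
        if api.top_token({target: mid}) == target:
            hi = mid  # bias was big enough: target wins, shrink upper bound
        else:
            lo = mid  # not enough yet, raise lower bound
    return -hi  # logit(target) - logit(top), up to search precision

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    true_logits = rng.normal(size=50)   # the "secret" next-token logits
    api = FakeAPI(true_logits)
    top = api.top_token()
    for tok in range(5):
        est = recover_relative_logit(api, tok)
        true_rel = true_logits[tok] - true_logits[top]
        print(f"token {tok}: estimated {est:.4f}, true {true_rel:.4f}")
```

repeat this over the vocabulary and you have the full (shifted) logit vector from argmax-only access, which is the building block the logprob-recovery and last-layer-extraction results rest on.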
Language Model Inversion
Logits of API-Protected LLMs Leak Proprietary Information