It looks to me like OpenAI's image pipeline takes an image as input, derives a semantic description of it, and then essentially regenerates an entirely new image from that description alone.
Even Sam Altman's "Ghiblified" Twitter avatar looks nothing like him (at least to me).
Other models seem much more able to operate directly on the input image.
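If that's right, the flow would look roughly like the two-stage sketch below. This is just an approximation using the public OpenAI Python SDK; the model names, the input URL, and the prompts are placeholders, and the actual internal pipeline obviously isn't public.

```python
from openai import OpenAI

client = OpenAI()

input_image_url = "https://example.com/avatar.jpg"  # hypothetical input image

# Stage 1 (hypothesized): reduce the input image to a text description.
caption = client.chat.completions.create(
    model="gpt-4o",  # placeholder model name
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe this photo in exhaustive detail."},
            {"type": "image_url", "image_url": {"url": input_image_url}},
        ],
    }],
).choices[0].message.content

# Stage 2 (hypothesized): regenerate from the description alone.
# The original pixels never reach the image model, which would explain
# why fine identity details (like a specific face) get lost.
result = client.images.generate(
    model="dall-e-3",  # placeholder; stands in for whatever backs the pipeline
    prompt=f"In the style of Studio Ghibli: {caption}",
)
print(result.data[0].url)
```

A description-bottlenecked pipeline like this would preserve whatever the caption captures (pose, clothing, background) while losing anything it doesn't, which matches the "looks nothing like him" observation.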