Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
ssnistfajen
on Sept 2, 2023
|
parent
|
context
|
favorite
| on:
A GPT-4 capability forecasting challenge
CJK characters are almost always split into multiple tokens per individual character. I'm not too familiar with Unicode mappings so it's interesting that the the outputs are still very coherent.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: