Supported Japanese sites
Each row lists a site's support status, supported domains, allowed operations, allowed outputs, and known limitations.
No affiliation: crawl-hub is not affiliated with or endorsed by any listed site, and it is not an official API for any of them. Support status describes crawl-hub behavior for public URL patterns only.
| Site | Status | Domains | Operations | Outputs | Known limitations |
|---|---|---|---|---|---|
| 2nd Street | beta | 2ndstreet.jp, www.2ndstreet.jp | auto, sold | html, json | P0 managed reuse EC candidate. |
| Amazon JP | beta | amazon.co.jp, www.amazon.co.jp | auto, sold | html, json | P0 managed candidate, but anti-bot volatility means the registry should emphasize unsupported/failure observation over guaranteed success. |
| At Cosme | beta | cosme.net, www.cosme.net | auto, sold | html, json | P0 managed parse-html candidate; direct fetch fixture and parser golden output required. |
| At Cosme LLM | beta | cosme.net, www.cosme.net | auto, sold | html, json | Datastructor parser variant promoted to beta for local validation; registry aliasing and expected schema still need normalization. |
| Axes | beta | axes-net.com, www.axes-net.com | auto, sold | html, json | P0 managed fashion/luxury EC candidate. |
| Beauty Hotpepper | beta | beauty.hotpepper.jp | auto, sold | html, json | P0 managed parse-html candidate for beauty/salon data. |
| Beauty Hotpepper JP | beta | beauty.hotpepper.jp | auto, sold | html, json | Parser variant from managed; keep separate until normalized into one site/work_type mapping. |
| Brandoff | beta | brandoff-store.com, www.brandoff-store.com | auto, sold | html, json | P0 reuse/luxury EC overlap candidate. |
| Daikokuya | beta | daikokuya78.com, www.daikokuya78.com | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Digimart | beta | digimart.net, www.digimart.net | auto, sold | html, json | P0 managed musical-instrument marketplace candidate. |
| Gallery Rare | beta | galleryrare.jp, www.galleryrare.jp | auto, sold | html, json | P0 reuse/luxury EC overlap candidate. |
| Ginzo | beta | ginzo.jp, www.ginzo.jp | auto, sold | html, json | P0 reuse/luxury EC overlap candidate. |
| Goo Net Car | beta | goo-net.com, www.goo-net.com | auto, sold | html, json | P0 managed parse-html candidate for used-car data. |
| Hedy | beta | hedy.jp, www.hedy.jp | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Hello Work | beta | hellowork.mhlw.go.jp | auto, sold | html, json | P0 managed public job-data candidate. Confirm acceptable request pattern before live tests. |
| Hojyokin Portal | beta | hojokin-portal.jp, www.hojyokin-portal.jp, hojyokin-portal.jp | auto, sold | html, json | P0 managed subsidy-data candidate. Domain spelling should be verified during registry import. |
| Houjin Goo | beta | houjin.goo.to | auto, sold | html, json | P0 managed company-data candidate; verify domain and representative URL before live tests. |
| Irbank Net | beta | irbank.net | auto, sold | html, json | P0 managed financial/company-data candidate. |
| Itownpage | beta | itp.ne.jp, www.itp.ne.jp | auto, sold | html, json | P0 managed local-business data candidate. Confirm privacy/compliance fit before broad enablement. |
| Kitamura Camera | beta | shop.kitamura.jp | auto, sold | html, json | P0 managed candidate. Internal helper dependencies must stay hidden behind managed fetch. |
| Komehyo | beta | komehyo.jp, www.komehyo.jp | auto, sold | html, json | P0 reuse/luxury EC overlap candidate. |
| Mandarake | beta | mandarake.co.jp, order.mandarake.co.jp | auto, sold | html, json | P0 managed collector/reuse EC candidate. |
| Maonline | beta | maonline.jp, www.maonline.jp | auto, sold | html, json | P0 managed business/news parser candidate. Avoid storing article body artifacts by default. |
| Mercari | beta | jp.mercari.com, mercari.com | auto, sold | html, json | P0 managed candidate promoted to beta for local live-route testing; signature/DPoP dependencies can still affect fetch reliability. |
| Monotaro | beta | monotaro.com, www.monotaro.com | auto, sold | html, json | P0 overlap candidate with industrial EC parser coverage. |
| Netmall | beta | netmall.hardoff.co.jp | auto, sold | html, json | P0 overlap candidate for reuse EC. Promote only after fixture tests cover sold/active item differences. |
| Okura | beta | okura-online.jp, www.okura-online.jp | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Rakuma | beta | fril.jp, item.fril.jp | auto, sold | html, json | P0 managed marketplace candidate. |
| Rakuten | beta | rakuten.co.jp, item.rakuten.co.jp | auto, sold | html, json | P0 managed EC candidate; fixture URL and golden JSON required before stable. |
| Reclo | beta | reclo.jp, www.reclo.jp | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Recruit Agent | beta | r-agent.com, www.r-agent.com | auto, sold | html, json | P0 managed job-data candidate. Confirm allowed modes and rate limits before live tests. |
| Retro | beta | retro.jp, www.retro.jp | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Shareview | beta | shareview.jp, www.shareview.jp | auto, sold | html, json | P0 managed review/product-data candidate. |
| Shimamura | beta | shop-shimamura.com, www.shop-shimamura.com | auto, sold | html, json | P0 managed retail parser candidate. |
| Snkrdunk | beta | snkrdunk.com, www.snkrdunk.com | auto, sold | html, json | P0 overlap candidate; route behavior should be observed before stable promotion. |
| Subsidy | beta | jgrants.go.jp, www.jgrants.go.jp | auto, sold | html, json | P0 managed subsidy-data candidate; verify canonical domain and schema. |
| Surugaya | beta | suruga-ya.jp, www.suruga-ya.jp | auto, sold | html, json | P0 overlap candidate with managed scrape and managed parse-html coverage. |
| Suumo | beta | suumo.jp | auto, sold | html, json | P0/P1 managed candidate. TLS may be enabled later by feature flag if direct fetch is unreliable. |
| Suumo Base | beta | suumo.jp | auto, sold | html, json | Parser base/alias row from managed promoted to beta for local validation; work_type mapping still needs normalization. |
| Suumo Buy New Apartment | beta | suumo.jp | auto, sold | html, json | Datastructor SUUMO new-apartment parser candidate. |
| Suumo Buy New Apartment Child | beta | suumo.jp | auto, sold | html, json | Child/detail parser variant promoted to beta for local validation; schema and parent mapping still need finalization. |
| Suumo Rental | beta | suumo.jp | auto, sold | html, json | Datastructor SUUMO rental parser candidate. |
| Tamtam | beta | hs-tamtam.co.jp, www.hs-tamtam.co.jp | auto, sold | html, json | P0 managed retail/hobby parser candidate. |
| Techno Aid | beta | techno-aids.or.jp, www.techno-aids.or.jp | auto, sold | html, json | P0 managed public/assistive-device data candidate. |
| Townwork | beta | townwork.net, www.townwork.net | auto, sold | html, json | P0 managed job-data candidate. Confirm live-test policy and rate limits before enabling release gating. |
| Trefac | beta | trefac.jp, ec.treasure-f.com | auto, sold | html, json | P0 overlap candidate. Datastructor parser name may be trefac_online; normalize aliases in registry import. |
| Ueda | beta | ueda78.com, www.ueda78.com | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Unsupported Playwright Required | disabled | example-playwright-required.invalid | auto, sold | html | Sentinel row for unsupported tests. Phase 1 must return explicit unsupported/disabled behavior for sites requiring browser execution. |
| Unsupported Scraperapi Required | disabled | example-scraperapi-required.invalid | auto, sold | html | Sentinel row for unsupported tests. Phase 1 must not silently route to ScraperAPI/proxy providers. |
| Vector | beta | vector-park.jp, www.vector-park.jp | auto, sold | html, json | P0 candidate, but the technical architecture calls out a possible internal Lambda/helper dependency. Keep the external route normalized. |
| Yahoo Auction | beta | auctions.yahoo.co.jp, page.auctions.yahoo.co.jp | auto, sold | html, json | P0 candidate. Any Lambda/proxy helper dependency must remain internal to managed fetch and must not be exposed as a route. |
| Yahoo Fleamarket | beta | paypayfleamarket.yahoo.co.jp | auto, sold | html, json | P0 managed marketplace candidate; fixture URL and parser golden output still required. |
| Yahoo Shopping | beta | shopping.yahoo.co.jp, store.shopping.yahoo.co.jp | auto, sold | html, json | P0 managed EC candidate; use registry observation before promising broad Yahoo Shopping coverage. |
| Yodobashi | beta | yodobashi.com, www.yodobashi.com | auto, sold | html, json | P0 overlap candidate with managed parser coverage. |
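
The table above can be read as a lookup: match a URL's host against the Domains column, then check the row's status, operations, and outputs. The following is a minimal Python sketch of that reading, not crawl-hub's actual implementation. The names `SiteRow`, `ROWS`, `find_site`, and `check_request` are hypothetical, as are the returned message strings. Only three rows are copied in for illustration, and exact host matching is an assumption.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlsplit


@dataclass(frozen=True)
class SiteRow:
    """One row of the table above (hypothetical in-memory shape)."""
    name: str
    status: str                  # "beta" or "disabled", as in the Status column
    domains: tuple[str, ...]     # Domains column
    operations: tuple[str, ...]  # Operations column
    outputs: tuple[str, ...]     # Outputs column


# A few rows copied from the table; a real registry would hold all of them.
ROWS = (
    SiteRow("Mercari", "beta", ("jp.mercari.com", "mercari.com"),
            ("auto", "sold"), ("html", "json")),
    SiteRow("Kitamura Camera", "beta", ("shop.kitamura.jp",),
            ("auto", "sold"), ("html", "json")),
    SiteRow("Unsupported Playwright Required", "disabled",
            ("example-playwright-required.invalid",),
            ("auto", "sold"), ("html",)),
)


def find_site(url: str, rows=ROWS) -> Optional[SiteRow]:
    """Return the first row whose Domains column lists the URL's host exactly."""
    host = (urlsplit(url).hostname or "").lower()
    for row in rows:
        if host in row.domains:
            return row
    return None


def check_request(url: str, operation: str, output: str, rows=ROWS) -> str:
    """Describe whether a request fits the table, never failing silently."""
    row = find_site(url, rows)
    if row is None:
        return "unsupported: host not listed"
    if row.status == "disabled":
        # Sentinel rows must yield an explicit disabled result.
        return f"disabled: {row.name}"
    if operation not in row.operations:
        return f"unsupported operation: {operation}"
    if output not in row.outputs:
        return f"unsupported output: {output}"
    return f"ok: {row.name} ({row.status})"


print(check_request("https://jp.mercari.com/item/m123", "sold", "json"))
print(check_request("https://example-playwright-required.invalid/x", "auto", "html"))
print(check_request("https://example.com/", "auto", "html"))
```

Several rows share a domain (for example, five rows list suumo.jp), so a host match alone does not pick a parser variant. This sketch simply returns the first matching row, and a real registry would need an extra key, such as the work_type mapping mentioned in the limitations.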