← Back to context

Comment by efromvt

22 days ago

Yeah I have a 'species' info table that's built by curating wikipedia and a few other sources and passing them through a structured LLM pipeline; ecological benefit; blooming season; native regions, etc. This is very much a 'rough cut' at the moment; I want to put more quality gates and evals in it. If you're interested in collaborating all the raw parquet datasets I have are in a public GCS bucket - happy to have them pulled in anywhere else!

DBH I'm doing for the "size" right now, though I'd love to figure out how to get canopy shape/size as well, and height where possible. (and then maybe proxy height a species level from DBH, since that's more common).

(apologies for belated response)