26 comments

  • mchinen 10 hours ago

    Cool to see this from Brian Hie, who was doing interesting computational bio research at Meta's FAIR before they axed it. Interesting that this is work on the more physical/testing/manufacturing level than the computational, but it seems very useful.

    It's hard to quantify the impact of new foundational tools like this at launch. Most of the time it falls flat, but even the successes are difficult. For example, CRISPR has led to interesting experiments and treatments on the way, but the effect does feel muted compared to the initial predictions. But there are many other related techniques that can be pulled out of this original research (e.g. dCas9 which lets you operate without cutting).

    Similar story with cellular reprogramming.

    Eventually one of these things will surface that will be GPU/transistor type innovations.

    • dsign 9 hours ago

      > but even the successes are difficult.

      Yeah, it feels like we need a phase transition in the speed and practicality of the process. But I don't believe we need a single concrete lab tech.

      Years ago when I did research, my impression was that there was complexity galore. A researcher on Drosophila developmental signaling would have a very disjoint knowledge domain than that of a researcher in horizontal gene transfer and antibiotic resistance. Both would exist in a different planet altogether than a clinician prescribing a cancer treatment. And the three of them would generally lack the tooling that somebody doing systems biology was used to.

      So, to me, the key thing we need is some sort of "domain cement", or a good way to pull operative knowledge and usable skills from everywhere.

      • fc417fc802 9 hours ago

        > the key thing we need is some sort of "domain cement", or a good way to pull operative knowledge and usable skills from everywhere.

        Isn't that what LLMs are shaping up to be? Once we manage to divorce the knowledge from the weights in some way we could have in effect a frontier model whose awareness was limited to the sum total of the scientific literature.

    • LarsDu88 26 minutes ago

      I attended a talk by Brian at Stanford, and I asked why not just use Gibson assembly to stitch together 5 kb synthesized strands from a company like Twist Biosciences.

      The answer I got was along the lines that they were simply going to get around to do the actual lab work at some point.

      DNA synthesis technology hasn't really been a blocker for generative bio projects except at the full chromosome level.

      And I think simply generating a full chromosome and booting it up without doing due diligence is probably a recipe for disaster.

      We honestly aren't that far away from AI slop enzymes, AI slop ligases, and eventually AI slop bio weapons...

    • BigTTYGothGF 6 hours ago

      > Eventually one of these things will surface that will be GPU/transistor type innovations

      Why do you think that?

      • mchinen 3 hours ago

        I just meant a big innovation that reshapes everything. I should have used 'level' instead of 'type' here.

        But there are a lot of analogies to computation in bio as a physical, atomic forces-driven, massively parallel computer, so it's possible there will be something related to electronics and computers that falls out. For example, there's also applications directly related to other fields including DNA storage of data and neuron-based computation.

  • Tade0 6 hours ago

    > Sequences of that length can encode entire biochemical pathways, laying the groundwork for engineered microbes that manufacture drugs, biofuels, or specialty chemicals, and eventually to the assembly of vast DNA constructs approaching complete artificial genomes.

    Never mind artificial genomes - let me have a snapshot of my DNA sequenced and re-created from scratch say 20 years later - telomeres and all.

    • vintermann 5 hours ago

      What if the new one doesn't like you?

      • mattbettinson 28 minutes ago

        Just need the organs!

      • clark_dent 5 hours ago

        Exactly like the old one!

      • Tade0 3 hours ago

        Then it's back to the drawing board of course.

  • dnautics 2 hours ago

    cool, but i dont feel like dna synthesis is a rate limiting step. unless im missing something, its sort of a micro-optimization that is likely to cost more since you still need an oligo synthesizer (and you will need to write out more bases)

  • bonsai_spool 7 hours ago

    This is not a practical challenge - I order DNA from Twist at these ‘large’ scales trivially without needing to do oligo hybridization magic. The DNA arrives in a month - but considering how many oligos sidewinder calls for, not clear how they could be faster.

    • cge 5 hours ago

      At a basic level, methods of combining oligos to produce long strands have been known for ages. The challenge is to be able to produce them with low enough error, high enough yield, and enough freedom on sequence. Low error improves your yield, reduces the amount of purification and amplification needed, and lets you make longer strands. Sequence constraints can be significant, too, especially around repeats.

      If you're talking about Twist's gene fragment product, they advertise that as maxing out at 5 kb. Most, if not essentially all, of that month delivery time is likely the combination, not the oligo pool production. I think the Sidewinder people are actually using Twist pools; they're doing up to 12.5 kb.

      By comparison, we recently needed something in the 20 kb range, with a not-so-great sequence, and it was a multi-month process to have a company produce it.

      • amirhirsch an hour ago

        https://ansabio.com/ advertising 50 kb

        • LarsDu88 25 minutes ago

          How is ANSA bio? I just got hit up by a recruiter for this company at random, and now I see them mentioned on Hackernews

      • dnautics 2 hours ago

        > not-so-great sequence

        are the limits on not-so-greatness for sidewinder known?

      • kardianos 5 hours ago

        Yes. Whole genome sequencing has... some limits. CYP2D6 for instance is an important gene address, yet is rather hard to sequence do to its many copies and minor mutations. If you don't use targeted copy callers, it can be hard to correctly sequence in WGS.

        • bonsai_spool 5 hours ago

          > Yes. Whole genome sequencing has

          We're speaking about gene synthesis, not about DNA sequencing

    • dnautics 2 hours ago

      12kb, you can get three gblocks in a week and change and have them assembled in two days.

  • bottlepalm 10 hours ago

    I can't be the only one reading this who doesn't have alarm bells going off in their heads.

  • the__alchemist 3 hours ago

    Bad-ass! Getting small seqs "oglios" is cheap/fast. (as stated in the article) (IDT or Sigma will courier it to you same or next day, and is ~$10 per) But longer ones is much more expensive!

    So... is the "Generative AI" tie in (Mentioned many times, starting at the top of the article) used mainly get funding and press? The core part of this seems to have nothing to do with AI. I am sus this is bullshit marketing based on #1: How many times I've clicked an article, and have AI blasted all over it for sus reasons, and #2: Being able to cheaply and reliably synthesis custom DNA seqs longer than a few hundred/thousand bps is a broadly useful tech for current and future applications.

    So: #1: This is really good news. #2: Do better with the hype/bull. It undermines your credibility. So now I start questioning whether this works as well as advertised, and what else they are being shady about.

    • biophysboy 2 hours ago

      I used to order $10 oligos all the time. The extra length is cool, but I was honestly more impressed by the claims on sequence accuracy in the article. Even a single base pair change can affect the genetics and physics of DNA.

      I broadly agree with you on the AI hyping. Data quality and quantity is not high enough in my opinion.

  • vi_sextus_vi 6 hours ago

    Nobel for Seeman and Guo!

  • shevy-java 10 hours ago

    > that predictive models are now producing faster than anyone can construct them.

    Erm ... you have A T C G. You can have a gazillion of combinations there.

    Of course BY DEFAULT it will always be slower than ANY combination you would desire to have - and you most definitely do not need AI slop to have that either. Do we need AI slop for generating any permutation of those 4 letters now? So what is the point of stating "can construct".

    IF the synthesis method works, then that is the focus to be debated, not the AI slop is our master-thinker now.

    > “We really want this to be an enabling platform,” says Robinson. “We want people to do cool things with the technology.”

    And I think they patented this (if it really works), so ... enabling platform, right.

    Interestingly the article omits many key questions to be asked here. If the method already works as-is, why isn't everyone using it? If it is cheaper and faster, then logically it would already be used or usable.

    • ben_w 4 hours ago

      > > that predictive models are now producing faster than anyone can construct them.

      > Erm ... you have A T C G. You can have a gazillion of combinations there.

      > Of course BY DEFAULT it will always be slower than ANY combination you would desire to have - and you most definitely do not need AI slop to have that either. Do we need AI slop for generating any permutation of those 4 letters now? So what is the point of stating "can construct".

      The bit right before your quote says why:

        giving scientists a fast, affordable, and accurate way to physically build the novel genetic sequences that predictive models are now producing faster than anyone can construct them.
      
      Also, predictive models is broader than Transformers, but even then Transformers in the context of DNA is somewhat different from the context of natural (or even programming) languages; and even more than that, given how effective even mediocre early models were for code not useful to dismiss all of it even when it is definitely "slop" in other domains: https://www.nature.com/articles/s41592-024-02523-z