There are several questions similar to mine but none are satisfactory. I would like to be able to remove duplicates from a DB having more than 26 million lines, therefore in the most efficient way with the condition that I want to keep the row which has the most recent update date (update_date is a field).

Is it possible to use the Django ORM or do we have to use SQL in that case?

Thank you in avance for your help.


Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge that you have read and understand our privacy policy and code of conduct.

Browse other questions tagged or ask your own question.