Leetcode|Repeated DNA Sequences
来源:互联网 发布:linux 查看关机原因 编辑:程序博客网 时间:2024/05/05 12:38
All DNA is composed of a series of nucleotides abbreviated as A, C, G, and T, for example: "ACGAATTCCG". When studying DNA, it is sometimes useful to identify repeated sequences within the DNA.
Write a function to find all the 10-letter-long sequences (substrings) that occur more than once in a DNA molecule.
For example,
Given s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT",Return:["AAAAACCCCC", "CCCCCAAAAA"].解法1:暴力法(Memory Limit Exceeded)从头到尾依次查询,借助map统计出现次数。大于1的就算
vector<string> findRepeatedDnaSequences(string s) { map<string,int> key; vector<string> res; if(s.size()<=10) return res; for(int i=0;i<=s.size()-10;i++) { string tmp=s.substr(i,10); key[tmp]++; if(key[tmp]==2) res.push_back(tmp); } return res; }解法2:存储太多字符串会导致memory过大,因为字符只有四种,这个好办了,把字符串表示成4进制的数字就OK了。(72ms)
int ACGT2INT(char c){ switch(c) { case 'A': return 0; case 'C': return 1; case 'G': return 2; case 'T': return 3; } return -1;}int DNA2INT(string& m){ const int MAX=10; int res=0; for(int i=0;i<MAX;i++) { res=res*4+ACGT2INT(m[i]); } return res;}vector<string> findRepeatedDnaSequences(string s) { const int N=1048576; int key[N]; memset(key,0,sizeof(key)); vector<string> res; if(s.size()<=10) return res; for(int i=0;i<=s.size()-10;i++) { string tmp=s.substr(i,10); key[DNA2INT(tmp)]++; if(key[DNA2INT(tmp)]==2) res.push_back(tmp); } return res; }注意这里我用了一个数组来记录出现次数,
const int N=1048576;//因为四进制的10位数,最大值不会超过1024^2 int key[N]; memset(key,0,sizeof(key));但是如果把这些换成unordered_map<int,int> key; 运行时间为150ms左右。(leetcode 30个例子测试时间)
如果换成map<int,int> key;测试时间为280ms。
所以可以看出数组和map还有unordered_map的效率问题。
能不用后两者的就用数组记录hash情况。
0 0
- Leetcode Repeated DNA Sequences
- Repeated DNA Sequences [leetcode]
- [LeetCode] Repeated DNA Sequences
- Leetcode Repeated DNA Sequences
- Leetcode:Repeated DNA Sequences
- Leetcode: Repeated DNA Sequences
- LeetCode: Repeated DNA Sequences
- LeetCode: Repeated DNA Sequences
- LeetCode Repeated DNA Sequences
- LeetCode--Repeated DNA Sequences
- [LeetCode]Repeated DNA Sequences
- [Leetcode]Repeated DNA Sequences
- [leetcode]Repeated DNA Sequences
- Repeated DNA Sequences - LeetCode
- Leetcode: Repeated DNA Sequences
- Leetcode:Repeated DNA Sequences
- leetcode:Repeated DNA Sequences
- LeetCode - Repeated DNA Sequences
- (5)风色从零单排《C++ Primer》 const,typedef,auto,decltype
- C语言学习笔记 之 结构体指针变量
- 理解Java中的String数据类型
- GlusterFS集群文件系统研究
- linux线程间同步(1)读写锁
- Leetcode|Repeated DNA Sequences
- 通过分析 JDK 源代码研究 TreeMap 红黑树算法实现
- 程序员常去的14个顶级开发社区
- UI界面并不是只有白色设计才叫极简设计
- Fatal signal 11 (SIGSEGV)问题解决
- 有关宏定义的经验与技巧-简化代码-增强Log
- Design Patterns Elements of Reusable Object-Oriented Software
- SSL协议详解
- 反射型XSS绕过案例总结